Archive for category Administrative

An Australian sport wiki

Posted by Laura on Monday, 11 October, 2010

I’m currently toying around with the idea of creating a large Australian sport wiki. I’ve had a lot of data for a while to allow me to do this. Some of the data I have includes:

  • A list of Australian sport teams.  List is about 3,000 teams long.  It often contains some other information like when teams were founded, team colors, etc.  This can be combined with a list of Australian sport teams on Twitter and Facebook.
  • A list of Australian sport venues.  This can be merged with a list of Australian sport venues on Gowalla and Foursquare.
  • A list of Australian athletes on Twitter and Facebook.  This is probably about 100 large.
  • A list of Australian sport organizations on Twitter and Facebook.  This is probably about 50 large.
  • A list of Australian sport fans on Twitter, Facebook, LiveJournal and its clones, bebo, orkut, Wikipedia.  This list is probably close to about 20,000 long.

This data could easily be supplemented with the following information that would be easily mineable:

  • A list of AFL, NRL and other Australian professional athletes sorted by team.
  • A list of other Australian athletes.
  • A list of Australian sport organizations.
  • A list of Australian sport journalists and broadcasters.
  • A list of Australian sport websites.

This list could be further supplemented by:

  • Importing Australian sport pictures from Flickr.
  • Importing Australian sport videos from YouTube.

The quality of the wiki wouldn’t necessarily be all that high.  Most of the initial content would be stub.  Still, if I had a bot programmer and I was willing to devote a month to the project, it would be reasonably comprehensive in terms of initial content for others to use to build from.  It would possibly help in terms of developing a large idea of the scope of sport in Australia by giving an idea as to the size of the fan community, the depth of the interest in various sports, the number of teams, when those teams were created and folded, where sports are more popular than others.  (Though that all depends on the data mining that goes into the initial content.)  What keeps me from doing this is the fact that I feel like it might be a time sink of a month to create this content and I’m not sure if the time sink is worth the effort.  Basically, I need external motivation.

Related Posts:

Twitter Weekly Updates for 2010-10-10

Posted by Laura on Sunday, 10 October, 2010

Powered by Twitter Tools

Related Posts:

Expanded Profile of Australian En-WP users

Posted by Laura on Sunday, 10 October, 2010

My dissertation topic involves doing a demographic and geographic study of Australian sport fandom online. There are several sites and social networks where you can get publicly available demographic data to begin to formulate a picture of the user population, and then segment that population out by interest in a league, sport and athlete. I’ve spent a lot of time looking at Twitter, Facebook and LiveJournal. Recently, partly because of a trip to the Wikimedia Foundation and discussions with a few people at UCNISS, my interest in who was contributing to Australian sport wiki articles on Wikipedia increased.

Finding out who edited Wikipedia articles using publicly available information is a bit of a challenge. The most reliable information for who edited comes from IP address information. IP addresses can provide an idea as to the geographic location of the contributor. It is easy enough, with the help of a friend, to create a tool that pull the history of a Wikipedia article, get a list of IP addresses that edited the article, feed the IP address into another tool that will pull up the general location of the contributor. (One of my favorite visualizations of this type of information is WikipediaVision.) The data isn’t always accurate and if I was looking primarily at New Zealand, a country without its own dedicated IP address range, this would be even less reliable. Still, for my purposes, this data works pretty well.

This data is still pretty limited. There are a lot of articles that are edited by non-anonymous users. Sometimes, it is possible to get demographic and geographic information about Wikipedia contributors by viewing their profile pages. This can just be time consuming to do manually if an article has a large number of contributors as you need to view a lot of user pages. It becomes a deterrence for trying to collect geographic information about article contributors.

I was looking for a more time effective and accurate method of collecting geographic and demographic information about contributors that is publicly available on their user pages. The easiest and quickest way to get this information on a mass scale is to utilize user box information. Many user boxes, when included on a user page, put the user into a category. These categories are often then linked through the Wikipedian category structure. Beyond that, user boxes involve templates. It is easy to get a list of articles (user pages) that the template is included on.

The methodology that I selected from this point is rather straightforward. It involved:

1. Select a category.
2. Copy and paste the list of articles (user pages) in the selected category to an Excel spreadsheet. Sort the list alphabetically. Copy and paste only the user pages to Notepad. Replace * User with blank. Copy and paste this list back to Excel.
3. Create a filter where the cell contains / . Select those cells. Copy them to notepad, replace / with [tab] in order to remove user subpages from the list. Copy this back to Excel. Select only the column with usernames.
4. Run an advance filter in order to remove all duplicate rows.
5. Copy this list back to the dedicated spreadsheet. Label all those users with the category from which they were pulled in a unique column.
6. Repeat steps 1 to 5 until all the categories that you want to have included are included.
7. Merge/Group all the rows by username.

This method may not be the most efficient way of going about doing this. It can probably be improved by automating some of these steps. In my case, step 7 was not able to be completed using Excel. I had to e-mail the file to @woganmay, who I believe converted the file to a mySQL database, used the group feature, converted the results back to csv and e-mailed the file to me.

In my case, I did not complete this for every category. Some categories did not seem worth it time wise as they had too few user pages to be included. In other cases, the categories were just too big to do. This included all the members of User de, User en, User es, User fr, User it, User jp. Only a selected number of categories were included because of time constraints. Data gathering was focused on categories that I perceived would have the greatest number of Australians and other possible contributors to Australian related articles. When these categories were more exhausted, categories with between 1,00 and 5,000 articles were selected.

There are all sort of limitations to this data. First, not everyone includes userboxes on their profile pages. This means that there could be a lot more Australians on Wikipedia than indicated by userbox inclusion on a user page. The assumption for the resulting data is that proportional representation exists for various categories. So while there are X amount of Christians and Y amount of Atheists, the assumption that the relationship between X and Y will always be proportional to the actual population on Wikipedia. Whatever data is available thus has to be viewed as good enough or supplemented by going to individual user ages to see if other information is available when a user appears where no information for someone when running against the history of the article.

Second, even when they do exist, there are often useful pieces of information that are missing. For example, in an Australian context, there is a userbox for Rugby League fans. There is not however a userbox for Australian rules footy fans. There are also not user boxes and categories for fans of NRL or AFL teams. (This type of user box and category exists for National Hockey League teams.)

About halfway through this process, I realized that this data could be useful for analysis beyond who is editing Wikipedia. At the moment, I’ve only totaled data I have for Australians. It is pretty fascinating and would be neat to go further with: How does the proportional size of the Australian Wikipedian population compare against the actual population? Does the size of the Australian Atheist versus Christiah community actively reflect the proportions in Australian society? Or is the Australian Wikipedian community demographically distinct from the greater population?

The following tables include the data based on people who were included in Wikipedians in Australia and its subcategories and Australian Wikipedians. A copy of the raw data can be found at October 9 – Wikipedia English Data – Australians.xls. The data is provided without comment though any attempts at explaining the patterns found are very much appreciated.

Country Count
Bangladesh 3
Canada 2
Egypt 2
India 1
Indonesia 2
Ireland 3
Jamaica 2
Japan 5
New Zealand 17
Papua New Guinea 1
Republic of Ireland 5
Singapore 5
South Africa 2
South Korea 1
Sri Lanka 2
Tanzania 2
Turkey 2
United States 16
State Count
Australian Capital Territory 89
Canterbury 1
New South Wales 345
Northern Territory 5
Otago 1
Queensland 208
South Australia 144
Southland 1
Tasmania 54
Victoria 370
Wellington 2
Western Australia 145
Degree Count
BA degrees 21
BCom degrees 2
BCS degrees 3
BE degrees 18
BMus degrees 1
BS degrees 41
MS degrees 5
PhD degrees 18
University/Alma Mater Count
Australian National University 14
Avondale College 1
Charles Sturt University 1
Curtin University of Technology 7
Deakin University 6
Flinders University 7
Griffith University 1
James Cook University 2
La Trobe University 2
Macquarie University 5
Massey University 1
Monash University 19
Royal Melbourne Institute of Technology 10
University of Adelaide 4
University of Alberta 1
University of Canberra 3
University of Melbourne 21
University of New England 4
University of New South Wales 24
University of Newcastle 8
University of Sydney 16
University of Tasmania 3
University of Technology, Sydney 4
University of Western Australia 11
University of Wollongong 4
Victorian College of the Arts 1
Student type Count
Business students 3
College students 26
Law students 9
Medical students 8
University students 59
Website Count
Open Directory Project 1
OpenStreetMap 2
Wookieepedia 1
Religion Count
Anglican and Episcopalian 8
Antitheist 3
Atheist 97
Buddhist 13
Catholic 7
Christian 47
Eastern Orthodox 2
Hindu 1
Jewish 4
Lutheran 1
Methodist 2
Muslim 4
Non-denominational Christian 2
Objectivist 2
Pastafarian 17
Presbyterian 3
Protestant 11
Roman Catholic 10
Ethnicity and nationality Count
Argentine 2
Bangladeshi 2
British 3
English 10
Latino/Hispanic 1
Skill Count
Aircraft pilots 5
Artists 3
Engineers 17
Filmmakers 17
Homebrewers 10
Mechanical engineers 1
Professional writers 1
Surfers 2
Profession Count
Accountants 2
Actor 5
Actuaries 2
Aircraft pilots 5
Biologist 9
Broadcasters 5
Chemist 6
Composers 28
Computer scientists 7
Engineers 17
Filmmakers 17
Geoscientists 2
Mechanical engineers 1
Scientists 7
Teacher 18
University teacher 4
Web designers 2
Web developers 1
Interest Count
Chemistry 27
Cooking 1
Physics 34
Strings (physics) 6
Sports Count
Cavers 2
Cross-country runners 4
Dancers 3
Detroit Red Wings fans 2
Equestrians 2
Fencers 2
Geocachers 8
Hikers 2
Hunters 7
Outdoor pursuits 2
Rugby league fans 50
Runners 2
Sailing 1
Scuba divers 8
Snowboarders 2
Swimmers 16
Swing dancers 1
Toronto Maple Leafs fans 1
Ultimate Fighting Championship fans 2
Vancouver Canucks fans 3
WikiProject Tennis members 4
Wikipedia Status Count
Administrator hopefuls 41
Administrators 45
Administrators who will provide copies of deleted articles 11
Bureaucrats 1
Contribute to Wikimedia Commons 1
Create userboxes 3
Opted out of automatic signing 4
Reviewers 10
Rollbackers 27
Service Award Level 01 12
Service Award Level 02 14
Service Award Level 03 10
Service Award Level 04 5
Service Award Level 05 6
Service Award Level 06 9
Service Award Level 07 11
Service Award Level 08 3
Service Award Level 09 2
Wikimedia Commons administrators 2
Philosophy Count
Hindu 1
Humanist 6
Materialist 9
Pastafarian 16
Theist 9

Related Posts:

Methodology: Draft/Free writing (part 5)

Posted by Laura on Tuesday, 5 October, 2010

This feels mostly finished in terms of content to add to it. I need to get feedback from my supervisors regarding what is missing or what could be improved. (I know it isn’t perfect. I just wanted it written so I could get on with editing it and realizing where the problems are.) Any and all feedback is appreciated.



Methodology


Types of Social Media Research

When conducting social media research, there are ten general methods that can be used to gather and analyze data. These are:

  1. Individual case studies for how a business uses social media and the web;
  2. Search and traffic analytics analysis;
  3. Sentiment analysis and reputation management;
  4. Content analysis;
  5. Usability studies;
  6. Interaction and collaboration analysis;
  7. Relationship analysis to try to determine how people interact and to identify key influencers;
  8. Population/demographic studies;
  9. Online target analysis of behavior and psychographics; and
  10. Predictive analysis.

Each of these methods offers insights into various aspects of the web and its population. The type of analysis used is often specific to the purpose of the research, involved blended approaches from traditional analysis types, and different methods are often used in conjunction with each other. These methods often blend quantitative and qualitative analysis. Choosing the correct method of gathering analyzing data can be one of the biggest hurdles for being able to measure ROI and understand how a community works.

This section will provide a brief summary of each type, explain how to conduct this type of research and give examples that used that methodology.

Individual case studies for how a business uses social media and the web.

Case studies on social media usage are often done to measure the effectiveness of specific actions taken by an organization.

Bronwyn et al. (2005) say case studies “typically examine the interplay of all variables in order to provide as complete an understanding of an event or situation as possible. This type of comprehensive understanding is arrived at through a process known as thick description, which involves an in-depth description of the entity being evaluated, the circumstances under which it is used, the characteristics of the people involved in it, and the nature of the community in which it is located.”

This methodology often incorporates components of all the other methods discussed in this section. The specific methods often depend on the goals of the person or organization conducting the case study.

Vincenzini (2010) did a case study regarding the use of the social media by the NBA in an attempt to define why they have been successful in using it to promote the league. The author used quantitative analysis to measure the size of the community, the volume of content they were viewing on sites like YouTube and the volume of content they were creating on sites like Twitter. The quantitative analysis was synthesized with explanations from NBA employees to explain their practices in the context of their own business decisions as they pertained to social media. This was followed up with an explanation as to what worked and what did not worked and offered advise for others involved with sport and social media to help them leverage their own position.

Case studies are a mixed methodology approach, borrowing from other approaches. The major difference is that the case study focuses on a narrower perspective with the goal of tracking behavioral changes, or in advising others on how an organization changed practices and how those lessons can be applied elsewhere.

Search and traffic analytics analysis.

Search engine and traffic analytics generally is done internally to determine how to optimize a site in order to increase the amount of visitors a site gets and the total number of pages that they view. This method involves identifying how people arrive at a specific site and the pages they visit while at the site. Traffic analytics analysis often includes six different components: Search engine visitors, paid search advertisements, pay per click, organic traffic, direct traffic and internal site traffic.

Ramos and Cota (2009) define traffic analytics as “Tools that analyze and compare customer activity in order to make business decisions and increase sales. Analytics tools can report the number of conversions, the keywords that brought conversions, the sites that sent converting traffic, conversion by campaign, and so on.”

There are a number of different methods and tools that allow for this type of analysis. Early in the history of the Internet, one of the most popular tools and methods involved analyzing web server log files. (Jansen 2009) Another popular early method of analysis was page tagging, which involved embedding an invisible image on a page, which, when the image is triggered, “triggers JavaScript to send information about the page and the user back to a remote server.” (Jansen 2009) These earlier tools have advanced a bit and now include tools like Quantcast and Google Analytics. Kaushik (2010) recommends Google Analytics, a free tool that involves putting a bit of code on all pages of a site. Kaushik (2010) points out that various types of traffic analysis can be done using the various tools provided by Google Analytics. The author claims that Google Analytics allows you to break the analysis down into “three important pieces: campaign response, website behavior, and business outcomes.” (Kaushik 2010)

Fang (2007) completed a case study at the Rutgers-Newark Law Library in order to track library website usage, track visitor behavior and determine how to improve the website to better serve users. Earlier work done by the library had involved surveys handed out to patrons, analysis of log files, and the use of counters. (Fang 2007) The author changed methods because of some inherent flaws in using those methods to analyze website needs. They used Google Analytics in order to track user activity on the library’s website. The library “found out how many users were accurately following the path we had designed to reach a target page.” (Fang 2007) This sort of path following navigation was one of the goals they had when they designed their site. They also found out that “Visitor Segmentation showed that 83% of visitors were coming from the United States. About 50% of U.S. visitors were from New Jersey, and 76% of these were from Belleville and Newark. These results matched our predictions for patrons’ geographical patterns.” (Fang 2007) The results of this analysis enabled the library to make changes to improve their website.

This type of methodology lends itself more to a case study approach and often requires the consent of the website involved in order to get private logs. It can be used in conjunction with other methods, but should be used in a more targeted analysis of highly specific research areas.

Sentiment analysis and reputation management.

Sentiment analysis involves identifying content related to a topic and identifying the emotion connected to that content. In a sport context, sentiment analysis could involve determining if newspapers are providing positive or negative coverage of a team. In a social media context, sentiment analysis would involve determining attitudes being expressed on Twitter in individual tweets. Reputation management goes one further: After sentiment has been determined, a decision needs to be made on if and how negative and positive sentiment content should be responded to. Sentiment analysis is passive analysis where non-stakeholders can conduct analysis. Reputation management is active analysis that is primarily conducted by stakeholders as part of on going activities to improve a brand, be it personal or corporate.

While sentiment analysis and reputation management are similar in their desire to monitor a response to a situation, the tools available vary differently for each type. There a variety of different tools for sentiment analysis. One of the tools for conducting sentiment analysis are freely available lists of words “that evoke positive or negative associations.” (Wanner et al. 2009) Sterne (2010) suggests that content being ReTweeted on Twitter can be seen as a tool to measure positive sentiment. Sterne (2010) suggests that the ratio of follows/followers is not an effective tool for measuring sentiment on Twitter. Reputation management tools include Trackur. It allows you to “set up searchers and the system automatically monitors the Web for key words that appear on news sites, blogs, and other social media.” (Weber 2009)

Wanner et al. (2009) did a sentiment analysis of RSS feeds that focused on the 2008 United States presidential elections. They selected 50 feeds connected to the elections and collected updates to these feeds every 30 minutes for one-month starting 9 October 2008. For each item they collected off the feeds, they also recorded the date, title, description and feed id. (Wanner et al. 2009) After that, they eliminated all noise, which mostly consisted of non-content like URLS. (Wanner et al. 2009) The next step was to filter out all items that did not contain one of the following terms: “Obama”, “McCain”, “Biden”, “Palin”, “Democrat” and “Republican”. (Wanner et al. 2009) Sentiment was then analyzed using freely available lists “that evoke positive or negative associations.” (Wanner et al. 2009) The results were then visualized. Five events that happened during this period were chosen for a more detailed visual examination. They found that the news regarding possible abuse of power by Sarah Palin in Alaska resulted in many negative posts. They also found that the debates resulted in low sentiment scores for both candidates as the candidates attacked each other. The authors concluded that the visual tool they created would be useful for monitoring public debates.

This methodology can overlap with influencer identification (Weber 2009) as part of reputation management involves determining which people are worth responding to. It can also overlap with psychographics. Despite the obvious overlaps, this type of research often appears independently and not as part of a larger study.

Content analysis.

Content analysis involves looking at the individual components of something larger and analyzing it. In a social media context, the content could be comments on a Facebook fanpage, or all the tweets made by a person or group. Content analysis can be either qualitative or quantitative, depending on the purpose of the research.

With content analysis, the researcher views data as “data as representation not of physical events but of texts, images and expressions that are created to be seen, read, interpreted, and acted on for their meanings, and therefore be analyzed with such uses in mind.” (Krippendorff 2007) Krippendorff (2007) defines the basic methodology used in content analysis as unitizing, sampling, recording, reducing, inferring, and narrating.

An example of content analysis is a 2009 study by Kian, Mondello, & Vincent. It looked at ESPN and CBS’s internet coverage of men and women’s NCAA basketball tournament, also called March Madness. The methodology was spelled out by the authors as: “All 249 (N D 249) byline articles from CBS SportsLine and ESPN Internet were read, coded, and content analyzed to determine the descriptors in Internet articles.” (Kian, E., Mondello, M., & Vincent, J., 2009) The authors used multiple coders to help prevent bias in terms of interpretation of gendered language. The two sites in the sample were chosen because they were the largest. All types of March Madness content was included. Only the text of the content was included. Titles and authors were not. Categories for encoding gendered language were based on previous research by sport media researchers. Only descriptors were used for encoding. Totals for gendered descriptors were then calculated and an analysis was completed.

This method of analysis is can be done with other forms of analysis like sentiment analysis, as part of a usability study or collaboration study. It can also be done separately. It often appears most successful when done separately as part of a larger study to help provide context for other data analysis.

Usability studies.

In a social media context, usability studies look at how people use some aspect of the Internet or software that connects to it.

According to Sweeney, Dorey and MacLellan (2006), one of the purposes of a usability study is “point out specific usability problems with your Web site interface in line with how well your Web site speaks to your audience and their goals.” Jerz (2002) cautions that “Simply gathering opinions is not usability testing — you must arrange an experiment that measures a subject’s ability to use your document.” That caution also explains the general methodology of a usability study outlined by Jerz (2002): Collect both quantitative and qualitative data. The quantitative should involve some type of measurement. The qualitative should allow testers to express their opinions. Jerz (2002) suggests that you use at least five tests for the first run. Then, after fixing errors and problems based on tester feedback, you get another five testers to test the site to determine that those errors have been fixed.

An example of a usability study is one conducted by Sturgil, A., Pierce, R., & Wang, Y. (2010). The study tried to determine how much content readers of Internet news sites really wanted. The methodology involved conducting a focus group, and think-aloud sessions. In both cases, the researchers observed participants using Internet news sites. They also asked them questions regarding what content they visited and why. The methodology relied heavily on qualitative analysis with a small quantitative component.

Usability studies can be done in conjunction with traffic analysis and search analytics as the purposes are often similar: Improve the user experience and try to get users to complete certain tasks.

Interaction and collaboration analysis.

Interaction and collaboration analysis focuses on how people work together in an online environment. Collaboration analysis often looks more at how people work together to create something, such as contributing to a wiki or to create an event like an unconference, where everyone is working towards a common goal. Interaction analysis tends to focus on how people engage each other when there is no common goal in the group.

Software Services, Dale Carnegie & Associates, Inc., & Shedletsky, L. (2000) explain the methodology for interaction analysis. They encourage researchers to look at topics discussed, purposes of individuals’ utterances, structure of conversation, and how properties of talk affect outcomes when completing an interaction analysis. The researcher should determine the setting for which this type of analysis will be conducted: In a controlled setting such as a laboratory, by selecting samples of existing conversations, or by examining all conversation that the research is capable of overhearing. The researcher needs to determine if they will use prompted or unprompted interaction. They also need to determine how they will record conversations and if their analysis will be quantitative or qualitative in nature. Once these things have been determined, then a methodology for data collection can be figured out.

Viégas et al. (2007) did a collaboration analysis focusing on Wikipedia. The purpose of their work was to examine historical editing patterns and how editing practices have evolved over time. They built on work done by Viégas, Wattenberg and Dave in 2003. The methodology they used involved getting the editing history of articles across several different Wikipedia namespaces. The history of the articles was then examined using several visualization tools, metrics and methods depending on the established cultural practices for that namespace. One tool they used was a history flow visualization application. A method they used was the manual classification of “all user posts in a purposeful sample”. (Viégas et al., 2007) Metrics they used included count of horizontal rules, signed user names, new indentations levels, votes in polls and total “references to internal Wikipedia resources.” (Viégas et al., 2007) These tools, metrics and methods allowed them to examine how collaboration and interaction had changed over time.

This type of analysis often stands alone. It could be used as part of a usability study or relationship analysis to provide context for the results of those analysis types.

Relationship analysis.

Relationship analysis involves examining the relationships between users on a social network, message board or mailing list. The goal is to identify cliques of different sizes or people who are particularly influential in a particular group online. This type of analysis is important to many brands including Starbucks (Plimsoll, 2010). The purpose of relationship analysis is to identify key influencers and social who influencers who are or who have the potential to be brand evangelists. (Plimsoll, 2010)

Lord and Singh (2010) define social media influence marketing as being “about recognizing, accounting and tapping into the fact that as your potential consumer makes a purchasing decision, he or she is being influenced by different circles of people through conversations with them, both online and off.”

The methodology for influence identification is not clearly spelled out as identifying influencers can be heavily dependent on the network being examined and how the community on a specific site functions. As a result, social media marketers suggest an array of tools like Twitalyzer that can be used to help determine your own influence. (Ankeny 2009) Twitalyzer’s Peterson and Katz (2010) explain their site-specific method of determining influence as including the following variables: Engagement level, total followers, total following, hashtags cited, lists included on, frequency of updates, references by others, references of others, times content is retweeted, urls cited and a number of other variables. Sterne (2010) suggests using WeFollow.com to find people who use topic specific #hashtags on Twitter. The people who tweet the most about a topic are likely to be influencers in that others looking for tweets around a topic are likely to read them. In a wider web context, Sterne (2010) suggests using Technorati to identify bloggers who have clout and influence around a certain topic.

This type of research can be viewed as a fundamental component to sentiment analysis; social media marketing companies like Razorfish often package the two together. (Lord & Singh, 2010)

Population /Demographic studies.

Population studies involve defining the demographic characteristics of a community. In a population study, the goal is also to define the limits and size of the community that is being studied. Because of the complexity in defining the boundaries of a population and in sampling the whole of it, this type of research is rarely done in terms of social media.

Daugherty and Kammeyer (1995) define a population study as the assembling “of numerical data on the sizes of populations.” This sort of data is defined by the authors as “descriptive demographic statistics.” Daugherty and Kammeyer (1995) say “population numbers are always changing, so even if they are accurate when gathered they are soon out of date and inaccurate.” Daugherty and Kammeyer (1995) say the basic purpose of conducting a demographic study “is to explain or predict changes or variations in the population variables or characteristics.” Given the definition of a population study, the methodology involves counting all members of a select population.

The most famous example of a population study is the census. In the United States, this is done every ten years. According to the U.S. Census Bureau (n.d), the goal of the 2010 US census is ” to count all U.S. residents—citizens and non-citizens alike.” This is done by sending all citizens a ten-question questionnaire, requiring that people complete it by law and having a census taker follow up for all households did not return completed questionnaires. (U.S. Census Bureau, n.d.) The results are then calculated and are used by the government to make decisions.

This type of research often stands on its own. The results will often be utilized for marketing purposes in conducting other research, such as psychographics, to make that that sampling contains representative populations.

Online target analysis of behavior and psychographics.

Online targeting of and marketing towards a specific audience because of their demographic characteristics is extremely common on the Internet. Psychographics is a term that includes targeting towards a specific demographic group except it includes the offline component.

Sutherland and Canwell (2004) define psychographics as “market research and market segmentation technique used to measure lifestyles and to develop lifestyle classifications.” (p. 247) Nicolas (2009) defines online behaviorial analysis as a series of steps: Collecting user data across several sites, organizing information about users based on the sites they visit and their behavior on those sites, “infer demographics and interest data”, and classifying new users based on the collected data in order to deliver relevant ads and content based their demographic profiles. Kinney, McDaniel, and DeGaris (2008) define psychographics as attitude towards something such as a brand or involvement with an organization.

Given the methodology involved, much of this type of research involves action research in that it is done in a specific content, based on internal models to address specific situations.

An example of this type of research was done by Kinney, McDaniel, and DeGaris (2008) who investigated the demographic characteristics of NASCAR fans and their attitudes towards NASCAR, its sponsors and sponsor involvement with NASCAR. The research found that age, gender and education were all important variables in determining sponsor recall: Younger, more educated males had the best brand recall amongst NASCAR fans.

This type of research can be viewed as a subcomponent of a population study in that demographic information is sought about the population. In an online context, it often works in conjunction with search and traffic analytics analysis, content analysis, and interaction and collaboration analysis.

Predictive analysis.

A search on 13 July 2010 on SPORTDiscus had three results for “predictive analysis.” A search on the same date on Scopus had 605 results, 275 of which were in engineering, 132 in computer science and 102 in medicine. Predictive analysis is probably one of the least used analysis methods, especially in social media and fandom.

What is predictive analysis? At its simplest, it is identifying a future event or events, monitoring selection actions that precede the event and seeing if those events can be used to predict the outcome of similar events in the future. If a predictive value is found, an organization can monitor behaviors to help make more informed decisions.

An example of this type of research is “Predicting the Future With Social Media” by Asur and Huberman (2010). Their goal was to determine if tweet volume and sentiment on Twitter prior to a movie being released could be used to predict how well a movie performs at the box office. Their methodology involved identifying movie wider release dates that took place on a Friday, creating a list of keyword searches related to those movies, and using the Twitter API to collect all tweets and aggregate date that mention those keywords over a three month time period. The authors then compared the tweet volume to box office performance. They concluded that social media “can be used to build a powerful model for predicting movie box-office revenue.” (Asur & Huberman, 2010)

This type of research can be used in conjunction with other methods. It can be used along side a population study to see if certain actions will result in demographic changes.

Rational for Population Study

The literature review provides insight into the lack general quantitative analysis regarding the demographic and geographic characteristics of Australian sport fans in general and AFL fans in specific. Much of what is written involves observations based on match attendance, attendance statistics, common historical tropes based on the experience of the authors as members of the sport community or analysis based on demographic data around the community for which a club was based. The methodology rarely is spelled out. There is little reason to doubt the demographic composition of fans because most accounts match very well and there are a variety of citations that refer to a wide variety of sources. The literature review demonstrates a lack of quantitative research in terms of population characteristics.

The research regarding fan demographics in the Australian sport online community is even sparser. The focus on research being done tends to focus on fan production, such as the transition from fanzines to online mailing lists. It is often not quantitative in nature.

Given the hole in the research, there is a clear need to fill it to better understand the existing population of AFL fans who are increasingly using the Internet in order to facilitate their love of their chosen club. To do this, an appropriate methodology needs to be chosen. The previous section examined the major methodological approaches available for conducting research into social media and online populations. Most of these methods involve some form of interaction analysis or textual analysis. They do not offer a clear method of understanding the characteristics of a large group and its subcomponents.

Methodological Approach

The methodology used in this study will be a population study. To provide context for the findings, other methods will be utilized. The exact method for conducting the population study will differ depending on the site being examined. Therefore, most of the methodology used in this study will appear inside specific chapters.

Throughout this study, there is a dependence on user listed locations to determine the geographic location of members of the Australian sport fan community. To provide consistency across all sites looked at, a list was developed that included user generated location, city, state and country. The city, state and country were determined based on intelligent guesswork. For example, as the focus of the research is Australia, if Melbourne was standing alone, the assumption was that the user meant Melbourne, Victoria, Australia and not Melbourne, Florida, United States. Spelling variations and nicknames were also used to determine location. For example, Brisvegas is a nickname for Brisbane, Queensland, Australia. Often patterns of cities were looked for assuming the standard pattern of city, state, country or country, state, city or city, country or city, state. To aid in processing location lists more quickly, when using an automated tool such as the one for Twitter, the user-generated list was supplemented with one created by the author. This list included all Australian cities listed using the patterns of postal code and city, state and city, country, and city, state, country. The completed list contains over 65,000 variants that were listed by the author or user created.

References

Ankeny, J. (2009). HOW TWITTER IS REVOLUTIONIZING BUSINESS. Entrepreneur, 37(12), 26-32. Retrieved from Business Source Premier database.

Asur, S., & Huberman, B. A. (2010). Predicting the Future With Social Media. Social Computing Lab. Retrieved from http://www.hpl.hp.com/research/scl/papers/socialmedia/socialmedia.pdf

Bronwyn, B., Dawson, P., Devine, K., Hannum, C., Hill, S., Leydens, J., Matuskevich, D. Traver, C, and Palmquist, M. (2005). Case Studies. Writing@CSU. Colorado State University Department of English. Retrieved August 26, 2010 from http://writing.colostate.edu/guides/research/casestudy/.

Daugherty, H. G., & Kammeyer, K. C. W. (1995). An introduction to population. New York: Guilford Press.

Fang, W. (2007), Using Google Analytics for improving library website content and design: A case study. Library Philosophy and Practice 2007 (June), LPP Special Issue on Libraries and Google. Retrieved September 2, 2010 from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.84.5924&rep=rep1&type=pdf

Jansen, B. J. (2009). Understanding user-Web interactions via Web analytics. San Rafael, Calif.: Morgan & Claypool Publishers.

Jerz, D. (2002, November 11). Usability Testing: What is it? Dennis G. Jerz; Seton Hill University. Retrieved October 4, 2010, from http://jerz.setonhill.edu/design/usability/intro.htm

Kaushik, A. (2010). Web analytics 2.0: The art of online accountability & science of customer centricity. Hoboken, N.J: Wiley.

Kian, E., Mondello, M., & Vincent, J. (2009). ESPN—The Women’s Sports Network? A Content Analysis of Internet Coverage of March Madness. Journal of Broadcasting & Electronic Media, 53(3), 477-495. doi:10.1080/08838150903102519.

Kinney, L., McDaniel, S., & DeGaris, L. (2008). Demographic and psychographic variables predicting NASCAR sponsor brand recall. International Journal of Sports Marketing & Sponsorship, 9(3), 169-179. Retrieved from SPORTDiscus with Full Text database.

Krippendorff, K. (2007). Content analysis: An introduction to its methodology. Thousand Oaks, Calif.

Lord, B., & Singh, S. (2010). Fluent: The Razorfish Social Influence Marketing Report. Razorfish. Retrieved August 25, 2010, from http://fluent.razorfish.com/publication/?m=6540&l=1

Nicolas, P. (2009, December 17). “Online audience behavior analysis and targeting.” Patrick Nicolas Official Home Page. Retrieved August 1, 2010, from http://www.pnexpert.com/Analytics.html

Peterson, E., & Katz, J. (2010). Twitalyzer Help and Company Information | Twitalyzer: Serious Analytics for Social Media and Social CRM. Twitalyzer. Retrieved August 25, 2010, from http://www.twitalyzer.com/help.asp

Plimsoll, S. (2010). Find and target customers in the social media maze. Marketing (00253650), 10-11. Retrieved from Business Source Premier database.

Ramos, A., & Cota, S. (2009). Search engine marketing. New York: McGraw-Hill.

Software Services, Dale Carnegie & Associates, Inc., & Shedletsky, L. (2000, September 7). Interaction Analysis. University of Southern Main Communications Department. Retrieved October 5, 2010, from http://www.usm.maine.edu/com/chapter8/

Sterne, J. (2010). Social media metrics: How to measure and optimize your marketing investment. Hoboken, N.J: John Wiley.

Sturgil, A., Pierce, R., & Wang, Y. (2010). Online News Websites: How Much Content Do Young Adults Want?. Journal of Magazine & New Media Research, 11(2), 1-18. Retrieved from Communication & Mass Media Complete database.

Sutherland, J., & Canwell, D. (204). Key Concepts in Marketing. Palgrave Key Concepts. Hampshire, England: Palgrave MacMillan.

Sweeney, S., Dorey, E., & MacLellan, A. (2006). 3G marketing on the internet: Third generation internet marketing strategies for online success. Gulf Breeze, FL: Maximum Press.

Viégas, F. B., Wattenberg, M., Kriss, J., & van Ham, F. (2007). Talk Before You Type: Coordination in Wikipedia. Proceedings of the 40th Hawaii International Conference on System Sciences. Big Island, Hawaii. Retrieved from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.84.6907&rep=rep1&type=pdf

Vincenzini, A. (2010, July 14). A Case Study: The NBA’s Social Media Strategy & Tactics. Retrieved August 26, 2010, from http://www.slideshare.net/AdamVincenzini/the-nba-and-social-media-a-case-study

U.S. Census Bureau. (n.d.). How We Count America – 2010 Census. U.S. Census Bureau. Retrieved August 28, 2010, from http://2010.census.gov/2010census/how/how-we-count.php

Wanner, F., Rohrdantz, C., Mansmann, F., Oelke, D. and Keim, D., Visual Sentiment Analysis of RSS News Feeds Featuring the US Presidential Election in 2008. in Workshop on Visual Interfaces to the Social and Semantic Web, (2009).

Weber, L. (2009). Sticks and Stones: How Digital Business Reputations Are Created over Time and Lost in a Click. John Wiley & Sons Inc.

Related Posts:

Twitter: A Solution to the Follow Spammers

Posted by Laura on Tuesday, 5 October, 2010

I’m having another period of annoyance with Twitter. I really feel like I should probably turn off alerts for followers again because right now? I’m pretty much putting people on a spammer list if they have 2,000 people they already follow. I’m also sending out cranky DMs blasting people for doing this sort of following.

For the past two months, I’ve spent a lot of time looking at Twitter. I’ve looked at follower counts. I’ve looked at follower geographic patterns. I’ve looked at people’s descriptions. I’ve looked at people’s geographic locations. The point of this is often to determine the geographic location of Australian sport fandom. I’ve read a fair bit on technology blogs about Twitter to help further my own understanding of Twitter to help me with intended mini-literature review in my Twitter chapter of my dissertation. I’ve basically been ODing on Twitter. There is a lot of interesting stuff out there.

But as a user? I’m getting pretty cranky. Seriously cranky. Every day, it feels like I’m getting 2 to 10 follows (across about 3 different accounts) from people who I don’t know, who are not geographically close to where I’m writing about, who don’t appear interested in professional sports, who have low interaction rates, who have 2,000+ people they follow. As part of my research, I constantly ask: What is the ROI for a team on Twitter in terms of where their audience is located? How can they best leverage their network? What can they provide for their fans to induce them to follow them? How can their fans help them? As a user, I can’t see how the people like I describe who follow me gain any benefit from that. (They can’t read me. I can barely keep up with 350. I function more or less because Americans get neglected as they post while I sleep.) (In one case, I got followed and unfollowed by about 5 times by the same user with 4,000 followers. ) As a user, I can’t see what they offer me. They rarely bother to explain.

And this is killing my desire to stay on Twitter. Seriously killing my desire to stay on Twitter. I just can’t. There are people I want and need to keep track of on Twitter for professional reasons. (The personal ones are almost exclusively on Facebook these days. On that level, I don’t feel the need to stay.) If you’re not active on Twitter and you cover social media, people sometimes doubt your legitimacy because you’re not using the product you’re discussing.

What I’d really like is for Twitter to make the following reforms:

1. Add a field for follow philosophy. It can be selecting from a list. It can be freeform writing. This way, when people follow others, they can see if they have a mutual philosophy. “I follow back people everyone.” “I follow friends, family and professional acquaintances.” “I follow celebrities.” “I follow only people with less than 1,000 followers.”
2. Allow people to block people with certain follow totals unless you follow them first. (I want to block anyone with 1,500 people they follow from following me first. If you want to follow me, interact with me first. Otherwise, add me to a list.) This way, spam following by power users is cut back.

The two following methods would help to kill off the Twitter spam following (and yes, your unwanted e-mail notification that you followed me to never read me is spam. It is unwanted and unsolicited and you didn’t indicate any mutual interest.) and help prevent my own fatigue. I use and prefer Facebook more than Twitter precisely because I’m not inundated with unwanted announcements like that.

Related Posts:

Twitter Weekly Updates for 2010-10-03

Posted by Laura on Sunday, 3 October, 2010
  • I'm at Wikimedia Foundation (149 New Montgomery, San Francisco): #
  • I'm at Wikimedia Foundation (149 New Montgomery, San Francisco): #
  • I'm at San Francisco International Airport (SFO) (1 S McDonnell Rd, at S Link Rd, San Francisco) w/ 45 others. 4sq.com/MTivk #
  • I am reading Traffic: Why We Drive the Way We Do #Traffic bit.ly/ceSiAA #
  • Back in Australia. :D Had great time at WMF. Learned lots. Excited to be back home. :) #
  • Library toilet stall wall… twitpic.com/2tgtiv #
  • I earned the Check-in Pro sticker on @GetGlue! bit.ly/aaMSne #
  • One of my favorite tools changed and I can't use it anymore. :( Need a new tool for batch reverse geocoding with output to table for excel. #
  • Got sport books. More cricket books than other sports here… (@ Vinnies) 4sq.com/ahUNhm #
  • Quick question: Which one is better? Virgin or Qantas for short haul Canberra to Melbourne? Qantas is $30 more. #
  • Went Virgin Blue. #
  • Anyone in Melbourne willing to put me up for a few days around the 21 to 25 Oct? :) Poor student needs help. :) #
  • Id #
  • I earned the Triathlon sticker on @GetGlue! bit.ly/dy23gJ #
  • Pictures from WikiProject Screencast: bit.ly/aTMqE5 #

Powered by Twitter Tools

Related Posts:

Twitter Weekly Updates for 2010-09-26

Posted by Laura on Sunday, 26 September, 2010
  • My bike and nifty bike bags. twitpic.com/2q7oip #
  • I should do a weekly podcast of my degree progress and treat it like a sport report: Stats, major accomplishments, needs improvement… #
  • Is the word cunt viewed as less offensive in Australia than the USA? Where it is pretty hugely offensive? #
  • Greyhound to Sydney airport. Leaves in 90 minutes. Yay! (@ Jolimont Tourist Centre) 4sq.com/dbUpz1 #
  • Greyhound bus station in Canberra. twitpic.com/2qggqw #
  • iPhone plugged in to not waste battery. twitpic.com/2qgh63 #
  • Yay! Arrived way to early to checkin but here! (@ Sydney International Airport (SYD) ✈ w/ 2 others) 4sq.com/aVXNPp #
  • Cranky at airport. Not healthy mindset. #
  • I earned the Lone Wolf sticker on @GetGlue! bit.ly/dck62N #
  • I can play with my iPod the whole long flight! twitpic.com/2qk52c #
  • Flight departure delayed because of refueling issues… Fun. *frets* #
  • My iphone does not work here. No internet! Scary! #
  • Plane landed in san francisco. #
  • Walking across the golden gate bridge wi twitpic.com/2r8so4 #
  • Motel 6 plugs. twitpic.com/2ratee #
  • I'm at Motel 6 (111 Mitchell Avenue, South San Francisco): #
  • Having fun in the server room. All ur wiki r belong 2 me. (@ Wikimedia Foundation) 4sq.com/8bSfZo #
  • My internet access is limited while I am in SanFran at WMF so I'm not checking / updating. If need me, e-mail me. :) #
  • I'm at Wikimedia Foundation (149 New Montgomery, San Francisco): #
  • Photo at Wikimedia Foundation gowal.la/c/2zatK?139 #
  • I'm at Wikimedia Foundation (149 New Montgomery, San Francisco): #

Powered by Twitter Tools

Related Posts:

Twitter Weekly Updates for 2010-09-19

Posted by Laura on Sunday, 19 September, 2010
  • Most popular Australian athletes, clubs, leagues and sport organizations on Twitter : bit.ly/cAG7kS #afl #nrl #nbl #aleague #
  • I just ousted Flo T. as the mayor of University Of Canberra Library on @foursquare! 4sq.com/aQU5Lb #
  • I just became the mayor of University of Caberra, Building 9 on @foursquare! 4sq.com/9bkJ4Y #
  • More shopping carts. … I should make a coffee table book. twitpic.com/2nxg8x #
  • Another shopping trolley… twitpic.com/2nysdi #
  • I'm getting bossy via e-mail. By that I mean telling them they were right and I was not so right so go fix that. #
  • Rethinking my dissertation topic. This causes me to be very stressed out. Thinking of switching leagues and doing a case study. :/ #
  • Move to Australia. Can't escape Oprah. She's coming here. #
  • Carts/trolleys. twitpic.com/2o7oik #
  • My television is an Oprah free zone. No no and no. I get it. Australians love $$$Oprah$$$. I love what she's done for Chicago too. #
  • Trying to improve my list of Australian sport clubs, leagues, organizations, athletes. Will never be complete but found 15 more. #
  • Which AFL team do Canberrans support in the Grand Final? bit.ly/cGNZHd #afl #gosaints #gocats #godogs #gopies #
  • Why does it feel like that by the time I leave next week, the AUD and USD will be equal? This is really scary. #
  • Canberra and the Raiders on Twitter and Facebook : bit.ly/drGcKx #nrl #goraiders #
  • Wrote 3,000 words between 7am and 4:30pm. I need a nap now. Or maybe pizza. mmm. pizza. #
  • I love Piled Higher and Deeper: bit.ly/9QI3yl Had one of those moments recently. #
  • Most popular Australian athletes, clubs, leagues and sport organizations on Twitter (version 2) : bit.ly/cfzEvk #
  • My collection of Australian sport books. twitpic.com/2osbmg #
  • I just ousted Christian as the mayor of Angus & Robertson on @foursquare! 4sq.com/9q0hjG #
  • Facebook's Twitter directory has a severe American bias. This is annoying. #
  • Watched Survivor. Few moments of squee: OMG! IT IS JIMMY JOHNSON! Australian friend was "Who?" I love Survivor. :D #
  • Bathroom writing at civic centre. twitpic.com/2p6c7y #
  • Rode my bike 50 KM today. Ouchies. Don't want to do that again any time soon. Did get bike bags. Yay! #
  • ACT government ranger people called about my swooping magpie report. Good. I hate them. #
  • Information on bulk loading a wiki: bit.ly/bADfqd #

Powered by Twitter Tools

Related Posts:

Twitter Weekly Updates for 2010-09-12

Posted by Laura on Sunday, 12 September, 2010
  • Twitter: @NRL followers vs. @AFL followers: bit.ly/aYHkzA #
  • I just became the mayor of UCNISS on @foursquare! 4sq.com/aBe9pE #
  • Confusing world of Aussie sport merchandise: yes to Duke, no to Canberra Raiders… twitpic.com/2lp4x1 #
  • Twitter: @NRL followers vs. @AFL followers : bit.ly/aYHkzA #
  • I'm in sport studies. Some one must be able to inflate a football. Will find out tomorrow. #
  • I'm in sport studies. Some one must be able to inflate a football. Will find out tomorrow. #
  • Another spam follower: @interloperinc . When following 42000, you don't need to follow more. #
  • Uni. Aus. guy on ABC24: Saying Aussie unis rock because international students pay lots of revenues? It makes Aussie unis seem SUCK. #
  • Uni. Aus. guy on ABC24: International students should not be seen as commodity. Misplaced goals: WE WANTS MONEY is not YAY! EDUCATION! #
  • I love @TimBull. :D Very helpful with Twitter stuff. :D Highly recommend his site: bit.ly/9dYrVc #
  • It does not sound right when an Aussie imitates a Canadian by saying "Aye." #
  • Productive day: wrote 800 words on my dissertation. Good times. #
  • I have more free time in SFO than I thought. Win! :D #
  • I meant to do writing today. It just didn't happen. #
  • The Top News Story on WIN Canberra? A warning that magpie swooping season has begun. Oh Canberra… I don't know what to say. #
  • I think I am going through sport culture shock. It would explain my current attitude problems… #
  • Sport culture shock: America vs. Australia : bit.ly/cDYl8Y #
  • The trick in life is to turn existential angst over social media into something productive. Like getting closer to your dream job. #
  • Support #teamgws in Canberra: youtu.be/_o0a-CYkoxQ #
  • Canberra people: What is the easiest non-car way to get from Gungahlin to Wanniassa? Bike says 30km each way and eek! #
  • RT @ someone "Researching IT in Education: Theory, Practice and Future Directions" amzn.to/9gWV7y #
  • Who is more sport mad (re: professional sport)? Americans or Aussies? Methodology question : bit.ly/aDCDAE #
  • Rugby Union in the ACT. twitpic.com/2n9djh #
  • More… twitpic.com/2n9eld #
  • Open later on Saturday than Krispy Kreme. Need hot chocolate. (@ Gloria Jeans @ Borders) 4sq.com/b0VzuG #
  • It only took 45 minutes but a family member can now update Facebook from their mobile. Yay! :D #
  • Is it bad to support the All Blacks because I know it will tweak some Australians? (Hint: Some one get me a shirt.) #
  • Saw a dissertation on library website case study with 250 pages of Appendices. It gives me much hope. :D #

Powered by Twitter Tools

Related Posts:

Who is more sport mad (re: professional sport)? Americans or Aussies? Methodology question

Posted by Laura on Saturday, 11 September, 2010

I had a discussion the other day. My observations, based in a small part based on my American issues, is that Americans are more sport bad and are more likely to support their professional sport teams than Australians are. These sort of allegiances (well expressing them) just appear more fundamental to being an American than being an Australian. (This is based on the perception of people wearing sport related apparel for local teams.) The question is: How do you test this? Is there a way to prove that Americans or Aussies are more likely to barrack for their national professional leagues and clubs?

I’m not sure how feasible doing that is and any methodology seems like it could have holes easily torn in it. That said, coming from a place of some one who likes online population studies, the following is my proposed methodology:

  1. Get all the follower location data for people who follow the major national based leagues Twitter accounts in the United States and Australia.  In this case, @AFL, @NRL for Australia and @MLB, @NCAA, @NFL, @NBA for the USA.
  2. Get a follower count of the total Australians following Australia’s leagues.  Get a count of Americans following Australian leagues.  Get a follower count of the total Americans following American leagues.  Get a follower count of the total Australians following American leagues.
  3. Determine the percentage of the total population in Australia following Australian and American leagues.  Determine the percentage of the total population of the USA following Australian and American leagues.
  4. Repeat the above process focusing on local teams: @sydneyswans, @nrl_bulldogs for Sydney. @stkildafc @MelbStormRLC for Melbourne. @NHLBlackhawks , @chicagobulls @whitesox @Cubs for Chicago. @Mets , @YANKEES, @thenyknicks , @nyjets for New York.  Instead of national focus, include only population from the metro area for each city.

Combine the above with a Facebook geography:

  1. Go to https://www.facebook.com/ads/create/ . Get the total population of fans in Australia and the United States for the AFL, A-League, W-League, NRL, WNBL, and ANZ Championship.  Get the total population of fans in Australia and the United States for the NFL, NBA, MLB, NHL, NCAA, MLS, WPS and WNBA.
  2. Determine what the percentage of Facebook fans is relative to the total population of the country.
  3. Repeat for local teams in Sydney, Melbourne and Brisbane in Australia, Chicago, New York, Los Angeles in the United States.  Count only people living with in 50 miles of the city.

The results should show that one country has a greater population expressing interest in professional sport.  This country could them be deemed more sport mad than the other.

Does it sound like a methodology that could show what I think it shows?  Are there any better methods for determining which country is more sport mad?


So I decided to get some of this data for Facebook. When getting interests, I tried to use every relevant interest related to a team. The population data for the metro regions comes from Wikipedia. The population data for the country comes from Google. Analysis later. Early observation: Australia as a whole more dedicated to leagues. Americans are more dedicated to local club support.

League/Team Based in Country/City Fans Country/Metro population % fans Country/Metro population on Facebook % fans
AFL Australia Australia 261,580 21,431,800 1.221% 9,621,400 2.719%
AFL Australia United States 4,720 307,006,550 0.002% 133,925,380 0.004%
NBA United States Australia 5,040 21,431,800 0.024% 9,621,400 0.052%
NBA United States United States 573,340 307,006,550 0.187% 133,925,380 0.428%
NFL United States Australia 3,860 21,431,800 0.018% 9,621,400 0.040%
NFL United States United States 438,020 307,006,550 0.143% 133,925,380 0.327%
NRL Australia Australia 71,100 21,431,800 0.332% 9,621,400 0.739%
NRL Australia United States 940 307,006,550 0.000% 133,925,380 0.001%
WNBA United States Australia 20 21,431,800 0.000% 9,621,400 0.000%
WNBA United States United States 3,740 307,006,550 0.001% 133,925,380 0.003%
A-League Australia Australia 3,900 21,431,800 0.018% 9,621,400 0.041%
A-League Australia United States 100 307,006,550 0.000% 133,925,380 0.000%
NHL United States Australia 4,920 21,431,800 0.023% 9,621,400 0.051%
NHL United States United States 447,000 307,006,550 0.146% 133,925,380 0.334%
NCAA United States Australia 320 21,431,800 0.001% 9,621,400 0.003%
NCAA United States United States 254,560 307,006,550 0.083% 133,925,380 0.190%
ANZ Championship Australia Australia 1,480 21,431,800 0.007% 9,621,400 0.015%
ANZ Championship Australia United States 20 307,006,550 0.000% 133,925,380 0.000%
WPS United States Australia 18,100 21,431,800 0.084% 9,621,400 0.188%
WPS United States United States 160 307,006,550 0.000% 133,925,380 0.000%
W-League Australia Australia 1,340 21,431,800 0.006% 9,621,400 0.014%
W-League Australia United States 60 307,006,550 0.000% 133,925,380 0.000%
Chicago Cubs Chicago Chicago 819,260 9,569,624 8.561% 5,915,800 13.849%
Chicago Bears Chicago Chicago 825,400 9,569,624 8.625% 5,915,800 13.952%
Chicago Bulls Chicago Chicago 373,640 9,569,624 3.904% 5,915,800 6.316%
Chicago Blackhawls Chicago Chicago 360,940 9,569,624 3.772% 5,915,800 6.101%
Chicago White Sox Chicago Chicago 211,160 9,569,624 2.207% 5,915,800 3.569%
Chicago Fire Chicago Chicago 23,900 9,569,624 0.250% 5,915,800 0.404%
Chicago Bandits Chicago Chicago 1,140 9,569,624 0.012% 5,915,800 0.019%
Brisbane Broncos Brisbane Brisbane 41,160 2,004,262 2.054% 1,422,400 2.894%
Brisbane Lions Brisbane Brisbane 6,680 2,004,262 0.333% 1,422,400 0.470%
Queensland Maroons Brisbane Brisbane 68,900 2,004,262 3.438% 1,422,400 4.844%
Queensland Reds Brisbane Brisbane 4,880 2,004,262 0.243% 1,422,400 0.343%
Queensland Roar Brisbane Brisbane 2,480 2,004,262 0.124% 1,422,400 0.174%

Related Posts: