Archive for the ‘mere empirics’ Category
So far, the Patriots have been nailed on two cheating scandals – deflation gate 2015 and the 2006 spying scandal. Each of these is interesting in its own right but there is one implication that few are willing to utter. The Patriots are probably cheating in more ways than we imagine.
The intuition is simple. Cheating incidents are not independent. It is not likely that every person will cheat with equal probability. Rather, people who want to cheat are the most likely to cheat and do so over and over. Also, consider incentives. They have been caught cheating multiple times and that hasn’t seemed to harm them much at all. The conclusion is that it is highly likely the Patriots are cheating in other ways.
I think it would be interesting for the fans of vanquished teams to conduct Levitt style analyses of the Patriots. I would guess that looking at other data in addition to the now famous fumble analysis will yeild some interesting answers.
Question for readers who teach networks: What software should I use for low tech undergrads? So far, I am having some real challenges…
I have an undergrad class where the first major assignment is to download one’s Facebook network and analyze it. I have been using NetVizz, an app inside Facebook, to extract network data. But it suddenly disappeared! One solution is to use the Facebook importer in NodeXL. That works but… Windows 8 is highly allergic to NodeXL. And lots of people have Windows 8 and they have endless installation problems. And the Java version is an issue. Even when it does work, NodeXL gets stuck downloading data from some student accounts. No explanation. It just does.
Then one can try Gephi, which is a whole ball of wax. The issue with Gephi is that it is highly sensitive to OS version. Luckily, there are fixes but they often involve Mac esoterica (e.g., Apple support does weird things in Safari, but not Chrome). Even then, students have all kinds of unexplained Gephi problems (e.g., the visualization pane simply doesn’t work on some Macs).
I need people to download a spreadsheet of data (e.g., centrality scores for people in your network) and not just pictures, so the Wolfram App and others are of limited value. Also, Wolfram seems to stall on some machines (including a Mac I have at home). I tried installing UCINET on Windows 8 as an end run… but had installation problems.
Here are my requirements. I need software that:
- Can be easily used by low-math undergrads
- Low cost/free
- Is very stable in terms of Windows 7, 8 and various Mac OS versions.
- If possible, a way to import Facebook data, and produce spreadsheets of data.
The last time two times I did this course, NetVizz, Gephi and UCINET did the trick. But there is a new generation of operating systems and the usual software hasn’t been upgraded and thoroughly tested. In previous years, I might have only or two students who couldn’t get network software running. This semester, it is a third of the class. Argh.
Any advice is welcome.
Within informatics, there is a healthy body of research showing how social media data can be used for forecasting future consumption. The latest is from a study by Nielsen, which shows some preliminary evidence that Twitter activity forecasts television program popularity. In their model, adding Twitter data increases the explained variance in how well a TV show will in addition to data on promotions and network type. Here’s the summary from Adweek.
Before the holiday, we asked – what should computational sociologists know? In this post, I’ll discuss what sociology programs can do:
- Hire computational sociologists. Except for one or two cases, computational sociologists have had a very tough time finding jobs in soc programs, especially the PhD programs. That has to change, or else this will be quickly absorbed by CS/informatics. We should have an army of junior level computational faculty but instead the center of gravity is around senior faculty.
- Offer courses: This is a bit easier to do, but sociology lags behind. Every single sociology program at a serious research university, especially those with enginerring programs should offer undergrad and grad courses.
- Certificates and minors: Aside from paperwork, this is easy. Hand out credentials for a bundle of soc and CS courses.
- Hang out: I have learned so much from hanging out with the CS people. It’s amazing.
- Industry: This deserves its own post, but we need to develop a model for interacting with industry. Right now, sociology’s model is: ignore it if we can, lose good people to industry, and repeat. I’ll offer my own ideas next week about how sociology can fruitfully interact with the for profit sector.
Add your own ideas in the comments.
My co-bloggers are on a roll. Zynep Tufekci and Brayden King have an op-ed in the New York Times on the topic of privacy and data:
UBER, the popular car-service app that allows you to hail a cab from your smartphone, shows your assigned car as a moving dot on a map as it makes its way toward you. It’s reassuring, especially as you wait on a rainy street corner.
Less reassuring, though, was the apparent threat from a senior vice president of Uber to spend “a million dollars” looking into the personal lives of journalists who wrote critically about Uber. The problem wasn’t just that a representative of a powerful corporation was contemplating opposition research on reporters; the problem was that Uber already had sensitive data on journalists who used it for rides.
Buzzfeed reported that one of Uber’s executives had already looked up without permission rides taken by one of its own journalists. Andaccording to The Washington Post, the company was so lax about such sensitive data that it even allowed a job applicant to view people’s rides, including those of a family member of a prominent politician. (The app is popular with members of Congress, among others.)