orgtheory.net

Archive for the ‘computer science’ Category

the web of commons, a talk by karissa mckelvey

Long time friend Karissa McKelvey talks about solving commons problems in a key note address at Full Stack 2017.

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($4.44 – cheap!!!!)/Theory for the Working Sociologist (discount code: ROJAS – 30% off!!)/From Black Power/Party in the Street / Read Contexts Magazine– It’s Awesome 

Advertisements

Written by fabiorojas

October 2, 2017 at 4:02 am

the most metal words of all time

At Degenerate State, there was an interesting post where someone applied natural language processing models to heavy metal lyrics. From the article:

To get the lyrics, I scraped www.darklyrics.com. While darklyrics doesn’t have a robots.txt file, I tried to be gentle with my requests. After cleaning the data up, identifying the languages and splitting albums into songs, we are left with a dataset containing lyrics to 222,623 songs from 7,364 bands spread over 22,314 albums.

Before anyone asks, I have no intention of releasing either the raw lyric files or the code used to scrape the website. I collected the lyrics for my own entertainment, and it would be too easy for someone to use this data to copy darklyrics. If people are interested I may release some n-gram data of the corpus.

So what do you find? A few tidbits  – the heavy metal word cloud:

Tag Cloud of All Metal Lyrics

Then, the most “metal words:”

Rank Word Metalness
1 burn 3.81
2 cries 3.63
3 veins 3.59
4 eternity 3.56
5 breathe 3.54
6 beast 3.54
7 gonna 3.53
8 demons 3.53
9 ashes 3.51
10 soul 3.40
11 sorrow 3.40
12 sword 3.38
13 goodbye 3.28
14 dreams 3.28
15 gods 3.24
16 pray 3.22
17 reign 3.15
18 tear 3.12
19 flames 3.12
20 scream 3.11

And the least metal words:

Rank Word Metalness
1 particularly -6.47
2 indicated -6.32
3 secretary -6.29
4 committee -6.16
5 university -6.09
6 relatively -6.08
7 noted -5.85
8 approximately -5.75
9 chairman -5.69
10 employees -5.67
11 attorney -5.66
12 membership -5.64
13 administrative -5.61
14 considerable -5.60
15 academic -5.51
16 literary -5.49
17 agencies -5.48
18 measurements -5.47
19 fiscal -5.45
20 residential -5.45

The bottom line? Academia, the law and administration are the least metal topics of all time. Who knew?

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($5 – cheap!!!!)/Theory for the Working Sociologist/From Black Power/Party in the Street  

Written by fabiorojas

April 19, 2017 at 1:46 am

sas programming

“How do you feel about programming in SAS?”

“Here’s how I feel. When I program in SAS, I feel like I got my master’s degree in statistics in 1980 and I’ve been running the same basic analysis over and over again for my corporate bosses for the last twenty years. I then feel like it’s Friday afternoon and I’m just slogging through this code so I can meet my buddies after work at Chili’s and talk about this weekend’s big game.”

“That is exactly how I feel.”

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($2!!!!)/From Black Power/Party in the Street

Written by fabiorojas

September 19, 2016 at 3:29 am

trump symposium week

This week, we’ll have a few posts about the candidacy of Donald Trump. It will have three parts:

  1. Tom Gill will post on Trump as a political performer.
  2. Then, Josh Pacewicz will dig into Trump’s poll numbers.
  3. We’ll wrap up with a post by me on Trump, where I’ll add some of my own thoughts.

I’ll focus on the following points. Interested readers should send me questions:

  1. How predictable/unpredictable was the Trump candidacy?
  2. Using the Entertainment Theory of the GOP to understand Trump’s nomination and likely November loss.
  3. Using Trump to explain when social science theories do/do not work.

What do you want to know about Trump? Use the comments or send me email.

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($2!!!!)/From Black Power/Party in the Street

Written by fabiorojas

August 29, 2016 at 12:01 am

on the mindset of coders

Kieran has linked to a very interesting set of remarks at the SASE conference by Maciej Cegłowski about the mentality of computer programmers. A few choice clips:

But as anyone who’s worked with tech people knows, this intellectual background can also lead to arrogance. People who excel at software design become convinced that they have a unique ability to understand any kind of system at all, from first principles, without prior training, thanks to their superior powers of analysis. Success in the artificially constructed world of software design promotes a dangerous confidence.

About the economy of collecting information:

Surveillance capitalism has some of the features of a zero-sum game. The actual value of the data collected is not clear, but it is definitely an advantage to collect more than your rivals do. Because human beings develop an immune response to new forms of tracking and manipulation, the only way to stay successful is to keep finding novel ways to peer into people’s private lives. And because much of the surveillance economy is funded by speculators, there is an incentive to try flashy things that will capture the speculators’ imagination, and attract their money.

This creates a ratcheting effect where the behavior of ever more people is tracked ever more closely, and the collected information retained, in the hopes that further dollars can be squeezed out of it.

Read the whole thing.

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($2!!!!)/From Black Power/Party in the Street 

Written by fabiorojas

July 6, 2016 at 12:32 am

agent based models in sociology, circa 2016

A few days ago, we had a discussion about the different meanings of the word “computational sociology.” A commenter wrote the following:

Are agent based models/simulations a dead end? Are smart people still using that technique? Have there been any important results? I didn’t realize it peaked in the 1980s.

I’m a current doctoral student considering pursuing ABM, but if it’s a dead end then maybe not.

I think that olderwoman’s response is on target. There is nothing out of style about ABM’s, but sociology is mainly a discipline of empiricists. You will find scholars who occasionally to ABMs but no one who ONLY does is very, very rare. Examples of people who have done simulations: Damon Centola, Kathleen Carley, Carter Butts. In my department, I can think of two people who have published simulations (Clem Brooks, Steve Benard, and myself) and those who do methods research often employ simulations. Olderwoman is also correct in that writing simulations helps you develop programming skills that are now required for “big data” work and for industry.

So don’t write an all simulation dissertation, but by all means, if you have good ideas, simulate them!

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($2!!!!)/From Black Power/Party in the Street

Written by fabiorojas

June 14, 2016 at 12:01 am

three computational sociologies

I was having a discussion with a visiting scholar about what computational sociology means right now. In my career, the term has been used in at least three different ways:

  • Statistics – for the baby boomer generation of social scientists, “computing in socioal science” meant applied statistics. Remember, it requires a lot of knowledge and skill to store data and estimate models on computes with limited computing power.
  • Agent based models – in the 1980s and 1990s, “computational” meant running simulations.
  • Big data/CS techniques – currently, the term seems to refer to either (a) large data generated by online behavior  and/or (b) using computer science techniques (e.g., topic models or sentiment analysis) to study social science data

Use the comments to discuss other uses of the term.

50+ chapters of grad skool advice goodness: Grad Skool Rulz ($2!!!!)/From Black Power/Party in the Street

Written by fabiorojas

June 9, 2016 at 2:11 am