This semester, I decided to take a statistics class at the UC Berkeley extension in Redwood City. Stats has never been one of my strengths so I decided to take the class to bolster my skills and refresh my memory. For my class project, I decided to conduct a survey of engagement with social media sites, focusing on Facebook, flickr, and del.icio.us. My main question was whether there is a relationship between a user’s age and their engagement with a social network site. To assess engagement, I looked at two metrics - size of social network and size of digital artifacts collection. I defined digital artifacts for each site as Facebook wall posts, flickr photos, and del.icio.us bookmarks. In my sample, I found a weak negative association between age and level of engagement, meaning that younger users tended to be more engaged. Nonetheless, this relationship was fairly weak. These results seem counterintuitive to me - I hypothesized a strong negative association. I believe that these results are due to my sampling procedures, which were not random at all and most participants tended to be around my own age.
Out of curiosity (and because the Excel CORREL function is super easy to use), I started correlating all sorts of various metrics, hoping I can find a strong association somewhere in my data! I found a strong positive association with the size of a del.ico.us user’s network and the size of her bookmarks collection. This means that as a user’s social network increased in size, so did their bookmarks collection. These results are interesting but not too surprising given the nature of del.icio.us. del.icio.us users with larger social networks are exposed to more bookmarks, merely because they know more users who are posting more bookmarks and those bookmarks are winding up in their Network page. And once those bookmarks end up on their Network page, they can easily save those bookmarks themselves.
For those who may be interested, my project slides are now available online.






