Web 2.0 Authors: Pat Romanski, Liz McMillan, Carmen Gonzalez, Imran Akbar, Elizabeth White

Blog Feed Post

The most influential data scientists on Twitter

Twitter is emerging as an important medium for determining influence in many fields. Social ranking sites like Klout and Traackr include Twitter as a heavily-weighted component of their ranking algorithms, for example. Twitter isn't representative of the members of any field, but in areas where the members primarily engage online, it can be a useful proxy. SocialFlow's Gilad Lotan has used an analysis of social networks on Twitter to rank influences in such fields, including the communities of python users and data scientists. Here's the network chart for data scientists: Each circle represents a data scientists on Twitter (anyone with Data Science, Data Scientist, Machine Learning, Data Strategy or the like in their bio). The circle size is proportional to their influence during a week this past October. If you're active in the R or Data Science communities on Twitter, you'll recognize many of the names: Hilary Mason (@hmason) from bitly, Pete Skomoroch (@peteskomoroch) from LinkedIn; DJ Patil (@dpatil), author of Building Data Science Teams, and #rstats regulars Ryan Rosario (@datajunkie), John Myles White (@johnmyleswhite) and Mark Alen (@siah). (I'm down there in the bottom right corner: @revodavid.) The colors represent connected clusters within the social graph, detected autonomously. With Hilary Mason's help, Gilad assigns meaning to these clusters: Purple seems to be a mix of east coast and academics, while the dark blue is the west coast data drinking crew. Yellow looks like west coast social network folks while green have been doing it for a while. Although @BigDataBorat is identified within that segment… hmmm… The orange cluster is harder to nail down. Perhaps more academic, applied math and less tech-scene? The analysis of the social network code was performed with Python, and the visualization was created with Gephi. You can find the details of how it was done, including slides and Python code, at the link below. Gilad Lotan:  Mapping Twitter’s Python and Data Science Communities (via Quora)

Read the original blog entry...

More Stories By David Smith

David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire) overseeing the product management of S-PLUS and other statistical and data mining products.<

David smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on twitter at @RevoDavid