{"id":803,"date":"2023-12-06T23:29:43","date_gmt":"2023-12-06T23:29:43","guid":{"rendered":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/?p=803"},"modified":"2023-12-06T23:49:23","modified_gmt":"2023-12-06T23:49:23","slug":"network-sampling-blueskytes","status":"publish","type":"post","link":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/?p=803","title":{"rendered":"Network Sampling Blueskytes"},"content":{"rendered":"\n<p>\nI&#8217;m curious about Bluesky, and have accidentally found I can access part of its API. In particular, given a user&#8217;s handle, I can download their skytes, their posts. I got tipped off here: <a href=\"https:\/\/mastodon.social\/@badlogic@mastodon.gamedev.place\/111246801418649849\">https:\/\/mastodon.social\/@badlogic@mastodon.gamedev.place\/111246801418649849<\/a>\n<\/p>\n\n\n\n\n\n<!--more-->\n\n\n\n<p><\/p>\n\n\n\n<p>\nEarlier this week, I decided to use this to see who&#8217;s who in Bluesky. Starting with 10 &#8220;seed&#8221; accounts, I downloaded their 100 most recent skytes (blurts?) and scanned them for references to other Blueskytes. I then repeated the process on this expanded list of users until I had 10,000 usernames (slow: it ran overnight). It&#8217;s not good network sampling, but it is simple and brute force. Along the way I kept track of how many references there were to each user, so I ended up with data on how often different accounts are @-ed or mentioned. 
Here is the top 50:\n<\/p>\n\n<table border=\"0\" cellspacing=\"0\" cellpadding=\"6\" rules=\"groups\">\n\n\n<colgroup>\n<col align=right \/>\n\n<col align=left \/>\n\n<col align=right \/>\n<\/colgroup>\n<thead>\n<tr>\n<th scope=\"col\" align=right>Rank<\/th>\n<th scope=\"col\" align=left>Handle<\/th>\n<th scope=\"col\" align=right>Mentions<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td align=right>1<\/td>\n<td align=left>neilhimself.neilgaiman.com<\/td>\n<td align=right>476<\/td>\n<\/tr>\n\n<tr>\n<td align=right>2<\/td>\n<td align=left>bsky.app<\/td>\n<td align=right>427<\/td>\n<\/tr>\n\n<tr>\n<td align=right>3<\/td>\n<td align=left>pfrazee.com<\/td>\n<td align=right>421<\/td>\n<\/tr>\n\n<tr>\n<td align=right>4<\/td>\n<td align=left>darthbluesky.bsky.social<\/td>\n<td align=right>242<\/td>\n<\/tr>\n\n<tr>\n<td align=right>5<\/td>\n<td align=left>jay.bsky.team<\/td>\n<td align=right>233<\/td>\n<\/tr>\n\n<tr>\n<td align=right>6<\/td>\n<td align=left>nytimes.com<\/td>\n<td align=right>206<\/td>\n<\/tr>\n\n<tr>\n<td align=right>7<\/td>\n<td align=left>alttext.bsky.social<\/td>\n<td align=right>178<\/td>\n<\/tr>\n\n<tr>\n<td align=right>8<\/td>\n<td align=left>washingtonpost.com<\/td>\n<td align=right>168<\/td>\n<\/tr>\n\n<tr>\n<td align=right>9<\/td>\n<td align=left>profmusgrave.bsky.social<\/td>\n<td align=right>158<\/td>\n<\/tr>\n\n<tr>\n<td align=right>10<\/td>\n<td align=left>scalzi.com<\/td>\n<td align=right>150<\/td>\n<\/tr>\n\n<tr>\n<td align=right>11<\/td>\n<td align=left>deck.blue<\/td>\n<td align=right>126<\/td>\n<\/tr>\n\n<tr>\n<td align=right>12<\/td>\n<td align=left>danirabaiotti.bsky.social<\/td>\n<td align=right>119<\/td>\n<\/tr>\n\n<tr>\n<td align=right>13<\/td>\n<td align=left>support.bsky.team<\/td>\n<td align=right>108<\/td>\n<\/tr>\n\n<tr>\n<td align=right>14<\/td>\n<td align=left>jun.de<\/td>\n<td align=right>106<\/td>\n<\/tr>\n\n<tr>\n<td align=right>15<\/td>\n<td align=left>faineg.bsky.social<\/td>\n<td align=right>102<\/td>\n<\/tr>\n\n<tr>\n<td 
align=right>16<\/td>\n<td align=left>graysky.app<\/td>\n<td align=right>100<\/td>\n<\/tr>\n\n<tr>\n<td align=right>17<\/td>\n<td align=left>why.bsky.team<\/td>\n<td align=right>97<\/td>\n<\/tr>\n\n<tr>\n<td align=right>18<\/td>\n<td align=left>chuckwendig.bsky.social<\/td>\n<td align=right>90<\/td>\n<\/tr>\n\n<tr>\n<td align=right>19<\/td>\n<td align=left>jbouie.bsky.social<\/td>\n<td align=right>89<\/td>\n<\/tr>\n\n<tr>\n<td align=right>20<\/td>\n<td align=left>aoc.bsky.social<\/td>\n<td align=right>87<\/td>\n<\/tr>\n\n<tr>\n<td align=right>21<\/td>\n<td align=left>scoremywordle.bsky.social<\/td>\n<td align=right>87<\/td>\n<\/tr>\n\n<tr>\n<td align=right>22<\/td>\n<td align=left>furryli.st<\/td>\n<td align=right>87<\/td>\n<\/tr>\n\n<tr>\n<td align=right>23<\/td>\n<td align=left>jaketapper.bsky.social<\/td>\n<td align=right>86<\/td>\n<\/tr>\n\n<tr>\n<td align=right>24<\/td>\n<td align=left>emily.bsky.team<\/td>\n<td align=right>80<\/td>\n<\/tr>\n\n<tr>\n<td align=right>25<\/td>\n<td align=left>berduck.deepfates.com<\/td>\n<td align=right>79<\/td>\n<\/tr>\n\n<tr>\n<td align=right>26<\/td>\n<td align=left>tkingfisher.bsky.social<\/td>\n<td align=right>76<\/td>\n<\/tr>\n\n<tr>\n<td align=right>27<\/td>\n<td align=left>estrogenempress.gay<\/td>\n<td align=right>76<\/td>\n<\/tr>\n\n<tr>\n<td align=right>28<\/td>\n<td align=left>carlomasala1.bsky.social<\/td>\n<td align=right>75<\/td>\n<\/tr>\n\n<tr>\n<td align=right>29<\/td>\n<td align=left>gailsimone.bsky.social<\/td>\n<td align=right>74<\/td>\n<\/tr>\n\n<tr>\n<td align=right>30<\/td>\n<td align=left>mmasnick.bsky.social<\/td>\n<td align=right>74<\/td>\n<\/tr>\n\n<tr>\n<td align=right>31<\/td>\n<td align=left>fotovorschlag.bsky.social<\/td>\n<td align=right>72<\/td>\n<\/tr>\n\n<tr>\n<td align=right>32<\/td>\n<td align=left>kenwhite.bsky.social<\/td>\n<td align=right>71<\/td>\n<\/tr>\n\n<tr>\n<td align=right>33<\/td>\n<td align=left>rauchway.bsky.social<\/td>\n<td align=right>68<\/td>\n<\/tr>\n\n<tr>\n<td 
align=right>34<\/td>\n<td align=left>thoughtbubbleuk.bsky.social<\/td>\n<td align=right>65<\/td>\n<\/tr>\n\n<tr>\n<td align=right>35<\/td>\n<td align=left>jamesgunn.bsky.social<\/td>\n<td align=right>64<\/td>\n<\/tr>\n\n<tr>\n<td align=right>36<\/td>\n<td align=left>taz.de<\/td>\n<td align=right>64<\/td>\n<\/tr>\n\n<tr>\n<td align=right>37<\/td>\n<td align=left>premeemohamed.com<\/td>\n<td align=right>62<\/td>\n<\/tr>\n\n<tr>\n<td align=right>38<\/td>\n<td align=left>jack.bsky.social<\/td>\n<td align=right>60<\/td>\n<\/tr>\n\n<tr>\n<td align=right>39<\/td>\n<td align=left>tagesspiegel.de<\/td>\n<td align=right>60<\/td>\n<\/tr>\n\n<tr>\n<td align=right>40<\/td>\n<td align=left>aliafonzy.bsky.social<\/td>\n<td align=right>59<\/td>\n<\/tr>\n\n<tr>\n<td align=right>41<\/td>\n<td align=left>jaz.bsky.social<\/td>\n<td align=right>58<\/td>\n<\/tr>\n\n<tr>\n<td align=right>42<\/td>\n<td align=left>roterpander.bsky.social<\/td>\n<td align=right>54<\/td>\n<\/tr>\n\n<tr>\n<td align=right>43<\/td>\n<td align=left>zeit.de<\/td>\n<td align=right>54<\/td>\n<\/tr>\n\n<tr>\n<td align=right>44<\/td>\n<td align=left>financialtimes.com<\/td>\n<td align=right>54<\/td>\n<\/tr>\n\n<tr>\n<td align=right>45<\/td>\n<td align=left>attackerman.bsky.social<\/td>\n<td align=right>54<\/td>\n<\/tr>\n\n<tr>\n<td align=right>46<\/td>\n<td align=left>zeitonline.bsky.social<\/td>\n<td align=right>53<\/td>\n<\/tr>\n\n<tr>\n<td align=right>47<\/td>\n<td align=left>moinbot.bsky.social<\/td>\n<td align=right>53<\/td>\n<\/tr>\n\n<tr>\n<td align=right>48<\/td>\n<td align=left>briankeene.bsky.social<\/td>\n<td align=right>52<\/td>\n<\/tr>\n\n<tr>\n<td align=right>49<\/td>\n<td align=left>realkachelmann.bsky.social<\/td>\n<td align=right>52<\/td>\n<\/tr>\n\n<tr>\n<td align=right>50<\/td>\n<td align=left>janboehm.bsky.social<\/td>\n<td align=right>51<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n\n<p>\nI think Neil Gaiman is a nice guy, and a great writer, but why is he the biggest thing in Bluesky? 
He seems very engaged there, though. I recognise a lot of the other accounts as people I used to see in the former place. I have to say, it seems a bit dull!\n<\/p>\n\n<p>\nThis isn&#8217;t good sampling, but checking 10k accounts in a population of 2 million should be starting to get good coverage. I wouldn&#8217;t be surprised if I were missing whole language-defined zones, though (there is some DE stuff, but very little East Asian).\n<\/p>\n\n<p>\nKuba Suder <a rel=\"mention\" class=\"u-url mention\" href=\"https:\/\/martianbase.net\/@mackuba\">@<span>mackuba<\/span><\/a> saw me tooting about it on Mastodon, and has access to a 30-day Bluesky firehose. He ran a similar query, but without the network sampling issues. His results are different:\n<\/p>\n\n<p>\n<a href=\"https:\/\/mastodon.social\/@mackuba@martianbase.net\/111530599484964224\">https:\/\/mastodon.social\/@mackuba@martianbase.net\/111530599484964224<\/a>\n<\/p>\n\n<p>\nHis data has a lot more very highly mentioned bots, and some users are @-ed much more than in my data. However, a lot of the key figures are in similar positions. Well, if we exclude the bots, there is at least a correlation!\n<\/p>\n\n\n<div id=\"org2afa71a\" class=\"figure\">\n<p><img decoding=\"async\" src=\"https:\/\/teaching.sociology.ul.ie\/bhalpin\/kubax.png\" alt=\"Scatterplot of Kuba's and my data\" \/>\n<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;m curious about Bluesky, and have accidentally found I can access part of its API. In particular, given a user&#8217;s handle, I can download their skytes, their posts. 
I got tipped off here: https:\/\/mastodon.social\/@badlogic@mastodon.gamedev.place\/111246801418649849<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[31],"_links":{"self":[{"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/803"}],"collection":[{"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=803"}],"version-history":[{"count":8,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/803\/revisions"}],"predecessor-version":[{"id":816,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/803\/revisions\/816"}],"wp:attachment":[{"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=803"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=803"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/teaching.sociology.ul.ie\/bhalpin\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=803"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}