{"id":28301,"date":"2017-11-25T13:10:35","date_gmt":"2017-11-25T18:10:35","guid":{"rendered":"http:\/\/olduvai.ca\/?p=28301"},"modified":"2017-11-25T13:10:35","modified_gmt":"2017-11-25T18:10:35","slug":"more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked","status":"publish","type":"post","link":"https:\/\/olduvai.ca\/?p=28301","title":{"rendered":"More than a Million Pro-Repeal Net Neutrality Comments were Likely\u00a0Faked"},"content":{"rendered":"<header class=\"container u-maxWidth740\">\n<div class=\"uiScale uiScale-ui--regular uiScale-caption--regular postMetaHeader u-paddingBottom10 row\">\n<div class=\"col u-size12of12 js-postMetaLockup\">\n<div class=\"uiScale uiScale-ui--regular uiScale-caption--regular postMetaLockup postMetaLockup--authorWithBio u-flexCenter js-postMetaLockup\"><\/div>\n<\/div>\n<\/div>\n<\/header>\n<div class=\"postArticle-content js-postField js-notesSource js-trackedPost\" data-post-id=\"e9f0e3ed36a6\" data-source=\"post_page\" data-collection-id=\"3a8144eabfe3\" data-tracking-context=\"postPage\" data-scroll=\"native\">\n<section class=\"section section--body section--first\">\n<div class=\"section-content\">\n<div class=\"section-inner sectionLayout--insetColumn\">\n<h3 id=\"7069\" class=\"graf graf--h3 graf--leading graf--title\"><a href=\"https:\/\/hackernoon.com\/more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked-e9f0e3ed36a6\">More than a Million Pro-Repeal Net Neutrality Comments were Likely\u00a0Faked<\/a><\/h3>\n<p id=\"5ddb\" class=\"graf graf--h4 graf-after--h3 graf--subtitle\"><strong>I used natural language processing techniques to analyze net neutrality comments submitted to the FCC from April-October 2017, and the results were disturbing.<\/strong><\/p>\n<figure id=\"8cd9\" class=\"graf graf--figure graf-after--h4\">\n<div class=\"aspectRatioPlaceholder is-locked\">\n<div class=\"progressiveMedia js-progressiveMedia graf-image is-canvasLoaded is-imageLoaded\" data-image-id=\"1*shWYIe0km5rYxPebfGPTTg.png\" data-width=\"1111\" data-height=\"603\" data-action=\"zoom\" data-action-value=\"1*shWYIe0km5rYxPebfGPTTg.png\" data-scroll=\"native\"><canvas class=\"progressiveMedia-canvas js-progressiveMedia-canvas\" width=\"75\" height=\"40\"><\/canvas><img decoding=\"async\" class=\"progressiveMedia-image js-progressiveMedia-image\" src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*shWYIe0km5rYxPebfGPTTg.png\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/1600\/1*shWYIe0km5rYxPebfGPTTg.png\" \/><\/div>\n<\/div><figcaption class=\"imageCaption\">Spot the fake comment. Surprise\u200a\u2014\u200athey\u2019re all\u00a0fake.<\/figcaption><\/figure>\n<p id=\"c1aa\" class=\"graf graf--p graf-after--figure\">NY Attorney General Schneiderman <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/medium.com\/@AGSchneiderman\/an-open-letter-to-the-fcc-b867a763850a\" target=\"_blank\" rel=\"noopener\" data-href=\"https:\/\/medium.com\/@AGSchneiderman\/an-open-letter-to-the-fcc-b867a763850a\">estimated that hundreds of thousands of Americans\u2019 identities were stolen<\/a> and used in spam campaigns that support repealing net neutrality. My research found at least 1.3 million fake pro-repeal comments, with suspicions about many more. In fact, the sum of fake pro-repeal comments in the proceeding may number in the millions. In this post, I will point out one particularly egregious spambot submission, make the case that there are likely many more pro-repeal spambots yet to be confirmed, and estimate the public position on net neutrality in the \u201corganic\u201d public submissions.\u00b9<\/p>\n<h3 id=\"18da\" class=\"graf graf--h3 graf-after--p\">Key Findings:\u00b2<\/h3>\n<ol class=\"postList\">\n<li id=\"046f\" class=\"graf graf--li graf-after--h3\">One pro-repeal spam campaign <strong class=\"markup--strong markup--li-strong\"><em class=\"markup--em markup--li-em\">used mail-merge to disguise 1.3 million comments<\/em><\/strong> as unique grassroots submissions.<\/li>\n<li id=\"483e\" class=\"graf graf--li graf-after--li\">There were likely multiple other campaigns aimed at injecting what may total <strong class=\"markup--strong markup--li-strong\"><em class=\"markup--em markup--li-em\">several million<\/em><\/strong> pro-repeal comments into the system.<\/li>\n<li id=\"10a1\" class=\"graf graf--li graf-after--li graf--trailing\">It\u2019s highly likely that <strong class=\"markup--strong markup--li-strong\"><em class=\"markup--em markup--li-em\">more than 99%<\/em><\/strong> of the truly unique comments\u00b3 were in favor of keeping net neutrality.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<section class=\"section section--body\">\n<div class=\"section-divider\">\n<hr class=\"section-divider\" \/>\n<\/div>\n<div class=\"section-content\">\n<div class=\"section-inner sectionLayout--insetColumn\">\n<h3 id=\"62b5\" class=\"graf graf--h3 graf--leading\">Breaking Down the Submissions<\/h3>\n<p id=\"6597\" class=\"graf graf--p graf-after--h3\">Given the <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/www.theverge.com\/2017\/5\/25\/15691564\/fcc-letter-anti-net-neutrality-spam-comments\" target=\"_blank\" rel=\"noopener\" data-href=\"https:\/\/www.theverge.com\/2017\/5\/25\/15691564\/fcc-letter-anti-net-neutrality-spam-comments\">well<\/a> <a class=\"markup--anchor markup--p-anchor\" href=\"https:\/\/arstechnica.com\/information-technology\/2017\/07\/fcc-has-no-documentation-of-ddos-attack-that-hit-net-neutrality-comments\/\" target=\"_blank\" rel=\"noopener\" data-href=\"https:\/\/arstechnica.com\/information-technology\/2017\/07\/fcc-has-no-documentation-of-ddos-attack-that-hit-net-neutrality-comments\/\">documented<\/a> irregularities throughout the comment submission process, it was clear from the start that the data was going to be duplicative and messy. If I wanted to do the analysis without having to set up the tools and infrastructure typically used for \u201cbig data,\u201d I needed to break down the 22M+ comments and 60GB+ worth of text data and metadata into smaller pieces.\u2074<\/p>\n<p id=\"da73\" class=\"graf graf--p graf-after--p\">Thus, I tallied up the many duplicate comments\u2075 and arrived at 2,955,182 unique comments and their respective duplicate counts. I then mapped each comment into semantic space vectors\u2076 and ran some clustering algorithms on the meaning of the comments.\u2077 This method identified nearly 150 clusters of comment submission texts of various sizes.\u2078<\/p>\n<p>&#8230;click on the above link to read the rest of the article&#8230;<\/p>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>More than a Million Pro-Repeal Net Neutrality Comments were Likely\u00a0Faked I used natural language processing techniques to analyze net neutrality comments submitted to the FCC from April-October 2017, and the results were disturbing. Spot the fake comment. Surprise\u200a\u2014\u200athey\u2019re all\u00a0fake. NY Attorney General Schneiderman estimated that hundreds of thousands of Americans\u2019 identities were stolen and used [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[6],"tags":[17176,12961,17175,14273,3828],"class_list":["post-28301","post","type-post","status-publish","format-standard","hentry","category-liberty","tag-faked-comments","tag-fcc","tag-jeff-kao","tag-medium","tag-net-neutrality"],"_links":{"self":[{"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/posts\/28301","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/olduvai.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=28301"}],"version-history":[{"count":1,"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/posts\/28301\/revisions"}],"predecessor-version":[{"id":28302,"href":"https:\/\/olduvai.ca\/index.php?rest_route=\/wp\/v2\/posts\/28301\/revisions\/28302"}],"wp:attachment":[{"href":"https:\/\/olduvai.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=28301"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/olduvai.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=28301"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/olduvai.ca\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=28301"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}