Babel Street: The Cacophony of Social Media is the New Newsroom
Natural language processing and sentiment analysis are enabling computers to understand our social media posts. What are the implications for public safety and user privacy?
You may think that your friends and family are the only audience liking and sharing your social media posts, but data analytics company Babel Street has made it their business to leverage open source data for event detection. Using powerful algorithms, Babel Street’s software is parsing through our day-to-day social media conversations and extracting relevant insights.
A Computer That Really Gets Me
Babel Street is a data analytics company that provides software as a service (SaaS) products to users in intelligence and law enforcement. Their technology is able to consolidate and interpret data from millions of publicly available sources including social media and news.[1] Babel Street’s customer promise is to transform this unstructured, multilingual data into actionable insights in real-time. The company is using machine learning to organize the terabytes of noisy data published on the internet each day and extract signals that can be used by public safety teams to react quickly to – or even preempt – security threats.[2]
In order for their software to make sense of all this data, Babel Street invests heavily in natural language processing (NLP), a form of artificial intelligence that helps computers understand human language.[3] The sophistication required by Babel Street’s natural language processing algorithms is much greater than the industry standard for two reasons. First, Babel Street is using data sourced from communication platforms like social networking websites and forums, which are characterized by complex, intertwined conversations. Second, while the field of natural language processing has made significant strides in analyzing the semantic dimension of human discourse, there are still gaps in the ability to evaluate contextual nuances.[4] Babel Street aims to close that gap and boasts to “possess the most sophisticated sentiment analysis tool on the market.”[5] Sentiment analysis enables computers to interpret the meaning behind text and understand the user’s attitude, emotions, and intentions.[6]
Machine learning is important to Babel Street because artificial intelligence drives the core event detection capability of the company’s software. Humans are not able to process data with the same speed and volume that computers can handle, and for Babel Street’s customers, speed and accuracy are critical. For example, Babel Street was the first source to detect the publication of the thirteenth issue of Dabiq, ISIS’s official magazine, when it was posted by propaganda accounts on Twitter.[7]
The results of machine learning algorithms also improve over time as they are given more training data.[8] Some of Babel Street’s customers have even started using the software to predict events based on trends and anomalies in the data. Counterterrorism units in the Pentagon, for example, use Babel Street to track terrorist conversations on social media to predict attacks.[9]
Next Stop, Emojis
In the short-term, Babel Street is adapting its natural language processing capabilities in response to evolving trends in internet communication. For example, criminals are increasingly using emojis to communicate in order to circumvent text analysis.[10] In the medium-term, the company is planning to expand its market to other industries. Reputational risk has become increasingly more difficult for companies to manage as customers air their grievances publicly on Twitter and Facebook. Babel Street plans to expand its enterprise solution and offer software products for brand management.[11] With their sentiment analysis capabilities, Babel Street can offer corporate clients the ability to gauge positive or negative customer sentiment toward brands.
In order for Babel Street to remain competitive in the evolving internet communication landscape, they will need to invest in image analysis as a complement to natural language processing. With the increased use of image sharing platforms like Instagram, events will be chronicled through photos and videos more frequently. Babel Street will need to use image analysis to identify significant events in media posts when there is no text present.
Helpful Or Harmful?
While the public good delivered by Babel Street’s software is undeniable, the company operates in an industry where privacy concerns around user data have made recent headlines. An important question to raise is, where do we draw the line when making the tradeoff between public safety and user privacy? In 2016, Babel Street’s competitor, Geofeedia, was heavily criticized by the ACLU for providing a tool that allowed Chicago police to surveil protests by searching for public social media posts by location[12]. The ACLU argued that Geofeedia failed to protect free speech rights, especially for activists of color. However, a counterargument could be made that public social media posts are fair game and that the onus should be on users to be educated about the implications of sharing information publicly. As machine learning becomes more advanced, it becomes more important for us as savvy internet users to be aware of how we share our data.
(Word Count: 776)
[1] Babelstreet.com. (2018). [online] Available at: https://www.babelstreet.com/ [Accessed 13 Nov. 2018].
[2] Gregg, A. (2017). For this company, online surveillance leads to profit in Washington’s suburbs. [online] The Washington Post. Available at: https://www.washingtonpost.com/business/economy/for-this-company-online-surveillance-leads-to-profit-in-washingtons-suburbs/2017/09/08/6067c924-9409-11e7-89fa-bb822a46da5b_story.html?noredirect=on&utm_term=.13d83f405bc3 [Accessed 13 Nov. 2018].
[3] Mills, T. (2018). What Is Natural Language Processing And What Is It Used For?. [online] Forbes. Available at: https://www.forbes.com/sites/forbestechcouncil/2018/07/02/what-is-natural-language-processing-and-what-is-it-used-for/#6aee142a5d71 [Accessed 13 Nov. 2018].
[4] Abbas, Ahmed, Yilu Zhou, Shasha Deng, and Pengzhu Zhang. 2018. “Text Analytics To Support Sense-Making In Social Media: A Language-Action Perspective”. MIS Quarterly42 (2): 427-464. doi:10.25300/misq/2018/13239.
[5] “Meet Babel Street, The Powerful Social Media Surveillance Used By Police, Secret Service, And Sports Stadiums”. 2018. Motherboard. https://motherboard.vice.com/en_us/article/gv7g3m/meet-babel-street-the-powerful-social-media-surveillance-used-by-police-secret-service-and-sports-stadiums.
[6] Socher, Richard. “AI’s Next Great Challenge: Understanding the Nuances of Language.” Harvard Business Review Digital Articles, July 2018, pp. 2–4. EBSCOhost, ezp-prod1.hul.harvard.edu/login?url=http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=130975191&site=ehost-live&scope=site.
[7] “Meet Babel Street, The Powerful Social Media Surveillance Used By Police, Secret Service, And Sports Stadiums”. 2018. Motherboard. https://motherboard.vice.com/en_us/article/gv7g3m/meet-babel-street-the-powerful-social-media-surveillance-used-by-police-secret-service-and-sports-stadiums.
[8] Brynjolfsson, Erik, and Andrew McAfee. “WHAT’S DRIVING THE MACHINE LEARNING EXPLOSION? Three Factors Make This AI’s Moment.” Harvard Business Review Digital Articles, July 2017, pp. 12–13. EBSCOhost, ezp-prod1.hul.harvard.edu/login?url=http://search.ebscohost.com/login.aspx?direct=true&db=bth&AN=124641872&site=ehost-live&scope=site.
[9] Gregg, A. (2017). For this company, online surveillance leads to profit in Washington’s suburbs. [online] The Washington Post. Available at: https://www.washingtonpost.com/business/economy/for-this-company-online-surveillance-leads-to-profit-in-washingtons-suburbs/2017/09/08/6067c924-9409-11e7-89fa-bb822a46da5b_story.html?noredirect=on&utm_term=.13d83f405bc3 [Accessed 13 Nov. 2018].
[10] Ibid.
[11] Ibid.
[12] “Police Use Surveillance Tool To Scan Social Media, A.C.L.U. Says”. 2018. Nytimes.Com. https://www.nytimes.com/2016/10/12/technology/aclu-facebook-twitter-instagram-geofeedia.html.
I thought the content in this piece was accurate and thoughtful. I have used Babel Street technology before and can attest to the speed, efficiency, and general common good that it provides. So, thank you for the holistic view point. The last question you raised is critical and has been a central debate in policy, dating back to 2003 and the release of the Patriot Act. U.S. policy has simply not caught up with technology yet. As AI advances, the federal government’s response lags behind. Regardless, I believe it won’t be until some type of standard (law/policy) is set that this fundamental, and arguably philosophical, question can be answered.
What I found particularly compelling about your piece was your underlying point regarding speed at the tradeoff of human judgement. While at first glance this may seem enticing, I am hesitant to believe it will be albeit of profiling. I wonder in what ways you can restructure the learning to only raise topics of a certain confidence level – much like Watson. To that point, I wonder at what threshold of crime, crisis, or general situation, utilizing this technology makes sense and does not infringe on citizen’s rights.