Name: Featured Student Research Lightning Talks: Cyber Harassment Detection and Prediction of User Blocks in Wikipedia
Start: 2019-04-11T10:30:00-0400
End: 2019-04-11T11:05:00-0400

THE BIG FESTIVAL ABOUT SMALL CITIES
Tom Tom champions civic innovation, creativity, and entrepreneurship in America’s hometowns.


Featured Student Research Lightning Talks: Cyber Harassment Detection and Prediction of User Blocks in Wikipedia


The advent of the Internet can easily be heralded as one of the key events that led to the “Information Age,” as it is colloquially known. The sharing of thoughts, ideas and opinions reached new heights when people were able to engage in meaningful debates through online forums. However, a darker aspect of this medium, online harassment, has become rampant in these communities. The Wikipedia user community is no stranger to this phenomenon. As of January 2019, Wikipedia has 35 million users, and on average 250k users register every month. According to the Wikipedia Community Engagement Insights 2018 report, 68% of respondents reported having experienced harassment at some point in the past, and as a result about 22% of Wikipedians reported a decrease in their contribution levels. To combat harassment, Wikipedia currently has an organic, human-driven process in place, in which reported cases of abuse are evaluated and acted upon by Wikipedia administrators. Relying on human evaluation works in some ways, but it is not a solution that scales with the growth of Wikipedia: there were ~170k user blocks in 2018 alone.
Our goal is to develop a data-driven approach to combating cyber harassment that addresses a variety of issues faced by the human-driven process, from errors and bias in human judgement to efficiently evaluating a larger volume of cases. By analyzing user activity in the form of editing behavior and discussions, we will be able to predict which users are at risk of getting blocked in the future.

You need this ticket from Eventbrite to sign up: Applied Machine Learning Conference.

Speakers

Charu Rawat

Graduate Student, University of Virginia

Charu is currently a graduate student at UVA pursuing her master's in Data Science at the Data Science Institute. Prior to this, she earned a bachelor's degree in Mathematics and worked at The D.E. Shaw Group for 3 years, leveraging alternative data for investment decisions. Her research...

Arnab Sarkar

Student, University of Virginia, Data Science Institute

Budding Data Scientist, with more than 3 years of experience in Oracle PL/SQL, Oracle E-Business Suite ERP platform, Oracle Demantra and Talend Data Integration tool. Currently pursuing a Master's degree in the field of Data Science and looking to expand my knowledge in both the statistical...

Sameer Singh

Graduate Student, Data Science Institute, University of Virginia

Sameer Singh is currently pursuing an M.S. in Data Science at the University of Virginia. He has considerable work experience in data analytics and consulting. At ZS Associates, he provided analytics-based sales and marketing solutions such as promotion response modeling, marketing mix...
