The Low-Down: How Wikipedia Uses AI To Reduce Abuse, Improve Service

Aug 20, 2018

How Wikipedia Uses AI To Reduce Abuse, Improve Service

As trolls, fake news and inappropriate content multiply - sometimes exponentially - Wikipedia is using AI and machine learning to lessen the burden on human editors while also identifying the sources of hostile comments, making for a less toxic cultural environment.

At the same time, new uses are being explored for technological contributions and editing, leading to a more up-to-date and accurate system. JL

Bernard Marr reports Forbes:

Toxicity was so bad that active contributors had fallen 40%. 10% of all attacks were made by just 34 users. Now that the algorithms have created more clarity, Wikipedia can figure out the best way to combat negativity. Although human moderation is still needed, algorithms can flag those that require human involvement. An editing system powered by an algorithm trained to score the quality of changes and edits can direct humans to review and determine the caliber of mistakes. AI can do "OK" writing Wikipedia articles, but text summarization is more difficult than thought.
The Wikipedia community, the free encyclopedia that is built from a model of openly editable content, is notorious for its toxicity. The issue was so bad that the number of active contributors or editors—those that made one edit per month—had fallen by 40 percent during an eight-year period. Even though there’s not one solution to combat this issue, Wikimedia Foundation, the nonprofit that supports Wikipedia, decided to use artificial intelligence to learn more about the problem and consider ways to combat it.
Collaboration with Wikimedia Foundation and Jigsaw to Stop Abusive Comments
In one effort to stop the trolls, Wikimedia Foundation partnered with Jigsaw (the tech incubator formerly known as Google Ideas) on a research project called Detox using machine learning to flag comments that might be personal attacks. This project is part of Jigsaw’s initiative to build open-source AI tools to help combat harassment on social media platforms and web forums.
The first step in the project was to train the machine learning algorithms using 100,000 toxic comments from Wikipedia Talk pages that had been identified by a 4,000-person human team where every comment had ten different human reviewers. This annotated dataset was one of the largest ever created that looked at online abuse. Not only did these include direct personal attacks, but also third-party and indirect personal attacks ("You are horrible." "Bob is horrible." "Sally said Bob is horrible.") After training, the machines could determine a comment was a personal attack just as well as three human moderators.
Then, the project team had the algorithm review 63 million English Wikipedia comments posted during a 14-year period between 2001 to 2015 to find patterns in the abusive comments. What they discovered was outlined in the Ex Machina: Personal Attacks Seen at Scale paper:

More than 80% of all comments characterized as abusive were made by more than 9,000 people who made less than five abusive comments in a year rather than an isolated group of trolls.

Nearly 10% of all attacks were made by just 34 users.

Anonymous users made up 34% of all comments left on Wikipedia.

More than half of the personal attacks are being carried out by registered users although anonymous users were six times more likely to launch personal attacks. (There are 20 times more registered users than anonymous users.)

Now that the algorithms have created more clarity about who is contributing to the community’s toxicity, Wikipedia can figure out the best way to combat the negativity. Although human moderation is likely still needed, algorithms can help sort through the comments and flag those that require human involvement.
Objective Revision Evaluation Service (ORES System)
Another reason for the significant decline in editors to Wikipedia is thought to be the organization’s complex bureaucracy as well as its harsh editing tactics. It was common for first-time contributors/editors to have an entire body of work wiped out with no explanation. One way they hope to fight this situation is with the ORES system, a machine that acts as an editing system powered by an algorithm trained to score the quality of changes and edits. Wikipedia editors used an online tool to label examples of past edits, and that was how the algorithm was taught the severity of errors. The ORES system can direct humans to review the most damaging edit and determine the caliber of mistakes—rookie mistakes are treated more appropriately as innocent.

AI to Write Wikipedia Articles
Well, AI can do "OK" writing Wikipedia articles, but you have to start somewhere, right? A team within Google Brain taught software to summarize info on web pages and write a Wikipedia-style article. It turns out text summarization is more difficult than most of us thought. Google Brain's efforts to get a machine to summarize content is slightly better than previous attempts, but there is still work to be done before a machine can write with the cadence and flair humans can. It turns out we're not quite ready to have a machine automatically generate Wikipedia entries, but there are efforts underway to get us there.
While the use cases for artificial intelligence in the operations of Wikipedia are still being optimized, machines can undoubtedly help the organization analyze the vast amount of data they generate daily. Better information and analysis can help Wikipedia create successful strategies to troubleshoot negativity from its community and recruitment issues for its contributors.

25 comments:

helen said...: The comprehensive research and attention to smash karts unblocked 76 detail evident in your exploration of this theme are commendable, as you leave no stone unturned in your pursuit of knowledge and understanding.; August 22, 2024 at 9:47 PM
Sprunki Incredibox said...: The combination of different music is so magical that I can't stop listening to it; October 5, 2024 at 12:06 AM
taotao said...: Crazy racing game polytrack, too difficult; October 5, 2024 at 12:06 AM
Perchance AI Story said...: Perchance AI Story is a website that supports multiple languages and can continue to write stories; October 5, 2024 at 12:07 AM
incredibox mustard said...: Explore Colorbox Mustard, a fan-made mod inspired by Incredibox. Create unique music and learn about this creative extension of the popular music creation game.; October 6, 2024 at 12:51 PM
Mochi 1 said...: Mochi 1 is a super cool new AI video generator that just dropped! You can literally type what you want and it makes videos for you - pretty wild, right?

What makes it special:
- Creates smooth 30fps videos up to 5.4 seconds long
- Totally free to try in their playground
- It's open-source (Apache 2.0 license), so you can use it for whatever you want
- Works really well with realistic-style videos; October 30, 2024 at 10:31 AM
khanv said...: Sprunki Dandys World is a quirky adventure game thatll make you smile. Join Dandy on his journey through colorful worlds filled with fun challenges and mini-games.; October 30, 2024 at 10:33 AM
khanv said...: Sprunki Phase lets you play with cool visual effects and animations. Just move your mouse around and watch the magic happen. Perfect for when you need to relax or want to get creative.; October 30, 2024 at 10:33 AM
Corruptbox said...: If you seek astronomical replay value, then Corruptbox undeniably revolutionizes the way you engage with music in a manner that can only be described as quintessentially extraordinary.; January 3, 2025 at 5:47 AM
Sprunki said...: Wow, Sprunki is super cool with its awesome music tech, I cant stop tapping my fingers to the beat!; January 10, 2025 at 3:33 AM
Veo 2 said...: One can only marvel at how Veo 2 brings imagination to life with breathtaking generated videos, proving that creativity knows no bounds!; January 11, 2025 at 3:18 AM
khanv said...: Yo, when it comes to art, Im flexin with finesse, Janus Pro AI in the mix, spittin visuals that impress!; January 29, 2025 at 9:24 AM
khanv said...: OMG, DeepSea AI is like the cutest thing ever, I could totally chat with it 24/7 and never get bored!; January 30, 2025 at 4:34 AM
khanv said...: The artistic sophistication achieved by Janus Pro 7B is so remarkable that even my superior intellect is compelled to appreciate its unparalleled graphics and diverse styles.; January 31, 2025 at 5:11 AM
default1 said...: This comment has been removed by the author.; January 31, 2025 at 7:45 AM
Anonymous said...: Revolutionize Your Style with thevirtualtryon
Experience the future of fashion with our innovative technology.; January 31, 2025 at 7:46 AM
Anonymous said...: Ray2 | AI Video Generation from Text; January 31, 2025 at 7:50 AM
khanv said...: If you seek a transcendent experience where the unimaginable becomes reality, then the Image to Video AI will exceed your wildest expectations, delivering results that even a theoretical physicist like myself can only describe as astonishingly competent.; February 9, 2025 at 5:25 AM
Anonymous said...: Experience powerful language understanding and generation with Nemotron AI; March 13, 2025 at 1:10 AM
Anonymous said...: Discover unparalleled AI capabilities with Gemma 3 Leverage robust language models that operate seamlessly on conventional hardware, perfectly suited for your applications.; March 16, 2025 at 1:33 AM
khanv said...: This Schedule 1 Calculator is super cool because it uses lots of tests to find the best mix really fast!; April 22, 2025 at 3:31 AM
Lupe Stevens said...: The way you break things down is fantastic—it really shows how much you understand the subject and care about helping granny revamp others.; April 29, 2025 at 5:40 AM
hello said...: Great to see Wikipedia using AI to combat toxicity and support editors. This fosters a healthier, more accurate platform, benefiting all. Crucial tech use! Find more insights on ai drawing.; August 26, 2025 at 5:03 AM
Anonymous said...: I was surprised by how Sora 2 API managed to deliver such quality without breaking the bank.; October 27, 2025 at 3:50 AM
Anonymous said...: I tried the new image generation API, and the functionality is pretty impressive, especially with the Nano Banana API managing to create varied and detailed results.; October 30, 2025 at 3:43 AM

A Blog by Jonathan Low

Aug 20, 2018

How Wikipedia Uses AI To Reduce Abuse, Improve Service

25 comments:

Post a Comment

contact

Search This Blog

Blog Archive

Labels

links