Wikipedia:Counter-Vandalism Unit/Vandalism studies: Difference between revisions

Content deleted Content added
Published: fix link
m Members: add Superb Owl
 
(21 intermediate revisions by 19 users not shown)
Line 1:
{{Short description|Subproject of CVU that researches vandalism}}
{{WikiProject status|Inactive|taskforce=yes}}
[[File:CVU_Vandalism_Studies.svg|right|210 px]]
{{cquote|<big>'''Vandalism is any addition, removal, or change of content made in a deliberate attempt to compromise the integrity of Wikipedia.'''</big>|author=[[Wikipedia:Vandalism]]}}
Line 14 ⟶ 15:
! Study !! Status
|-
| '''Study 3''' ([[Wikipedia talk:Counter-Vandalism Unit/Vandalism studies/Study 3|suggest ideas]])|| {{discussingnotdone2}}, but planned for November
|-
 
Line 31 ⟶ 32:
 
{{collapse top|List of members}}
# {{User|JohnLaurensAnthonyRamos333}} – happy to help!
# {{User|Theopolisme}} – CVUA Co-coordinator
# {{User|Chip123456}} – CVUA instructor
Line 109 ⟶ 111:
# {{User|SuperGhostPrimus}} I would like to help.
#[[User:JaneciaTaylor|JaneciaTaylor]] ([[User talk:JaneciaTaylor|talk]]) I would like to help.
# {{User|Asension}} Glad to help.
# {{User|ItzJustLucky}} Planning to use [[WP:Twinkle|twinkle]] on the way.
# {{User|MrAgentSochi}} We love Anti Vandals
# {{User|Suriname0}} academic researcher and interested in subtle vandalism.
# {{User|Pink Saffron}} Interested in Vandalism studies
# {{User|Drewthescorpio}} If this stops vandals, I'm in.
# {{User|Superb Owl}}
<!-- NEW and RETURNING USERS: ADD YOUR NAME ABOVE -->
 
Line 134 ⟶ 143:
*Who is responsible for vandalism? What do vandals want? What are the demographics of the vandal population?
*What proportion of vandals are on dynamic [[IP address]]es, and hence very hard to block?
*Are IP edits ever responsible tofor improving a featured article while on the Main Page? (See also essay [[Wikipedia:IPs are human too|IPs are human too]].)
*What motivates people to vandalize articles? How can we minimize the satisfaction they get from doing it? (See: [[Wikipedia:The motivation of a vandal|The motivation of a vandal]])
*Do vandals just choose another article to edit instead if an article is semi-protected? How can we test this?
Line 140 ⟶ 149:
*What types of vandalism are there? What message are they trying to get across? Why do vandals not fully realise that their actions are futile?
*What sort of financial gains can be made from using Wikipedia to advertise – are spammers just wasting their time, or can it actually be profitable? Are our anti-spam measures adequate?
*What is the overall contribution from schools and universities? Are they worth having? Do universities contribute less vandalism than schools, or are all ages equally immature?
*How does the rate of vandalism vary throughout the day?
*Would there still be problems with vandalism if unregistered editing werewas blocked? How can we test this hypothesis? Certain categories could be experimentally altered to block unregistered editors, but then vandals could just choose an article that wasn't protected. We would have to block all IP editing, which would certainly be controversial, even just to gather a small sample of data. The blocks would also have to allow ''newly registered'' users to edit, otherwise, there wouldn't be time to create an account and then wait 4 days. Perhaps we could use a comparative method by doing the experiments on another wiki instead?
*Quantitatively, how are levels of vandalism affected (both in terms of percentage of edits and number of edits) when there is external attention draw to an article (e.g. [[Slashdot]] or [[The Colbert Report]]). Do levels of vandalism return to normal (e.g. in [[elephant]]) in all cases? How quickly?
*How much of vandalism is self-reverted?
Line 218 ⟶ 227:
===Published===
<!--organized by relevance-->
* {{cite pressweb release|last1=Carter |first1=Jacobi |title=UClueBot ofand MVandalism researchers reveal new findings abouton Wikipedia authorship|url=http://www.acm.uiuc.edu/~carter11/ClueBot.pdf and vandalism|publisheraccessdate=University of Minnesota5 October Department of Computer Science and Engineering2020 |date=2007-11-062 June 2010|archive-url=https://web.archive.org/web/2012092020001220100602050925/http://www1www.umnacm.uiuc.edu/news~carter11/news-releases/2007/UR_RELEASE_MIG_4284ClueBot.htmlpdf |archive-date=2010-06-02 }}
* {{cite press release |title=U of M researchers reveal new findings about Wikipedia authorship and vandalism|publisher=University of Minnesota – Department of Computer Science and Engineering |date=2007-11-06|url=http://www1.umn.edu/news/news-releases/2007/UR_RELEASE_MIG_4284.html|archive-url=https://web.archive.org/web/20120920200012/http://www1.umn.edu/news/news-releases/2007/UR_RELEASE_MIG_4284.html |archive-date=2012-09-20 }}
* {{cite web |url= http://www.chato.cl/papers/buriol_2006_temporal_analysis_wikigraph.pdf|title=Temporal Analysis of the Wikigraph|author=Buriol, Luciana S. |author2=Carlos Castillo |author3=Debora Donato |author4=Stefano Leonardi |author5=Stefano Millozzi|date= 2006|publisher=Sapienza University of Rome}}
* {{cite conference|chapter-url=https://dl.acm.org/doi/10.1145/1316624.1316663|titlechapter=Creating, Destroying, and Restoring Value in Wikipedia|author=GroupLens Research|title=Proceedings of the 2007 international ACM conference on Conference on supporting group work - GROUP '07 |date=November 4–7, 2007|page=259 |publisher= University of Minnesota – Department of Computer Science and Engineering|location=Sanibel Island, Florida, USA|doi=10.1145/1316624.1316663 |isbn=9781595938459 }}
* {{cite web|author=MIT Media Lab |author2=IBM Research|url=http://alumni.media.mit.edu/~fviegas/papers/history_flow.pdf|title=Studying Cooperation and Conflict between Authors with history flow Visualizations|date=April 24–29, 2004|publisher=Massachusetts Institute of Technology|location=Vienna}}
* {{cite web|last=Moore|first=Rick|title=New information on Wikipedia|date=2007-11-16|url=http://www1.umn.edu/news/features/2007/UR_160965_REGION1.html|publisher=University of Minnesota}}
* {{cite web|last=Smets|first=Koen |author2=Bart Goethals |author3=Brigitte Verdonk|title=Automatic Vandalism Detection in Wikipedia: Towards a Machine Learning Approach|url=http://win.ua.ac.be/~adrem/bibrem/pubs/WikiAI08.pdf|date=2008|publisher=University of Antwerp – Department of Mathematics and Computer Science}}
* {{cite web |url= http://www.aclweb.org/anthology/C/C10/C10-1129.pdf|title=Got You!: Automatic Vandalism Detection in Wikipedia with Web-based Shallow Syntactic-Semantic Modeling|author=Wang, William Yang |author2=McKeown, Kathleen R.|date= 2010|publisher=the 23rd International Conference on Computational Linguistics}}
* {{cite webarXiv|last=Belani|first=Amit|title=Vandalism Detection in Wikipedia: a Bag-of-Words Classifier Approach|urleprint=http://arxiv.org/ftp/arxiv/papers/1001/1001.0700.pdf|date=2009-11-11|workclass=arXiv|publisher=Cornellcs.LG University}}
* {{cite webbook|last=West|first=Andrew G. |author2=Sampath Kannan |author3=Insup Lee|title=Detecting Wikipedia Vandalism via Spatio-Temporal Analysis of Revision Metadata|chapter=Detecting Wikipedia vandalism via spatio-temporal analysis of revision metadata? |chapter-url=http://repository.upenn.edu/cis_papers/428/|date=2010|pages=22–28 |doi=10.1145/1752046.1752050 |isbn=9781450300599 |s2cid=215753727 }}
* {{cite webbook|last=Adler|first=B. Thomas |author2=Luca de Alfaro |author3=Santiago Mola-Velasco |author4=Paolo Rosso |author5=Andrew G. West|title=Computational Linguistics and Intelligent Text Processing |chapter=Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features|series=Lecture Notes in Computer Science |chapter-url=httphttps://repositoryhdl.upennhandle.edunet/cis_papershandle/45710251/36621|date=2011|volume=6609 |pages=277–288 |doi=10.1007/978-3-642-19437-5_23 |hdl=10251/36621 |isbn=978-3-642-19436-8 }}
* {{cite webjournal|last=West|first=Andrew G. |author2=Insup Lee|title=Multilingual Vandalism Detection using Language-Independent & Ex Post Facto Evidence|journal=Pan-Clef '11: Notebook Papers on Uncovering Plagiarism, Authorship, and Social Software Misuse |url=http://repository.upenn.edu/cis_papers/479/|date=2011}}
 
{{Col-break}}
Line 246 ⟶ 256:
* [https://webis.de/data/pan-wvc-10 PAN-WVC-10]
* [https://webis.de/data/pan-wvc-11.html PAN-WVC-11]
* {{cite web|title=Wikipedia Vandal Study – US Senate: Oct 1 – Dec 31, 2007|date=January 2008|url=https://spreadsheets.google.com/pub?key=psAWteTSyixEB98YcV-5VEw|publisher=Google Docs|archive-url=https://web.archive.org/web/20081220010821/https://spreadsheets.google.com/pub?key=psAWteTSyixEB98YcV-5VEw |publisherarchive-date=Google2008-12-20 Docs}}
 
</br>