Artificial intelligence is used on a number of Wikipedia and Wikimedia projects. This may be directly involved with creation of text content, or in support roles related to evaluating article quality, adding metadata, or generating images. As with any machine-generated content, care must be used when employing AI at scale or in applying it where the community consensus is to exercise more caution.
When exploring AI
techniques and systems, the community consensus is to prefer human decisions
over machine-generated outcomes until the implications are better understood.
Applications
AI-related efforts on
Wikipedia include but are not limited to:
Revision scoring
The Objective Revision
Evaluation Service (ORES) was started in 2015 as a project of the Wikimedia
Foundation, and provides a revision score against machine
learning models that have been trained in order to report article quality or
vandalism. This is used in tools such as CluebotNG to help
immediately revert vandalism, or in evaluation tools like the Program and
Events Dashboard to measure the outcomes of classwork, edit-a-thons, or
organized editing campaigns.
Text translation
Guidance can be found
at Help:Translation#English Wikipedia policy requirements. There is
a Content Translation Tool used across Wikimedia projects that can
use the output of machine translation from one Wikipedia article to another,
using services like Google Translate. However, on the English Wikipedia, it
currently states that "machine translation is disabled for all users and
this tool is limited to extended confirmed editors." As a result, only
manual translation on the English Wikipedia is supported by the tool, though
some users have used translation to Simple English as a workaround. Relatedly,
there is a section of the Help:Translation page with the broad advice:
"avoid machine translations." However, this guidance was last edited
in 2016, and the state of the art for machine translation has advanced
significantly since then, meriting a re-examination of that advice.
Article text generation
Main page: Wikipedia:Large
language models, a draft proposal for a Wikipedia guideline on the use of
language models
The explosion of
interest in ChatGPT in 2022 has led to increased curiosity in using
generative AI to help compose Wikipedia articles. The status of
machine-generated text from tools such as ChatGPT is generally accepted to be
public domain, so the copyright issues are not a blocker to using the generated
text from a legal standpoint. These issues are generally governed by Help:Adding
open license text to Wikipedia#Converting and adding open license text to
Wikipedia, which advises to make sure content is adjusted for style and that
reliable sources are used. Conversations on the Village Pump and in
some test articles (i.e. Artwork title) have noted positive aspects of
machine generated text, but a serious warning that content must be checked for
facts and accuracy and never used straight from ChatGPT.
A good general page
looking at the issues can be found at: Wikipedia:Using neural network
language models on Wikipedia. Some user experiences can be found here:
- Talk:Artwork title
- User:JPxG/LLM demonstration
- User:Fuzheado/ChatGPT
- User:DraconicDark/ChatGPT
- User:BrokenSegue - Wikidata:Wwwyzzerdd and Psychiq Wikidata
game that uses distilBERT and ML, analyzing Wikipedia categories.
Other
]
Image metadata – There have been efforts from GLAM
institutions and at the WMF to help supplement image keyword data with machine
learning efforts. Among them include:
- Combining AI and Human Judgment to
Build Knowledge about Art on a Global Scale March 4, 2019, https://www.metmuseum.org/blogs/now-at-the-met/2019/wikipedia-art-and-ai
Image generation
- c:Commons:AI generated media
Link for this
discussion: https://en.wikipedia.org/wiki/Wikipedia:Artificial_intelligence
No comments:
Post a Comment