
"This is a list of writing and formatting conventions typical of AI chatbots such as ChatGPT, with real examples taken from Wikipedia articles and drafts. It is meant to act as a field guide to help detect undisclosed AI-generated content on Wikipedia. This list is descriptive, not prescriptive; it consists of observations, not rules. Advice about formatting or language to avoid in Wikipedia articles can be found in the policies and guidelines and the Manual of Style, but does not belong on this page."
"This list is not a ban on certain words, phrases, or punctuation. No one is taking your em-dashes away or claiming that only LLMs use them. Not all text featuring the following indicators is AI-generated, as the large language models that power AI chatbots are trained on human writing, including the writing of Wikipedia editors. This is simply a catalog of very common patterns observed over many thousands of instances of AI-generated text, specific to Wikipedia."
"The patterns here are also only potential signs of a problem, not the problem itself. While many of these issues are immediately obvious and easy to fix-e.g., excessive boldface, poor wordsmithing, broken markup, citation style quirks-they can point to less outwardly visible problems that carry much more serious policy risks. If LLM-generated text is polished enough (initially or subsequently tidied up), those surface defects might not be present, but the deeper problems likely will."
Large language models often produce recognizable writing and formatting conventions on Wikipedia. Observations identify common patterns such as excessive boldface, broken markup, citation style quirks, and certain punctuation use. Those patterns are descriptive and not prescriptive; they are not a ban on particular words or punctuation. Similar indicators can appear in human writing because models are trained on human-produced content. Many signs are potential indicators rather than conclusive proof of AI authorship. Surface defects can be easy to fix, but polished AI-generated content may conceal deeper policy risks. Suspected AI-generated content should be addressed or flagged following handling guidance.
Read at Wikipedia
Unable to calculate read time
Collection
[
|
...
]