Wikidata

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.

Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, are able to use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, including its extension for semi-structured data, Wikibase. As of mid-2024, Wikidata had 1.57 billion item statements (semantic triples). Wikidata

260-470: A lexeme is a unit of lexical meaning representing a group of words that share the same core meaning and grammatical characteristics. Similarly, Wikidata's lexemes are items with a structure that makes them more suitable to store lexicographical data. Since 2016, Wikidata has supported lexicographical entries in the form of lexemes. In Wikidata, lexicographical entries have a different identifier from regular item entries. These entries are prefixed with

A property (such as "author", or "publication date") with one or more entity values (such as "Sir Arthur Conan Doyle" or "1902"). For example, the informal English statement "milk is white" would be encoded by a statement pairing the property color (P462) with the value white (Q23444) under the item milk (Q8495). Statements may map a property to more than one value. For example,

A taxonomy, or other forms of ad hoc content organization. Wiki implementations can provide one or more ways to categorize or tag pages to support the maintenance of such index pages, such as a backlink feature which displays all pages that link to a given page. Adding categories or tags to a page makes it easier for other users to find it. Most wikis allow page titles to be searched, and some offer full-text search of all stored content. Some wiki communities have established navigational networks between each other using

A "single value constraint", reflecting the reality that (typically) territories have only one capital city. Constraints are treated as testing alerts and hints, rather than inviolable rules. Before a new property is created, it needs to undergo a discussion process. The most used property is cites work (P2860), which is used on more than 290,000,000 item pages as of November 2023. In linguistics,
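The idea that a constraint produces a hint rather than a hard failure can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not Wikibase's actual constraint checker: the function name `check_single_value`, the dictionary layout, and the placeholder city names are all hypothetical.

```python
def check_single_value(statements, prop):
    """Return a list of warning strings; an empty list means no violation.

    `statements` maps a property ID to a list of values (hypothetical layout).
    Violations are reported as warnings, never raised as errors, mirroring
    the article's point that constraints are hints and not inviolable rules.
    """
    values = statements.get(prop, [])
    if len(values) > 1:
        return [f"{prop}: single value constraint violated ({len(values)} values)"]
    return []


# Placeholder data: a territory with two values for capital (P36).
territory = {"P36": ["city A", "city B"]}
warnings = check_single_value(territory, "P36")
```

Because the check only returns messages, the data with two capitals can still be stored; an editor decides whether the warning points at a real mistake.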

780-414: A branch of linguistics. Before the 20th century, linguists analysed language on a diachronic plane, which was historical in focus. This meant that they would compare linguistic features and try to analyse language from the point of view of how it had changed between then and later. However, with the rise of Saussurean linguistics in the 20th century, the focus shifted to a more synchronic approach, where

910-560: A comparison of different time periods in the past and present) or in a synchronic manner (by observing developments between different variations that exist within the current linguistic stage of a language). At first, historical linguistics was the cornerstone of comparative linguistics , which involves a study of the relationship between different languages. At that time, scholars of historical linguistics were only concerned with creating different categories of language families , and reconstructing prehistoric proto-languages by using both

1040-414: A form of content management system , these differ from other web-based systems such as blog software or static site generators in that the content is created without any defined owner or leader. Wikis have little inherent structure, allowing one to emerge according to the needs of the users. Wiki engines usually allow content to be written using a lightweight markup language and sometimes edited with

A given content size is likely to reduce growth; access controls restricting editing to registered users tend to reduce growth; a lack of such access controls tends to fuel new user registration; and that a higher ratio of administrators to regular users has no significant effect on content or population growth. Joint authorship of articles, in which different users participate in correcting, editing, and compiling

1300-434: A linguistic medium of communication in itself. Palaeography is therefore the discipline that studies the evolution of written scripts (as signs and symbols) in language. The formal study of language also led to the growth of fields like psycholinguistics , which explores the representation and function of language in the mind; neurolinguistics , which studies language processing in the brain; biolinguistics , which studies

1430-443: A link to view that specific revision. A diff (short for "difference") feature may be available, which highlights the changes between any two revisions. The edit history view in many wiki implementations will include edit summaries written by users when submitting changes to a page. Similar to the function of a log message in a revision control system, an edit summary is a short piece of text which summarizes and perhaps explains
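A diff between two stored revisions can be sketched with Python's standard `difflib` module. This is only an illustration of the idea; the helper name `revision_diff` and the sample revision texts are hypothetical, and real wiki engines use their own diff implementations.

```python
import difflib


def revision_diff(old_text, new_text):
    """Return unified-diff lines describing the change from old_text to new_text."""
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            fromfile="old revision",
            tofile="new revision",
            lineterm="",  # input lines carry no newline, so emit none either
        )
    )


old = "Wikis are editable.\nThey has a history."
new = "Wikis are editable.\nThey have a history."
for line in revision_diff(old, new):
    print(line)
```

Lines starting with `-` were removed and lines starting with `+` were added, which is the same information a wiki's diff view highlights visually.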

A long period. In addition to using the approach of soft security for protecting themselves, larger wikis may employ sophisticated methods, such as bots that automatically identify and revert vandalism. For example, on Wikipedia, the bot ClueBot NG uses machine learning to identify likely harmful changes, and reverts these changes within minutes or even seconds. Disagreements between users over

A numeric identifier prefixed with a capital P and a page on Wikidata with optional label, description, aliases, and statements. As such, there are properties with the sole purpose of describing other properties, such as subproperty of (P1647). Properties may also define more complex rules about their intended usage, termed constraints. For example, the capital (P36) property includes

1820-491: A page or set of pages to maintain quality. A person willing to maintain pages will be alerted of modifications to them, allowing them to verify the validity of new editions quickly. Such a feature is often called a watchlist . Some wikis also implement patrolled revisions , in which editors with the requisite credentials can mark edits as being legitimate. A flagged revisions system can prevent edits from going live until they have been reviewed. Wikis may allow any person on

1950-426: A page to an older version to rectify a mistake, or counteract a malicious or inappropriate edit to its content. These stores are typically presented for each page in a list, called a "log" or "edit history", available from the page via a link in the interface. The list displays metadata for each revision to the page, such as the time and date of when it was stored, and the name of the person who created it, alongside

2080-439: A page was displayed, any instance of a camel case phrase would be transformed into a link to another page named with the same phrase. While this system made it easy to link to pages, it had the downside of requiring pages to be named in a form deviating from standard spelling, and titles of a single word required abnormally capitalizing one of the letters (e.g. "WiKi" instead of "Wiki"). Some wiki implementations attempt to improve
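The automatic linking of camel case phrases can be sketched with a regular expression. This is a simplified illustration, not the code of any particular wiki engine: the pattern, the function name `link_camel_case`, and the HTML link format are assumptions made for the example.

```python
import re

# Two or more capitalised word parts run together, e.g. "CamelCase" or "WiKi".
CAMEL_CASE = re.compile(r"\b(?:[A-Z][a-z]+){2,}\b")


def link_camel_case(text):
    """Wrap every camel case phrase in an HTML link to a page of the same name."""
    return CAMEL_CASE.sub(
        lambda m: f'<a href="/{m.group(0)}">{m.group(0)}</a>', text
    )


print(link_camel_case("See CamelCase and Wiki."))
```

The ordinary word "Wiki" is left alone because it has only one capitalised part, which shows why single-word titles had to be written in a form such as "WiKi" to become links.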

2210-416: A particular feature or usage is "good" or "bad". This is analogous to practice in other sciences: a zoologist studies the animal kingdom without making subjective judgments on whether a particular species is "better" or "worse" than another. Prescription , on the other hand, is an attempt to promote particular linguistic usages over others, often favoring a particular dialect or " acrolect ". This may have

A query. The bars on the logo contain the word "WIKI" encoded in Morse code. It was created by Arun Ganesh and selected through community decision. In November 2014, Wikidata received the Open Data Publisher Award from the Open Data Institute "for sheer scale, and built-in openness". In December 2014, Google announced that it would shut down Freebase in favor of Wikidata. As of November 2018, Wikidata information

2470-416: A rich text editing mode. This is usually implemented, using JavaScript , as an interface which translates formatting instructions chosen from a toolbar into the corresponding wiki markup or HTML. This is generated and submitted to the server transparently , shielding users from the technical detail of markup editing and making it easier for them to change the content of pages. An example of such an interface

A second-language speaker who is attempting to acquire the language. Most contemporary linguists work under the assumption that spoken data and signed data are more fundamental than written data. Nonetheless, linguists agree that the study of written language can be worthwhile and valuable. For research that relies on corpus linguistics and computational linguistics, written language

2730-500: A series of scripts which operate an existing web server , a standalone application server that runs on one or more web servers, or in the case of personal wikis , run as a standalone application on a single computer. Some wikis use flat file databases to store page content, while others use a relational database , as indexed database access is faster on large wikis, particularly for searching. Wikis can also be created on wiki hosting services (also known as wiki farms ), where

2860-422: A single website, but rather to a mass of user-editable pages or sites so that a single website is not "a wiki" but "an instance of wiki". In this concept of wiki federation, in which the same content can be hosted and edited in more than one location in a manner similar to distributed version control , the idea of a single discrete "wiki" no longer made sense. The software which powers a wiki may be implemented as

2990-443: A system called WikiNodes . A WikiNode is a page on a wiki which describes and links to other, related wikis. Some wikis operate a structure of neighbors and delegates , wherein a neighbor wiki is one which discusses similar content or is otherwise of interest, and a delegate wiki is one which has agreed to have certain content delegated to it. WikiNode networks act as webrings which may be navigated from one node to another to find

3120-530: A term in natural language could be wrapped in special characters to turn it into a link without modifying it. The concept was given the name in its first implementation, in UseModWiki in February 2001. In that implementation, link terms were wrapped in a double set of square brackets, for example [[Kingdom of France]] . This syntax was adopted by a number of later wiki engines. It is typically possible for users of
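Extracting link targets written in the double square bracket syntax can be sketched as follows. The pattern and the function name `link_targets` are assumptions for illustration; real wiki engines support further features, such as alternative link text, that this sketch ignores.

```python
import re

# Text between [[ and ]] that contains no further square brackets.
FREE_LINK = re.compile(r"\[\[([^\[\]]+)\]\]")


def link_targets(text):
    """Return the page names referenced by [[...]] links, in order of appearance."""
    return FREE_LINK.findall(text)


print(link_targets("The [[Kingdom of France]] predates the [[Wiki]]."))
```

Unlike camel case linking, the page name inside the brackets keeps its natural spelling and spacing.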

A view towards uncovering the biological underpinnings of language. In Generative Grammar, these underpinnings are understood as including innate domain-specific grammatical knowledge. Thus, one of the central concerns of the approach is to discover which aspects of linguistic knowledge are innate and which are not. Cognitive linguistics, in contrast, rejects the notion of innate grammar, and studies how

3380-434: A wiki to create links to pages that do not yet exist, as a way to invite the creation of those pages. Such links are usually differentiated visually in some fashion, such as being colored red instead of the default blue, which was the case in the original WikiWikiWeb, or by appearing as a question mark next to the linked words. WikiWikiWeb was the first wiki. Ward Cunningham started developing it in 1994, and installed it on

3510-455: A wiki which addresses a specific subject. The syntax used to create internal hyperlinks varies between wiki implementations. Beginning with the WikiWikiWeb in 1995, most wikis used camel case to name pages, which is when words in a phrase are capitalized and the spaces between them removed. In this system, the phrase "camel case" would be rendered as "CamelCase". In early wiki engines, when

3640-454: A wiki's enforcement of certain rules, such as anti-bias, verifiability, reliable sourcing, and no-original-research policies, could pose legal risks. When defamation occurs on a wiki, theoretically, all users of the wiki can be held liable, because any of them had the ability to remove or amend the defamatory material from the "publication". It remains to be seen whether wikis will be regarded as more akin to an internet service provider , which

3770-424: A word. Linguistic structures are pairings of meaning and form. Any particular pairing of meaning and form is a Saussurean linguistic sign . For instance, the meaning "cat" is represented worldwide with a wide variety of different sound patterns (in oral languages), movements of the hands and face (in sign languages ), and written symbols (in written languages). Linguistic patterns have proven their importance for

Is a document-oriented database, focusing on items, which represent any kind of topic, concept, or object. Each item is allocated a unique, persistent identifier, a positive integer prefixed with the upper-case letter Q, known as a "QID". Q is the starting letter of the first name of Qamarniso Vrandečić (née Ismoilova), an Uzbek Wikimedian married to the Wikidata co-developer Denny Vrandečić. This enables
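The identifier scheme (a prefix letter followed by a positive integer) can be sketched as a small parser. This is an illustrative sketch, not Wikibase's own validation: the function name `parse_entity_id` and the `KINDS` table are hypothetical, and only the Q (item), P (property), and E (entity schema) prefixes named in this article are covered.

```python
import re

# Prefix letters named in the article; the mapping itself is illustrative.
KINDS = {"Q": "item", "P": "property", "E": "entity schema"}

# One prefix letter, then a positive integer without leading zeros.
_ID_PATTERN = re.compile(r"([QPE])([1-9][0-9]*)")


def parse_entity_id(text):
    """Return (kind, number) for an identifier such as 'Q8495', or raise ValueError."""
    match = _ID_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"not a recognised identifier: {text!r}")
    prefix, digits = match.groups()
    return KINDS[prefix], int(digits)


print(parse_entity_id("Q8495"))
```

Keeping the prefix separate from the number makes it easy to tell at a glance whether an identifier refers to an item, a property, or an entity schema.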

4030-458: Is a researcher within the field, or to someone who uses the tools of the discipline to describe and analyse specific languages. An early formal study of language was in India with Pāṇini , the 6th century BC grammarian who formulated 3,959 rules of Sanskrit morphology . Pāṇini's systematic classification of the sounds of Sanskrit into consonants and vowels, and word classes, such as nouns and verbs,

4160-430: Is a system of rules which governs the production and use of utterances in a given language. These rules apply to sound as well as meaning, and include componential subsets of rules, such as those pertaining to phonology (the organization of phonetic sound systems), morphology (the formation and composition of words), and syntax (the formation and composition of phrases and sentences). Modern frameworks that deal with

4290-441: Is concerned with understanding the universal and fundamental nature of language and developing a general theoretical framework for describing it. Applied linguistics seeks to utilize the scientific findings of the study of language for practical purposes, such as developing methods of improving language education and literacy. Linguistic features may be studied through a variety of perspectives: synchronically (by describing

4420-440: Is conventional or "coded" in a given language, pragmatics studies how the transmission of meaning depends not only on the structural and linguistic knowledge (grammar, lexicon, etc.) of the speaker and listener, but also on the context of the utterance, any pre-existing knowledge about those involved, the inferred intent of the speaker, and other factors. Phonetics and phonology are branches of linguistics concerned with sounds (or

4550-651: Is easy to correct mistakes or harmful changes, rather than attempting to prevent them from happening in the first place. This allows them to be very open while providing a means to verify the validity of recent additions to the body of pages. Most wikis offer a recent changes page which shows recent edits, or a list of edits made within a given time frame. Some wikis can filter the list to remove edits flagged by users as "minor" and automated edits. The version history feature allows harmful changes to be reverted quickly and easily. Some wiki engines provide additional content control, allowing remote monitoring and management of

4680-469: Is generally hard to find for events long ago, due to the occurrence of chance word resemblances and variations between language groups. A limit of around 10,000 years is often assumed for the functional purpose of conducting research. It is also hard to date various proto-languages. Even though several methods are available, these languages can be dated only approximately. In modern historical linguistics, we examine how languages change over time, focusing on

Is generally not held liable due to its lack of control over publications' contents, than a publisher. It has been recommended that trademark owners monitor what information is presented about their trademarks on wikis, since courts may use such content as evidence pertaining to public perceptions, and they can edit entries to rectify misinformation.

Linguistics

Linguistics is the scientific study of language. The areas of linguistic analysis are syntax (rules governing

Is not a single wiki but rather a collection of hundreds of wikis, with each one pertaining to a specific language. The English-language Wikipedia has the largest collection of articles, standing at 6,917,210 as of November 2024. In their 2001 book The Wiki Way: Quick Collaboration on the Web, Cunningham and co-author Bo Leuf described the essence of the wiki concept: Some wikis will present users with an edit button or link directly on

5070-447: Is often much more convenient for processing large amounts of linguistic data. Large corpora of spoken language are difficult to create and hard to find, and are typically transcribed and written. In addition, linguists have turned to text-based discourse occurring in various formats of computer-mediated communication as a viable site for linguistic inquiry. The study of writing systems themselves, graphemics, is, in any case, considered

5200-452: Is selected based on specific contexts but also, at a micro level, shapes language as text (spoken or written) down to the phonological and lexico-grammatical levels. Grammar and discourse are linked as parts of a system. A particular discourse becomes a language variety when it is used in this way for a particular purpose, and is referred to as a register . There may be certain lexical additions (new words) that are brought into play because of

5330-407: Is sometimes also used for wikis that cover not just a city, but a small town or an entire region. Such a wiki contains information about specific instances of things, ideas, people and places. Such highly localized information might be appropriate for a wiki targeted at local viewers, and could include: A study of several hundred wikis in 2008 showed that a relatively high number of administrators for

5460-478: Is specified, an implied license to read and add content to a wiki may be deemed to exist on the grounds of business necessity and the inherent nature of a wiki. Wikis and their users can be held liable for certain activities that occur on the wiki. If a wiki owner displays indifference and forgoes controls (such as banning copyright infringers) that they could have exercised to stop copyright infringement, they may be deemed to have authorized infringement, especially if

Is the VisualEditor in MediaWiki, the wiki engine used by Wikipedia. WYSIWYG editors may not provide all the features available in wiki markup, and some users prefer not to use them, so a source editor will often be available simultaneously. Some wiki implementations keep a record of changes made to wiki pages, and may store every version of the page permanently. This allows authors to revert

5720-428: Is the study of how language changes over history, particularly with regard to a specific language or a group of languages. Western trends in historical linguistics date back to roughly the late 18th century, when the discipline grew out of philology , the study of ancient texts and oral traditions. Historical linguistics emerged as one of the first few sub-disciplines in the field, and was most widely practised during

The Hungarian Wikipedia became the first to enable the provision of interlanguage links via Wikidata. This functionality was extended to the Hebrew and Italian Wikipedias on 30 January, to the English Wikipedia on 13 February and to all other Wikipedias on 6 March. After no consensus was reached over a proposal to restrict the removal of language links from the English Wikipedia, they were automatically removed by bots. On 23 September 2013, interlanguage links went live on Wikimedia Commons. On 4 February 2013, statements were introduced to Wikidata entries. The possible values for properties were initially limited to two data types (items and images on Wikimedia Commons), with more data types (such as coordinates and dates) to follow later. The first new type, string,

5980-610: The Internet domain c2.com on March 25, 1995. Cunningham gave it the name after remembering a Honolulu International Airport counter employee telling him to take the " Wiki Wiki Shuttle " bus that runs between the airport's terminals, later observing that "I chose wiki-wiki as an alliterative substitute for 'quick' and thereby avoided naming this stuff quick-web." Cunningham's system was inspired by his having used Apple 's hypertext software HyperCard , which allowed users to create interlinked "stacks" of virtual cards. HyperCard, however,

6110-503: The Sanskrit language in his Aṣṭādhyāyī . Today, modern-day theories on grammar employ many of the principles that were laid down then. Before the 20th century, the term philology , first attested in 1716, was commonly used to refer to the study of language, which was then predominantly historical in focus. Since Ferdinand de Saussure 's insistence on the importance of synchronic analysis , however, this focus has shifted and

6240-530: The United States Court of Appeals for the Seventh Circuit , used to post court rules and allow practitioners to comment and ask questions. The United States Patent and Trademark Office operates Peer-to-Patent , a wiki to allow the public to collaborate on finding prior art relevant to the examination of pending patent applications. Queens , New York has used a wiki to allow citizens to collaborate on

6370-734: The WikiWikiWeb , Memory Alpha , Wikivoyage , and previously Susning.nu , a Swedish-language knowledge base. Medical and health-related wiki examples include Ganfyd , an online collaborative medical reference that is edited by medical professionals and invited non-medical experts. Many wiki communities are private, particularly within enterprises . They are often used as internal documentation for in-house systems and applications. Some companies use wikis to allow customers to help produce software documentation. A study of corporate wiki users found that they could be divided into "synthesizers" and "adders" of content. Synthesizers' frequency of contribution

6500-432: The agent or patient . Functional linguistics , or functional grammar, is a branch of structural linguistics. In the humanistic reference, the terms structuralism and functionalism are related to their meaning in other human sciences . The difference between formal and functional structuralism lies in the way that the two approaches explain why languages have the properties they have. Functional explanation entails

6630-626: The comparative method and the method of internal reconstruction . Internal reconstruction is the method by which an element that contains a certain meaning is re-used in different contexts or environments where there is a variation in either sound or analogy. The reason for this had been to describe well-known Indo-European languages , many of which had detailed documentation and long written histories. Scholars of historical linguistics also studied Uralic languages , another European language family for which very little written material existed back then. After that, there also followed significant work on

6760-412: The knowledge engineering field especially with the ever-increasing amount of available data. Linguists focusing on structure attempt to understand the rules regarding language use that native speakers know (not always consciously). All linguistic structures can be broken down into component parts that are combined according to (sub)conscious rules, over multiple levels of analysis. For instance, consider

The mind of the individual or the speech community. Construction grammar is a framework which applies the meme concept to the study of syntax. The generative and evolutionary approaches are sometimes called formalism and functionalism, respectively. This reference is however different from the use of the terms in human sciences. Modern linguistics is primarily descriptive. Linguists describe and explain features of language without making subjective judgments on whether

The server-side software is implemented by the wiki farm owner, who may do so at no charge in exchange for advertisements being displayed on the wiki's pages. Some hosting services offer private, password-protected wikis requiring authentication to access. Free wiki farms generally contain advertising on every page. The four basic types of users who participate in wikis are readers, authors, wiki administrators and system administrators. System administrators are responsible for

The "medical discourse", and so on. The lexicon is a catalogue of words and terms that are stored in a speaker's mind. The lexicon consists of words and bound morphemes, which are parts of words that cannot stand alone, like affixes. In some analyses, compound words and certain classes of idiomatic expressions and other collocations are also considered to be part of the lexicon. Dictionaries represent attempts at listing, in alphabetical order,

7280-410: The "n" sound in "tenth" is made differently from the "n" sound in "ten" spoken alone. Although most speakers of English are consciously aware of the rules governing internal structure of the word pieces of "tenth", they are less often aware of the rule governing its sound structure. Linguists focused on structure find and analyze rules such as these, which govern how native speakers use language. Grammar

7410-476: The "occupation" property for Marie Curie could be linked with the values "physicist" and "chemist", to reflect the fact that she engaged in both occupations. Values may take on many types including other Wikidata items, strings, numbers, or media files. Properties prescribe what types of values they may be paired with. For example, the property official website (P856) may only be paired with values of type "URL". Optionally, qualifiers can be used to refine
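The way a property can map to one or several values can be sketched with a small in-memory model. This is a hypothetical illustration, not Wikibase's storage format: the `Item` class and its method are invented for the example, the milk identifiers are the ones quoted in this article, and the Marie Curie entry uses plain labels as placeholders because the article gives no identifiers for it.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    """A hypothetical in-memory item: an identifier plus property -> list of values."""

    qid: str
    statements: dict = field(default_factory=dict)

    def add_statement(self, prop, value):
        # A property may map to more than one value, so values accumulate in a list.
        self.statements.setdefault(prop, []).append(value)


# "milk is white": item milk (Q8495), property color (P462), value white (Q23444).
milk = Item("Q8495")
milk.add_statement("P462", "Q23444")

# Placeholder labels stand in for identifiers the article does not state.
curie = Item("Marie Curie (placeholder)")
curie.add_statement("occupation", "physicist")
curie.add_statement("occupation", "chemist")

print(milk.statements)
print(curie.statements)
```

Storing values in a list per property keeps the single-value and multi-value cases uniform, at the cost of always needing to index into the list.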

7540-543: The 18th century, the first use of the comparative method by William Jones sparked the rise of comparative linguistics . Bloomfield attributes "the first great scientific linguistic work of the world" to Jacob Grimm , who wrote Deutsche Grammatik . It was soon followed by other authors writing similar comparative studies on other language groups of Europe. The study of language was broadened from Indo-European to language in general by Wilhelm von Humboldt , of whom Bloomfield asserts: This study received its foundation at

The American singer and actor, and Elvis Presley (Q610926), which represents his self-titled album. However, the combination of a label and its description must be unique. To avoid ambiguity, an item's unique identifier (QID) is hence linked to this combination. Fundamentally, an item consists of: Statements are how any information known about an item is recorded in Wikidata. Formally, they consist of key–value pairs, which match
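The rule that labels may repeat while each label and description pair must be unique can be sketched as a lookup. This is a simplified illustration under stated assumptions: the function name `find_conflict` and the dictionary layout are hypothetical, the description strings are paraphrased from this article, and the sketch ignores languages entirely.

```python
def find_conflict(items, label, description):
    """Return the QID already using this (label, description) pair, or None."""
    for qid, pair in items.items():
        if pair == (label, description):
            return qid
    return None


# Two items share a label but differ in description, so both may coexist.
items = {
    "Q303": ("Elvis Presley", "American singer and actor"),
    "Q610926": ("Elvis Presley", "self-titled album"),
}

print(find_conflict(items, "Elvis Presley", "American singer and actor"))
print(find_conflict(items, "Elvis Presley", "some other description"))
```

A new entry would be rejected only when `find_conflict` returns an existing QID, since a shared label alone is not a conflict.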

7800-451: The Berlin article, which was not feasible before. On 27 April 2016, arbitrary access was activated on Wikimedia Commons. According to a 2020 study, a large proportion of the data on Wikidata consists of entries imported en masse from other databases by Internet bots , which helps to "break down the walls" of data silos . On 7 September 2015, the Wikimedia Foundation announced the release of

7930-563: The East, but the grammarians of the classical languages did not use the same methods or reach the same conclusions as their contemporaries in the Indic world. Early interest in language in the West was a part of philosophy, not of grammatical description. The first insights into semantic theory were made by Plato in his Cratylus dialogue , where he argues that words denote concepts that are eternal and exist in

The WikiCite project. It includes data collections from other open projects including Freebase. The creation of the project was funded by donations from the Allen Institute for Artificial Intelligence, the Gordon and Betty Moore Foundation, and Google, Inc., totaling €1.3 million. The development of the project is mainly driven by Wikimedia Deutschland under the management of Lydia Pintscher, and

The Wikidata Query Service, which lets users run queries on the data contained in Wikidata. The service uses SPARQL as the query language and Blazegraph as its triplestore and graph database. As of November 2018, there were at least 26 different tools that allow querying the data in different ways. In 2021, Wikimedia Deutschland released the Query Builder, "a form-based query builder to allow people who don't know how to use SPARQL" to write
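A SPARQL query for the Query Service can be sketched by building the query text and request URL in Python. Several details here are not stated in this article and should be treated as assumptions to verify against the service's documentation: the endpoint address `https://query.wikidata.org/sparql`, the predefined `wd:` and `wdt:` prefixes, and the `format=json` parameter. The helper names are hypothetical, and the sketch only constructs the URL without sending a request.

```python
from urllib.parse import urlencode

# Assumed public endpoint of the Wikidata Query Service.
ENDPOINT = "https://query.wikidata.org/sparql"


def build_query(prop, value, limit=10):
    """Build a query for items whose property `prop` has the item `value`."""
    return (
        "SELECT ?item WHERE { "
        f"?item wdt:{prop} wd:{value} . "
        "} "
        f"LIMIT {limit}"
    )


def build_url(query):
    """Return a GET URL for the query, asking for JSON results."""
    return ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


# Items whose color (P462) is white (Q23444), as in the milk example.
query = build_query("P462", "Q23444")
print(query)
print(build_url(query))
```

Separating query construction from URL construction keeps the SPARQL text readable and lets the same query be pasted into a web-based query editor for testing.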

8320-498: The academic community for sharing and dissemination of information across institutional and international boundaries. In those settings, they have been found useful for collaboration on grant writing , strategic planning , departmental documentation, and committee work. In the mid-2000s, the increasing trend among industries toward collaboration placed a heavier impetus upon educators to make students proficient in collaborative work, inspiring even greater interest in wikis being used in

8450-668: The aim of establishing a linguistic standard , which can aid communication over large geographical areas. It may also, however, be an attempt by speakers of one language or dialect to exert influence over speakers of other languages or dialects (see Linguistic imperialism ). An extreme version of prescriptivism can be found among censors , who attempt to eradicate words and structures that they consider to be destructive to society. Prescription, however, may be practised appropriately in language instruction , like in ELT , where certain fundamental grammatical rules and lexical items need to be introduced to

The basic information required to identify the topic that the item covers to be translated without favouring any language. Examples of items include 1988 Summer Olympics (Q8470), love (Q316), Johnny Cash (Q42775), Elvis Presley (Q303), and Gorilla (Q36611). Item labels do not need to be unique. For example, there are two items named "Elvis Presley": Elvis Presley (Q303), which represents

8710-404: The biology and evolution of language; and language acquisition , which investigates how children and adults acquire the knowledge of one or more languages. The fundamental principle of humanistic linguistics, especially rational and logical grammar , is that language is an invention created by people. A semiotic tradition of linguistic research considers language a sign system which arises from

8840-426: The change, for example "Corrected grammar" or "Fixed table formatting to not extend past page width". It is not inserted into the article's main text. Traditionally, wikis offer free navigation between their pages via hypertext links in page text, rather than requiring users to follow a formal or structured navigation scheme. Users may also create indexes or table of contents pages, hierarchical categorization via

The classroom. Wikis have found some use within the legal profession and within the government. Examples include the Central Intelligence Agency's Intellipedia, designed to share and collect intelligence assessments; DKosopedia, which was used by the American Civil Liberties Union to assist with review of documents about the internment of detainees in Guantánamo Bay; and the wiki of

9100-424: The content or appearance of pages may cause edit wars , where competing users repetitively change a page back to a version that they favor. Some wiki software allows administrators to prevent pages from being editable until a decision has been made on what version of the page would be most appropriate. Some wikis may be subject to external structures of governance which address the behavior of persons with access to

9230-420: The content. Proponents maintain that these issues will be caught and rectified by a wiki's community of users. High editorial standards in medicine and health sciences articles, in which users typically use peer-reviewed journals or university textbooks as sources, have led to the idea of expert-moderated wikis. Wiki implementations retaining and allowing access to specific versions of articles has been useful to

9360-546: The corpora of other languages, such as the Austronesian languages and the Native American language families . In historical work, the uniformitarian principle is generally the underlying working hypothesis, occasionally also clearly expressed. The principle was expressed early by William Dwight Whitney , who considered it imperative, a "must", of historical linguistics to "look to find the same principle operative also in

9490-474: The data in Wikidata items in the form of a Resource Description Framework (RDF). The use of entity schemas in Wikidata helps address data inconsistencies and unchecked vandalism. In January 2019, development started of a new extension for MediaWiki to enable storing ShEx in a separate namespace. Entity schemas are stored with different identifiers than those used for items, properties, and lexemes. Entity schemas are stored with an "E" identifier, such as E10 for

9620-435: The design and planning of a local park. Cornell Law School founded a wiki-based legal dictionary called Wex , whose growth has been hampered by restrictions on who can edit. In academic contexts, wikis have also been used as project collaboration and research support systems. A city wiki or local wiki is a wiki used as a knowledge base and social network for a specific geographical locale. The term city wiki

9750-445: The developer of the first wiki software, WikiWikiWeb , originally described wiki as "the simplest online database that could possibly work". " Wiki " (pronounced [wiki] ) is a Hawaiian word meaning "quick". The online encyclopedia project Misplaced Pages is the most popular wiki-based website, as well being one of the internet's most popular websites , having been ranked consistently as such since at least 2007. Misplaced Pages

9880-462: The development of modern standard varieties of languages, and over the development of a language from its standardized form to its varieties. For instance, some scholars also tried to establish super-families , linking, for example, Indo-European, Uralic, and other language families to Nostratic . While these attempts are still not widely accepted as credible methods, they provide necessary information to establish relatedness in language change. This

10010-412: The display of camel case page titles and links by reinserting spaces and possibly also reverting to lower case, but this simplistic method is not able to correctly present titles of mixed capitalization. For example, " Kingdom of France " as a page title would be written as "KingdomOfFrance", and displayed as "Kingdom Of France". To avoid this problem, the syntax of wiki markup gained free links , wherein

10140-532: The entity schema of human data instances and E270 for the entity schema of building data instances. This extension has since been installed on Wikidata and enables contributors to use ShEx for validating and describing Resource Description Framework data in items and lexemes. Any item or lexeme on Wikidata can be validated against an Entity Schema, and this makes it an important tool for quality assurance. Wikidata's content collections include data for biographies, medicine, digital humanities, scholarly metadata through

10270-426: The equivalent aspects of sign languages). Phonetics is largely concerned with the physical aspects of sounds such as their articulation , acoustics, production, and perception. Phonology is concerned with the linguistic abstractions and categorizations of sounds, and it tells us what sounds are in a language, how they do and can combine into words, and explains why certain phonetic features are important to identifying

10400-430: The expertise of the community of people within a certain domain of specialization. Thus, registers and discourses distinguish themselves not only through specialized vocabulary but also, in some cases, through distinct stylistic choices. People in the medical fraternity, for example, may use some medical terminology in their communication that is specialized to the field of medicine. This is often referred to as being part of

10530-450: The field of philology , of which some branches are more qualitative and holistic in approach. Today, philology and linguistics are variably described as related fields, subdisciplines, or separate fields of language study but, by and large, linguistics can be seen as an umbrella term. Linguistics is also related to the philosophy of language , stylistics , rhetoric , semiotics , lexicography , and translation . Historical linguistics

10660-572: The finished product, can also cause editors to become tenants in common of the copyright, making it impossible to republish without permission of all co-owners, some of whose identities may be unknown due to pseudonymous or anonymous editing. Some copyright issues can be alleviated through the use of an open content license. Version 2 of the GNU Free Documentation License includes a specific provision for wiki relicensing, and Creative Commons licenses are also popular. When no license

10790-621: The hands of the Prussian statesman and scholar Wilhelm von Humboldt (1767–1835), especially in the first volume of his work on Kavi, the literary language of Java, entitled Über die Verschiedenheit des menschlichen Sprachbaues und ihren Einfluß auf die geistige Entwickelung des Menschengeschlechts ( On the Variety of the Structure of Human Language and its Influence upon the Mental Development of

10920-835: The help of a rich-text editor . There are dozens of different wiki engines in use, both standalone and part of other software, such as bug tracking systems . Some wiki engines are free and open-source , whereas others are proprietary . Some permit control over different functions (levels of access); for example, editing rights may permit changing, adding, or removing material. Others may permit access without enforcing access control. Further rules may be imposed to organize content. In addition to hosting user-authored content, wikis allow those users to interact, hold discussions, and collaborate. There are hundreds of thousands of wikis in use , both public and private, including wikis functioning as knowledge management resources, note-taking tools, community websites , and intranets . Ward Cunningham ,

11050-433: The history of a language. The discipline that deals specifically with the sound changes occurring within morphemes is morphophonology . Semantics and pragmatics are branches of linguistics concerned with meaning. These subfields have traditionally been divided according to aspects of meaning: "semantics" refers to grammatical and lexical meanings, while "pragmatics" is concerned with meaning in context. Within linguistics,

11180-414: The human mind creates linguistic constructions from event schemas , and the impact of cognitive constraints and biases on human language. In cognitive linguistics, language is approached via the senses . A closely related approach is evolutionary linguistics which includes the study of linguistic units as cultural replicators . It is possible to study how language replicates and adapts to

11310-461: The idea that language is a tool for communication, or that communication is the primary function of language. Linguistic forms are consequently explained by an appeal to their functional value, or usefulness. Other structuralist approaches take the perspective that form follows from the inner mechanisms of the bilateral and multilayered language system. Approaches such as cognitive linguistics and generative grammar study linguistic cognition with

11440-405: The installation and maintenance of the wiki engine and the container web server. Wiki administrators maintain content and, through having elevated privileges , are granted additional functions (including, for example, preventing edits to pages, deleting pages, changing users' access rights, or blocking them from editing). Wikis are generally designed with a soft security philosophy in which it

11570-498: The interaction of meaning and form. The organization of linguistic levels is considered computational. Linguistics is essentially seen as relating to social and cultural studies because different languages are shaped in social interaction by the speech community . Frameworks representing the humanistic view of language include structural linguistics , among others. Structural analysis means dissecting each linguistic level: phonetic, morphological, syntactic, and discourse, to

11700-412: The late 19th century. Despite a shift in focus in the 20th century towards formalism and generative grammar , which studies the universal properties of language, historical research today still remains a significant field of linguistic inquiry. Subfields of the discipline include language change and grammaticalization . Historical linguistics studies language change either diachronically (through

11830-441: The letter L, such as in the example entries for book and cow . Lexicographical entries in Wikidata can contain statements, senses, and forms. The use of lexicographical entries in Wikidata allows for the documentation of word usage, the connection between words and items on Wikidata, word translations, and enables machine-readable lexicographical data. In 2020, lexicographical entries on Wikidata exceeded 250,000. The language with

11960-429: The lexicon of a given language; usually, however, bound morphemes are not included. Lexicography , closely linked with the domain of semantics, is the science of mapping the words into an encyclopedia or a dictionary. The creation and addition of new words (into the lexicon) is called coining or neologization , and the new words are called neologisms . It is often believed that a speaker's capacity for language lies in

12090-458: The link had their systems infected with the worm. Some wiki engines offer a blacklist feature which prevents users from adding hyperlinks to specific sites that have been placed on the list by the wiki's administrators. The English Misplaced Pages has the largest user base among wikis on the World Wide Web and ranks in the top 10 among all Web sites in terms of traffic. Other large wikis include

12220-421: The meaning of a statement by providing additional information. For example, a "population" statement could be modified with a qualifier such as "point in time (P585): 2011" (as its own key-value pair). Values in the statements may also be annotated with references , pointing to a source backing up the statement's content. As with statements, all qualifiers and references are property–value pairs. Each property has

12350-552: The most famous wiki site , launched in January 2001 and entering the top ten most popular websites in 2007. In the early 2000s, wikis were increasingly adopted in enterprise as collaborative software. Common uses included project communication, intranets , and documentation, initially for technical users. Some companies use wikis as their collaborative software and as a replacement for static intranets, and some schools and universities use wikis to enhance group learning . On March 15, 2007,

12480-583: The most lexicographical entries was Russian , with a total of 101,137 lexemes, followed by English with 38,122 lexemes. There are over 668 languages with lexicographical entries on Wikidata. In Wikidata, a schema is a data model that outlines the necessary attributes for a data item. For instance, a data item that uses the attribute " instance of " with the value " human " would typically include attributes such as " place of birth ," " date of birth ," "date of death ," and " place of death ." The entity schema in Wikidata utilizes Shape Expression (ShEx) to describe

12610-426: The nature of crosslinguistic variation, and the relationship between form and meaning. There are numerous approaches to syntax that differ in their central assumptions and goals. Morphology is the study of words , including the principles by which they are formed, and how they relate to one another within a language. Most approaches to morphology investigate the structure of words in terms of morphemes , which are

12740-421: The other hand, focuses on an analysis that is based on the paradigms or concepts that are embedded in a given text. In this case, words of the same type or class may be replaced in the text with each other to achieve the same conceptual understanding. The earliest activities in the description of language have been attributed to the 6th-century-BC Indian grammarian Pāṇini who wrote a formal description of

12870-531: The page being viewed. This will open an interface for writing, formatting, and structuring page content. The interface may be a source editor, which is text-based and employs a lightweight markup language (also known as wikitext , wiki markup , or wikicode ), or a visual editor . For example, in a source editor, starting lines of text with asterisks could create a bulleted list . The syntax and features of wiki markup languages for denoting style and structure can vary greatly among implementations . Some allow

13000-478: The principles of grammar include structural and functional linguistics , and generative linguistics . Sub-fields that focus on a grammatical study of language include the following: Discourse is language as social practice (Baynham, 1995) and is a multilayered concept. As a social practice, discourse embodies different ideologies through written and spoken texts. Discourse analysis can examine or expose these ideologies. Discourse not only influences genre, which

13130-416: The quantity of words stored in the lexicon. However, this is often considered a myth by linguists. The capacity for the use of language is considered by many linguists to lie primarily in the domain of grammar, and to be linked with competence , rather than with the growth of vocabulary. Even a very small lexicon is theoretically capable of producing an infinite number of sentences. Stylistics also involves

13260-424: The relationships between dialects within a specific period. This includes studying morphological, syntactical, and phonetic shifts. Connections between dialects in the past and present are also explored. Syntax is the study of how words and morphemes combine to form larger units such as phrases and sentences . Central concerns of syntax include word order , grammatical relations , constituency , agreement ,

13390-427: The scientific community, by allowing expert peer reviewers to provide links to trusted version of articles which they have analyzed. Trolling and cybervandalism on wikis, where content is changed to something deliberately incorrect or a hoax , offensive material or nonsense is added, or content is maliciously removed, can be a major problem. On larger wiki sites it is possible for such changes to go unnoticed for

13520-401: The scientific study of language, though linguistic science is sometimes used. Linguistics is a multi-disciplinary field of research that combines tools from natural sciences, social sciences, formal sciences , and the humanities. Many linguists, such as David Crystal, conceptualize the field as being primarily scientific. The term linguist applies to someone who studies language or

13650-744: The smallest units in a language with some independent meaning . Morphemes include roots that can exist as words by themselves, but also categories such as affixes that can only appear as part of a larger word. For example, in English the root catch and the suffix -ing are both morphemes; catch may appear as its own word, or it may be combined with -ing to form the new word catching . Morphology also analyzes how words behave as parts of speech , and how they may be inflected to express grammatical categories including number , tense , and aspect . Concepts such as productivity are concerned with how speakers create words in specific contexts, which evolves over

13780-404: The smallest units. These are collected into inventories (e.g. phoneme, morpheme, lexical classes, phrase types) to study their interconnectedness within a hierarchy of structures and layers. Functional analysis adds to structural analysis the assignment of semantic and other functional roles that each unit may have. For example, a noun phrase may function as the subject or object of the sentence; or

13910-488: The structure of a language at a specific point in time) or diachronically (through the historical development of a language over a period of time), in monolinguals or in multilinguals , among children or among adults, in terms of how it is being learnt or how it was acquired, as abstract objects or as cognitive structures, through written texts or through oral elicitation, and finally through mechanical data collection or through practical fieldwork. Linguistics emerged from

14040-696: The structure of sentences), semantics (meaning), morphology (structure of words), phonetics (speech sounds and equivalent gestures in sign languages ), phonology (the abstract sound system of a particular language), and pragmatics (how the context of use contributes to meaning). Subdisciplines such as biolinguistics (the study of the biological variables and evolution of language) and psycholinguistics (the study of psychological factors in human language) bridge many of these divisions. Linguistics encompasses many branches and subfields that span both theoretical and practical applications. Theoretical linguistics (including traditional descriptive linguistics)

14170-445: The structure of the word "tenth" on two different levels of analysis. On the level of internal word structure (known as morphology), the word "tenth" is made up of one linguistic form indicating a number and another form indicating ordinality. The rule governing the combination of these forms ensures that the ordinality marker "th" follows the number "ten." On the level of sound structure (known as phonology), structural analysis shows that

14300-471: The study of language in canonical works of literature, popular fiction, news, advertisements, and other forms of communication in popular culture as well. It is usually seen as a variation in communication that changes from speaker to speaker and community to community. In short, Stylistics is the interpretation of text. In the 1960s, Jacques Derrida , for instance, further distinguished between speech and writing, by proposing that written language be studied as

14430-531: The study of written, signed, or spoken discourse through varying speech communities, genres, and editorial or narrative formats in the mass media. It involves the study and interpretation of texts for aspects of their linguistic and tonal style. Stylistic analysis entails the analysis of description of particular dialects and registers used by speech communities. Stylistic features include rhetoric , diction, stress, satire, irony , dialogue, and other forms of phonetic variations. Stylistic analysis can also include

14560-436: The study was geared towards analysis and comparison between different language variations, which existed at the same given point of time. At another level, the syntagmatic plane of linguistic analysis entails the comparison between the way words are sequenced, within the syntax of a sentence. For example, the article "the" is followed by a noun, because of the syntagmatic relation between the words. The paradigmatic plane, on

14690-586: The subfield of formal semantics studies the denotations of sentences and how they are composed from the meanings of their constituent expressions. Formal semantics draws heavily on philosophy of language and uses formal tools from logic and computer science . On the other hand, cognitive semantics explains linguistic meaning via aspects of general cognition, drawing on ideas from cognitive science such as prototype theory . Pragmatics focuses on phenomena such as speech acts , implicature , and talk in interaction . Unlike semantics, which examines meaning that

14820-472: The system, for example in academic contexts. As most wikis allow the creation of hyperlinks to other sites and services, the addition of malicious hyperlinks, such as sites infected with malware , can also be a problem. For example, in 2006 a German Misplaced Pages article about the Blaster Worm was edited to include a hyperlink to a malicious website, and users of vulnerable Microsoft Windows systems who followed

14950-475: The term philology is now generally used for the "study of a language's grammar, history, and literary tradition", especially in the United States (where philology has never been very popularly considered as the "science of language"). Although the term linguist in the sense of "a student of language" dates from 1641, the term linguistics is first attested in 1847. It is now the usual term in English for

15080-497: The topic in all the various language editions of Misplaced Pages (interwikipedia links). Historically, a Misplaced Pages article would include a list of interlanguage links (links to articles on the same topic in other editions of Misplaced Pages, if they existed). Wikidata was originally a self-contained repository of interlanguage links. Misplaced Pages language editions were still not able to access Wikidata, so they needed to continue to maintain their own lists of interlanguage links. On 14 January 2013,

15210-673: The use of HTML Tooltip Hypertext Markup Language and CSS Tooltip Cascading Style Sheets , while others prevent the use of these to foster uniformity in appearance. A short section of Alice's Adventures in Wonderland rendered in wiki markup: "I've had nothing yet," Alice replied in an offended tone, "so I can't take more." "You mean you can't take less ," said the Hatter. "It's very easy to take more than nothing." While wiki engines have traditionally offered source editing to users, in recent years some implementations have added

15340-531: The uses of Wikidata in research was carried out in 2019. Wiki A wiki ( / ˈ w ɪ k i / WI -kee ) is a form of hypertext publication on the internet which is collaboratively edited and managed by its audience directly through a web browser . A typical wiki contains multiple pages that can either be edited by the public or limited to use within an organization for maintaining its internal knowledge base . Wikis are powered by wiki software , also known as wiki engines. Being

15470-420: The very outset of that [language] history." The above approach of comparativism in linguistics is now, however, only a small part of the much broader discipline called historical linguistics. The comparative study of specific Indo-European languages is considered a highly specialized field today, while comparative research is carried out over the subsequent internal developments in a language: in particular, over

15600-568: The web to edit their content without having to register an account on the site first ( anonymous editing ), or require registration as a condition of participation. On implementations where an administrator is able to restrict editing of a page or group of pages to a specific group of users, they may have the option to prevent anonymous editing while allowing it for registered users. Critics of publicly editable wikis argue that they could be easily tampered with by malicious individuals, or even by well-meaning but unskilled users who introduce errors into

15730-494: The wiki is primarily used to infringe copyrights or obtains a direct financial benefit, such as advertising revenue, from infringing activities. In the United States, wikis may benefit from Section 230 of the Communications Decency Act , which protects sites that engage in " Good Samaritan " policing of harmful material, with no requirement on the quality or quantity of such self-policing. It has also been argued that

15860-414: The word wiki was listed in the online Oxford English Dictionary . In the late 1990s and early 2000s, the word "wiki" was used to refer to both user-editable websites and the software that powers them, and the latter definition is still occasionally in use. By 2014, Ward Cunningham's thinking on the nature of wikis had evolved, leading him to write that the word "wiki" should not be used to refer to

15990-551: The word in its original meaning as " téchnē grammatikḗ " ( Τέχνη Γραμματική ), the "art of writing", which is also the title of one of the most important works of the Alexandrine school by Dionysius Thrax . Throughout the Middle Ages , the study of language was subsumed under the topic of philology, the study of ancient languages and texts, practised by such educators as Roger Ascham , Wolfgang Ratke , and John Amos Comenius . In

16120-582: The world of ideas. This work is the first to use the word etymology to describe the history of a word's meaning. Around 280 BC, one of Alexander the Great 's successors founded a university (see Musaeum ) in Alexandria , where a school of philologists studied the ancient texts in Greek, and taught Greek to speakers of other languages. While this school was the first to use the word "grammar" in its modern sense, Plato had used

16250-629: Was affected more by their impact on other wiki users, while adders' contribution frequency was affected more by being able to accomplish their immediate work. From a study of thousands of wiki deployments, Jonathan Grudin concluded careful stakeholder analysis and education are crucial to successful wiki deployment. In 2005, the Gartner Group, noting the increasing popularity of wikis, estimated that they would become mainstream collaboration tools in at least 50% of companies by 2009. Wikis can be used for project management . Wikis have also been used in

16380-431: Was deployed on 6 March. The ability for the various language editions of Misplaced Pages to access data from Wikidata was rolled out progressively between 27 March and 25 April 2013. On 16 September 2015, Wikidata began allowing so-called arbitrary access , or access from a given article of a Misplaced Pages to the statements on Wikidata items not directly connected to it. For example, it became possible to read data about Germany from

16510-467: Was originally split into three phases: Wikidata was launched on 29 October 2012 and was the first new project of the Wikimedia Foundation since 2006. At this time, only the centralization of language links was available. This enabled items to be created and filled with basic information: a label – a name or title, aliases – alternative terms for the label, a description, and links to articles about

16640-478: Was single-user, and Cunningham was inspired to build upon the ideas of Vannevar Bush , the inventor of hypertext, by allowing users to "comment on and change one another's text." Cunningham says his goals were to link together people's experiences to create a new literature to document programming patterns , and to harness people's natural desire to talk and tell stories with a technology that would feel comfortable to those not used to "authoring". Misplaced Pages became

16770-507: Was the first known instance of its kind. In the Middle East, Sibawayh , a Persian, made a detailed description of Arabic in AD 760 in his monumental work, Al-kitab fii an-naħw ( الكتاب في النحو , The Book on Grammar ), the first known author to distinguish between sounds and phonemes (sounds as units of a linguistic system) . Western interest in the study of languages began somewhat later than in

16900-492: Was used in 58.4% of all English Misplaced Pages articles, mostly for external identifiers or coordinate locations. In aggregate, data from Wikidata is shown in 64% of all Wikipedias' pages, 93% of all Wikivoyage articles, 34% of all Wikiquotes', 32% of all Wikisources', and 27% of Wikimedia Commons. As of December 2020, Wikidata's data was visualized by at least 20 other external tools and over 300 papers have been published about Wikidata. A systematic literature review of
