121-670: Baidu Baike ( / ˈ b aɪ d uː ˈ b aɪ k ə / ; Chinese : 百度 百科 ; pinyin : Bǎidù Bǎikē ; lit. 'Baidu Encyclopedia', also known as Baidu Wiki ) is a semi-regulated Chinese -language collaborative online encyclopedia owned by the Chinese technology company Baidu . The beta version was launched on 20 April 2006, and the official version was launched on 21 April 2008. In November 2019, it had more than 16 million articles and 6.9 million editors. As of February 2022, it has more than 25.54 million entries and 7.5 million editors. It has
242-464: A nucleus that has a vowel (which can be a monophthong , diphthong , or even a triphthong in certain varieties), preceded by an onset (a single consonant , or consonant + glide ; a zero onset is also possible), and followed (optionally) by a coda consonant; a syllable also carries a tone . There are some instances where a vowel is not used as a nucleus. An example of this is in Cantonese, where
363-457: A subject–verb–object word order , and like many other languages of East Asia, makes frequent use of the topic–comment construction to form sentences. Chinese also has an extensive system of classifiers and measure words , another trait shared with neighboring languages such as Japanese and Korean. Other notable grammatical features common to all the spoken varieties of Chinese include the use of serial verb construction , pronoun dropping , and
484-420: A "user contributions" option on a sidebar. In a 2004 article, Carl Challborn and Teresa Reimann noted that "While this feature may be a slight deviation from the collaborative, 'ego-less' spirit of wiki purists, it can be very useful for educators who need to assess the contribution and participation of individual student users." MediaWiki provides many features beyond hyperlinks for structuring content. One of
605-455: A Chinese character is the morpheme, as characters represent the smallest grammatical units with individual meanings in the Chinese language. Estimates of the total number of Chinese words and lexicalized phrases vary greatly. The Hanyu Da Zidian , a compendium of Chinese characters, includes 54,678 head entries for characters, including oracle bone versions. The Zhonghua Zihai (1994) contains 85,568 head entries for character definitions and
726-457: A MediaWiki-based wiki found that when they were asked an open question about main problems with the wiki, 24% cited technical problems with formatting, e.g. "Couldn't figure out how to get an image in. Can't figure out how to show a link with words; it inserts a number." To make editing long pages easier, MediaWiki allows the editing of a subsection of a page (as identified by its header). A registered user can also indicate whether or not an edit
847-595: A Shanghai resident may speak both Standard Chinese and Shanghainese ; if they grew up elsewhere, they are also likely fluent in the dialect of their home region. In addition to Standard Chinese, a majority of Taiwanese people also speak Taiwanese Hokkien (also called 台語 ; 'Taiwanese' ), Hakka , or an Austronesian language . A speaker in Taiwan may mix pronunciations and vocabulary from Standard Chinese and other languages of Taiwan in everyday speech. In part due to traditional cultural ties with Guangdong , Cantonese
968-462: A central variety (i.e. prestige variety, such as Standard Mandarin), as the issue requires some careful handling when mutual intelligibility is inconsistent with language identity. The Chinese government's official Chinese designation for the major branches of Chinese is 方言 ; fāngyán ; 'regional speech', whereas the more closely related varieties within these are called 地点方言 ; 地點方言 ; dìdiǎn fāngyán ; 'local speech'. Because of
1089-597: A compromise between the pronunciations of different regions. The royal courts of the Ming and early Qing dynasties operated using a koiné language known as Guanhua , based on the Nanjing dialect of Mandarin. Standard Chinese is an official language of both the People's Republic of China and the Republic of China (Taiwan), one of the four official languages of Singapore , and one of
1210-450: A continuous feed of Recent Changes to an IRC channel that these tools can monitor, eliminating their need to send requests for a refreshed Recent Changes feed to the API. Another important tool is watchlisting. Each logged-in user has a watchlist to which the user can add whatever pages he or she wishes. When an edit is made to one of those pages, a summary of that edit appears on the watchlist
1331-700: A corresponding increase in the number of homophones . As an example, the small Langenscheidt Pocket Chinese Dictionary lists six words that are commonly pronounced as shí in Standard Chinese: In modern spoken Mandarin, however, tremendous ambiguity would result if all of these words could be used as-is. The 20th century Yuen Ren Chao poem Lion-Eating Poet in the Stone Den exploits this, consisting of 92 characters all pronounced shi . As such, most of these words have been replaced in speech, if not in writing, with less ambiguous disyllabic compounds. Only
SECTION 10
#17328515815211452-530: A file called LocalSettings.php . Some aspects of MediaWiki can be configured through special pages or by editing certain pages; for instance, abuse filters can be configured through a special page, and certain gadgets can be added by creating JavaScript pages in the MediaWiki namespace. The MediaWiki community publishes a comprehensive installation guide. One of the earliest differences between MediaWiki (and its predecessor, UseModWiki ) and other wiki engines
1573-452: A large part of the set requirements for the software. Besides its usage on Wikimedia sites, MediaWiki has been used as a knowledge management and content management system on websites such as Fandom , wikiHow and major internal installations like Intellipedia and Diplopedia . MediaWiki is written in the PHP programming language and stores all text content into a database . The software
1694-505: A lesser degree, the Wikimedia Foundation's other projects. Fandom , a wiki hosting service formerly known as Wikia, runs on MediaWiki. Other public wikis that run on MediaWiki include wikiHow and SNPedia . WikiLeaks began as a MediaWiki-based site, but is no longer a wiki. A number of alternative wiki encyclopedias to Misplaced Pages run on MediaWiki, including Citizendium , Metapedia , Scholarpedia and Conservapedia . MediaWiki
1815-564: A millennium. The Four Commanderies of Han were established in northern Korea in the 1st century BCE but disintegrated in the following centuries. Chinese Buddhism spread over East Asia between the 2nd and 5th centuries CE, and with it the study of scriptures and literature in Literary Chinese. Later, strong central governments modeled on Chinese institutions were established in Korea, Japan, and Vietnam, with Literary Chinese serving as
1936-580: A new logo was initiated on June 22, 2020, as the old logo was a bitmap image and had "high details", leading to problems when rendering at high and low resolutions, respectively. After two rounds of voting, the new and current MediaWiki logo designed by Serhio Magpie was selected on October 24, 2020, and officially adopted on April 1, 2021. The first version of MediaWiki, 1.1, was released in December 2003. MediaWiki's most famous use has been in Misplaced Pages and, to
2057-538: A secure reconstruction of Proto-Sino-Tibetan, the higher-level structure of the family remains unclear. A top-level branching into Chinese and Tibeto-Burman languages is often assumed, but has not been convincingly demonstrated. The first written records appeared over 3,000 years ago during the Shang dynasty . As the language evolved over this period, the various local varieties became mutually unintelligible. In reaction, central governments have repeatedly sought to promulgate
2178-503: A similar way to the use of Latin and Ancient Greek roots in European languages. Many new compounds, or new meanings for old phrases, were created in the late 19th and early 20th centuries to name Western concepts and artifacts. These coinages, written in shared Chinese characters, have then been borrowed freely between languages. They have even been accepted into Chinese, a language usually resistant to loanwords, because their foreign origin
2299-678: A single language. However, their lack of mutual intelligibility means they are sometimes considered to be separate languages in a family . Investigation of the historical relationships among the varieties of Chinese is ongoing. Currently, most classifications posit 7 to 13 main regional groups based on phonetic developments from Middle Chinese , of which the most spoken by far is Mandarin with 66%, or around 800 million speakers, followed by Min (75 million, e.g. Southern Min ), Wu (74 million, e.g. Shanghainese ), and Yue (68 million, e.g. Cantonese ). These branches are unintelligible to each other, and many of their subgroups are unintelligible with
2420-610: A unified standard. The earliest examples of Old Chinese are divinatory inscriptions on oracle bones dated to c. 1250 BCE , during the Late Shang . The next attested stage came from inscriptions on bronze artifacts dating to the Western Zhou period (1046–771 BCE), the Classic of Poetry and portions of the Book of Documents and I Ching . Scholars have attempted to reconstruct
2541-563: A variety of Yue from a small coastal area around Taishan, Guangdong . In parts of South China, the dialect of a major city may be only marginally intelligible to its neighbors. For example, Wuzhou and Taishan are located approximately 260 km (160 mi) and 190 km (120 mi) away from Guangzhou respectively, but the Yue variety spoken in Wuzhou is more similar to the Guangzhou dialect than
SECTION 20
#17328515815212662-473: A wide variety of uploaded media files. Its richest functionality is in the area of images, where image galleries and thumbnails can be generated with relative ease. There is also support for Exif metadata . The use of MediaWiki to operate the Wikimedia Commons , one of the largest free content media archives, has driven the need for further functionality in this area. For WYSIWYG editing, VisualEditor
2783-591: Is free and open-source wiki software originally developed by Magnus Manske for use on Misplaced Pages on January 25, 2002, and further improved by Lee Daniel Crocker , after which development has been coordinated by the Wikimedia Foundation . It powers several wiki hosting websites across the Internet, as well as most websites hosted by the Wikimedia Foundation including Misplaced Pages, Wiktionary , Wikimedia Commons , Wikiquote , Meta-Wiki and Wikidata , which define
2904-596: Is Taishanese. Wuzhou is located directly upstream from Guangzhou on the Pearl River , whereas Taishan is to Guangzhou's southwest, with the two cities separated by several river valleys. In parts of Fujian , the speech of some neighbouring counties or villages is mutually unintelligible. Local varieties of Chinese are conventionally classified into seven dialect groups, largely based on the different evolution of Middle Chinese voiced initials: Proportions of first-language speakers The classification of Li Rong , which
3025-502: Is also used for feature and enhancement requests. When Misplaced Pages was launched in January 2001, it ran on an existing wiki software system, UseModWiki . UseModWiki is written in the Perl programming language, and stores all wiki pages in text ( .txt ) files. This software soon proved to be limiting, in both functionality and performance. In mid-2001, Magnus Manske —a developer and student at
3146-765: Is also used internally by a large number of companies, including Novell and Intel . Notable usages of MediaWiki within governments include Intellipedia , used by the United States Intelligence Community , Diplopedia , used by the United States Department of State , and milWiki, a part of milSuite used by the United States Department of Defense . United Nations agencies such as the United Nations Development Programme and INSTRAW chose to implement their wikis using MediaWiki, because "this software runs Misplaced Pages and
3267-476: Is available in more than 400 languages. The software has more than 1,000 configuration settings and more than 1,800 extensions available for enabling various features to be added or changed. MediaWiki is free and open-source and is distributed under the terms of the GNU General Public License version 2 or any later version. Its documentation, located at its official website at www.mediawiki.org,
3388-400: Is available to use in MediaWiki which simplifying editing process for editors and has been bundled since MediaWiki 1.35. Other extensions exist for handling WYSIWYG editing to different degrees. Among the features of MediaWiki to assist in tracking edits is a Recent Changes feature that provides a list of recent edits to the wiki. This list contains basic information about those edits such as
3509-485: Is called either 华语 ; 華語 ; Huáyǔ or 汉语 ; 漢語 ; Hànyǔ ). Standard Chinese is based on the Beijing dialect of Mandarin. The governments of both China and Taiwan intend for speakers of all Chinese speech varieties to use it as a common language of communication. Therefore, it is used in government agencies, in the media, and as a language of instruction in schools. Diglossia is common among Chinese speakers. For example,
3630-444: Is copyright infringement. Some even copy all Misplaced Pages entries, which violates the copyright law of mainland China. Due to copyright issues, the content of Baidu Baike does not meet various accepted definitions of openness or freedom . In addition to copyright concerns, criticism of Baidu Baike mainly focuses on its academic merits and lack of neutrality. The former is manifested in the lack of detailed and clear source references for
3751-422: Is largely incompatible with MediaWiki. Cloud hosting can eliminate the need to deploy a new server. An installation PHP script is accessed via a web browser to initialize the wiki's settings. It prompts the user for a minimal set of required parameters, leaving further changes, such as enabling uploads, adding a site logo, and installing extensions, to be made by modifying configuration settings contained in
Baidu Baike - Misplaced Pages Continue
3872-405: Is minor. Correcting spelling, grammar or punctuation are examples of minor edits, whereas adding paragraphs of new text is an example of a non-minor edit. Sometimes while one user is editing, a second user saves an edit to the same part of the page. Then, when the first user attempts to save the page, an edit conflict occurs. The second user is then given an opportunity to merge their content into
3993-595: Is often described as a 'monosyllabic' language. However, this is only partially correct. It is largely accurate when describing Old and Middle Chinese; in Classical Chinese, around 90% of words consist of a single character that corresponds one-to-one with a morpheme , the smallest unit of meaning in a language. In modern varieties, it usually remains the case that morphemes are monosyllabic—in contrast, English has many multi-syllable morphemes, both bound and free , such as 'seven', 'elephant', 'para-' and '-able'. Some of
4114-445: Is only about an eighth as many as English. All varieties of spoken Chinese use tones to distinguish words. A few dialects of north China may have as few as three tones, while some dialects in south China have up to 6 or 12 tones, depending on how one counts. One exception from this is Shanghainese which has reduced the set of tones to a two-toned pitch accent system much like modern Japanese. A very common example used to illustrate
4235-416: Is optimized to efficiently handle large projects, which can have terabytes of content and hundreds of thousands of views per second. Because Misplaced Pages is one of the world's largest and most visited websites, achieving scalability through multiple layers of caching and database replication has been a major concern for developers. Another major aspect of MediaWiki is its internationalization; its interface
4356-560: Is outlined on the Help page in the terms of use section. In it, Baidu states that by adding content to the site, users agree to waive the copyright of their contributions to Baidu. It also states that the content must not violate intellectual property law and that content using the Creative Commons and/or GNU Free Documentation License (GFDL) must respect the limitations of such licenses. Despite this, Baidu has received criticism for violating
4477-535: Is released under the Creative Commons BY-SA 4.0 license and partly in the public domain . Specifically, the manuals and other content at MediaWiki.org are Creative Commons -licensed, while the set of help pages intended to be freely copied into fresh wiki installations and/or distributed with MediaWiki software is public domain. This was done to eliminate legal issues arising from the help pages being imported into wikis with licenses that are incompatible with
4598-668: Is specifically meant. However, when one of the above words forms part of a compound, the disambiguating syllable is generally dropped and the resulting word is still disyllabic. For example, 石 ; shí alone, and not 石头 ; 石頭 ; shítou , appears in compounds as meaning 'stone' such as 石膏 ; shígāo ; 'plaster', 石灰 ; shíhuī ; 'lime', 石窟 ; shíkū ; 'grotto', 石英 ; 'quartz', and 石油 ; shíyóu ; 'petroleum'. Although many single-syllable morphemes ( 字 ; zì ) can stand alone as individual words, they more often than not form multi-syllable compounds known as 词 ; 詞 ; cí , which more closely resembles
4719-484: Is the largest reference work based purely on character and its literary variants. The CC-CEDICT project (2010) contains 97,404 contemporary entries including idioms, technology terms, and names of political figures, businesses, and products. The 2009 version of the Webster's Digital Chinese Dictionary (WDCD), based on CC-CEDICT, contains over 84,000 entries. The most comprehensive pure linguistic Chinese-language dictionary,
4840-489: Is therefore guaranteed to be thoroughly tested, will continue to be developed well into the future, and future technicians on these wikis will be more likely to have exposure to MediaWiki than any other wiki software." The Free Software Foundation uses MediaWiki to implement the LibrePlanet site. MediaWiki provides a rich core feature set and a mechanism to attach extensions to provide additional functionality. Due to
4961-496: Is used as an everyday language in Hong Kong and Macau . The designation of various Chinese branches remains controversial. Some linguists and most ordinary Chinese people consider all the spoken varieties as one single language, as speakers share a common national identity and a common written form. Others instead argue that it is inappropriate to refer to major branches of Chinese such as Mandarin, Wu, and so on as "dialects" because
Baidu Baike - Misplaced Pages Continue
5082-497: Is used in education, media, formal speech, and everyday life—though Mandarin is increasingly taught in schools due to the mainland's growing influence. Historically, the Chinese language has spread to its neighbors through a variety of means. Northern Vietnam was incorporated into the Han dynasty (202 BCE – 220 CE) in 111 BCE, marking the beginning of a period of Chinese control that ran almost continuously for
5203-540: Is used in the Language Atlas of China (1987), distinguishes three further groups: Some varieties remain unclassified, including the Danzhou dialect on Hainan , Waxianghua spoken in western Hunan , and Shaozhou Tuhua spoken in northern Guangdong . Standard Chinese is the standard language of China (where it is called 普通话 ; pǔtōnghuà ) and Taiwan, and one of the four official languages of Singapore (where it
5324-428: Is very complex, with a large number of consonants and vowels, but they are probably not all distinguished in any single dialect. Most linguists now believe it represents a diasystem encompassing 6th-century northern and southern standards for reading the classics. The complex relationship between spoken and written Chinese is an example of diglossia : as spoken, Chinese varieties have evolved at different rates, while
5445-735: The Qieyun rime dictionary (601 CE), and a late period in the 10th century, reflected by rhyme tables such as the Yunjing constructed by ancient Chinese philologists as a guide to the Qieyun system. These works define phonological categories but with little hint of what sounds they represent. Linguists have identified these sounds by comparing the categories with pronunciations in modern varieties of Chinese , borrowed Chinese words in Japanese, Vietnamese, and Korean, and transcription evidence. The resulting system
5566-525: The Korean , Japanese and Vietnamese languages, and today comprise over half of their vocabularies. This massive influx led to changes in the phonological structure of the languages, contributing to the development of moraic structure in Japanese and the disruption of vowel harmony in Korean. Borrowed Chinese morphemes have been used extensively in all these languages to coin compound words for new concepts, in
5687-680: The May Fourth Movement beginning in 1919. After the fall of the Northern Song dynasty and subsequent reign of the Jurchen Jin and Mongol Yuan dynasties in northern China, a common speech (now called Old Mandarin ) developed based on the dialects of the North China Plain around the capital. The 1324 Zhongyuan Yinyun was a dictionary that codified the rhyming conventions of new sanqu verse form in this language. Together with
5808-579: The National Language Unification Commission finally settled on the Beijing dialect in 1932. The People's Republic founded in 1949 retained this standard but renamed it 普通话 ; 普通話 ; pǔtōnghuà ; 'common speech'. The national language is now used in education, the media, and formal situations in both mainland China and Taiwan. In Hong Kong and Macau , Cantonese is the dominant spoken language due to cultural influence from Guangdong immigrants and colonial-era policies, and
5929-500: The University of Cologne , as well as a Misplaced Pages editor —began working on new software that would replace UseModWiki, specifically designed for use by Misplaced Pages. This software was written in the PHP scripting language, and stored all of its information in a MySQL database. The new software was largely developed by August 24, 2001, and a test wiki for it was established shortly thereafter. The first full implementation of this software
6050-476: The Wikimedia Foundation , took up the role of release manager . Major milestones in MediaWiki's development have included: the categorization system (2004); parser functions, (2006); Flagged Revisions , (2008); the " ResourceLoader ", a delivery system for CSS and JavaScript (2011); and the VisualEditor , a "what you see is what you get" ( WYSIWYG ) editing platform (2013). The contest of designing
6171-496: The Wikimedia Foundation . MediaWiki developers participate in the Google Summer of Code by facilitating the assignment of mentors to students wishing to work on MediaWiki core and extension projects. During the year prior to November 2012, there were about two hundred developers who had committed changes to the MediaWiki core or extensions. Major MediaWiki releases are generated approximately every six months by taking snapshots of
SECTION 50
#17328515815216292-473: The oracle bone inscriptions created during the Shang dynasty c. 1250 BCE . The phonetic categories of Old Chinese can be reconstructed from the rhymes of ancient poetry. During the Northern and Southern period , Middle Chinese went through several sound changes and split into several varieties following prolonged geographic and political separation. The Qieyun , a rime dictionary , recorded
6413-591: The phonology of Old Chinese by comparing later varieties of Chinese with the rhyming practice of the Classic of Poetry and the phonetic elements found in the majority of Chinese characters. Although many of the finer details remain unclear, most scholars agree that Old Chinese differs from Middle Chinese in lacking retroflex and palatal obstruents but having initial consonant clusters of some sort, and in having voiceless nasals and liquids. Most recent reconstructions also describe an atonal language with consonant clusters at
6534-607: The "Baike Expert Group" with a level 4 encyclopedia and a pass rate of over 85% and professional users with a level 6 encyclopedia and a pass rate of over 85%. Open, ordinary users no longer have the right to view historical versions of entries and use the historical version comparison function. Baidu Encyclopedia officially claims that this is to "ensure that the majority of netizens obtain the accuracy of entry information and avoid interference from outdated information in various historical versions". On 24 April 2024, Baidu posted an announcement on Baidu Baike that it would end support for
6655-633: The 12-volume Hanyu Da Cidian , records more than 23,000 head Chinese characters and gives over 370,000 definitions. The 1999 revised Cihai , a multi-volume encyclopedic dictionary reference work, gives 122,836 vocabulary entry definitions under 19,485 Chinese characters, including proper names, phrases, and common zoological, geographical, sociological, scientific, and technical terms. The 2016 edition of Xiandai Hanyu Cidian , an authoritative one-volume dictionary on modern standard Chinese language as used in mainland China, has 13,000 head characters and defines 70,000 words. MediaWiki MediaWiki
6776-524: The API. MediaWiki supports rich content generated through specialized syntax. For example, the software comes with optional support for rendering mathematical formulas using LaTeX and a special parser written in OCaml named texvc. Similar functionality for other content, ranging from graphical timelines over mathematical plotting and musical scores to Egyptian hieroglyphs , is available via extensions. The software has become more powerful at dealing with
6897-514: The API. The API is accessed via URLs such as https://en.wikipedia.org/w/api.php?action=query&list=recentchanges . In this case, the query would be asking Misplaced Pages for information relating to the last 10 edits to the site. One of the perceived advantages of the API is its language independence; it listens for HTTP connections from clients and can send a response in a variety of formats, such as XML , serialized PHP, or JSON . Client code has been developed to provide layers of abstraction to
7018-517: The Creative Commons license. MediaWiki's development has generally favored the use of open-source media formats . MediaWiki has an active volunteer community for development and maintenance. Users who have made meaningful contributions to the project by submitting patches are generally, upon request, granted access to commit revisions to the project's Git / Gerrit repository . There are also paid programmers who primarily develop projects for
7139-824: The GFDL license when using content directly from Misplaced Pages, infringing copyright on Hudong.com and tending to plagiarize other sites. There are a few organized groups within the Baidu Baike community. The Baike Elite Team consists of about 340 core contributors that are directed by Baidu and serve as community liaisons. There is also a group of campus ambassadors made of students and an expert team with over 2,500 members, including university professors. Baidu Baike engages in partnerships with cultural institutions in China and abroad to digitize cultural heritage . In late 2017, Baidu signed an agreement in China to create "2,000 online digital museums" in
7260-555: The Latin-based Vietnamese alphabet . English words of Chinese origin include tea from Hokkien 茶 ( tê ), dim sum from Cantonese 點心 ( dim2 sam1 ), and kumquat from Cantonese 金橘 ( gam1 gwat1 ). The sinologist Jerry Norman has estimated that there are hundreds of mutually unintelligible varieties of Chinese. These varieties form a dialect continuum , in which differences in speech generally become more pronounced as distances increase, though
7381-517: The MediaWiki Language Extension Bundle, are designed to further enhance the multilingualism and internationalization of MediaWiki. Installation of MediaWiki requires that the user have administrative privileges on a server running both PHP and a compatible type of SQL database . Some users find that setting up a virtual host is helpful if the majority of one's site runs under a framework (such as Zope or Ruby on Rails ) that
SECTION 60
#17328515815217502-421: The data contained in the MediaWiki databases. Client programs can use the API to log in, get data, and post changes. The API supports thin web-based JavaScript clients and end-user applications (such as vandal-fighting tools). The API can be accessed by the backend of another web site. An extensive Python bot library, Pywikibot , and a popular semi-automated tool called AutoWikiBrowser , also interface with
7623-464: The dedicated Baidu Baike application on 30 June to focus on "better user experiences". Articles are written and edited by registered users and reviewed by administrators prior to publication. Contributions from registered users are evaluated under a scoring system. Although the test version was called the Baidu Wiki, official press releases and pages from Baidu Baike itself stated that the system for this
7744-441: The development branch, which is kept continuously in a runnable state; minor releases , or point releases , are issued as needed to correct bugs (especially security problems). MediaWiki is developed on a continuous integration development model, in which software changes are pushed live to Wikimedia sites on regular basis. MediaWiki also has a public bug tracker, phabricator.wikimedia.org , which runs Phabricator . The site
7865-676: The differences between wiki markup and HTML: "Take some more tea ," the March Hare said to Alice, very earnestly. "I've had nothing yet," Alice replied in an offended tone: "so I can't take more." "You mean you can't take less ," said the Hatter: "it's very easy to take more than nothing." (Quotation above from Alice's Adventures in Wonderland by Lewis Carroll ) MediaWiki's default page-editing tools have been described as somewhat challenging to learn. A survey of students assigned to use
7986-399: The different spoken dialects varies, but in general, there has been a tendency to a reduction in sounds from Middle Chinese. The Mandarin dialects in particular have experienced a dramatic decrease in sounds and so have far more polysyllabic words than most other spoken varieties. The total number of syllables in some varieties is therefore only about a thousand, including tonal variation, which
8107-484: The difficulties involved in determining the difference between language and dialect, other terms have been proposed. These include topolect , lect , vernacular , regional , and variety . Syllables in the Chinese languages have some unique characteristics. They are tightly related to the morphology and also to the characters of the writing system, and phonologically they are structured according to fixed rules. The structure of each syllable consists of
8228-441: The earliest such features is namespaces . One of Misplaced Pages's earliest problems had been the separation of encyclopedic content from pages pertaining to maintenance and communal discussion, as well as personal pages about encyclopedia editors. Namespaces are prefixes before a page title (such as " User: " or " Talk: ") that serve as descriptors for the page's purpose and allow multiple pages with different functions to exist under
8349-451: The editing user, the edit summary, the page edited, as well as any tags (e.g. "possible vandalism ") added by customizable abuse filters and other extensions to aid in combating unhelpful edits. On more active wikis, so many edits occur that it is hard to track Recent Changes manually. Anti-vandal software, including user-assisted tools, is sometimes employed on such wikis to process Recent Changes items. Server load can be reduced by sending
8470-743: The encyclopedia note that it censors its content in accordance with the requirements of the Chinese government . Being in the jurisdiction of the Chinese government, Baidu is required to censor content on their encyclopedia in accordance to relevant laws and regulations such as the Cybersecurity Law of the People's Republic of China and the National Intelligence Law . All editors must register accounts using their real names before editing, and administrators review all edits before they become available to
8591-566: The end of the syllable, developing into tone distinctions in Middle Chinese. Several derivational affixes have also been identified, but the language lacks inflection , and indicated grammatical relationships using word order and grammatical particles . Middle Chinese was the language used during Northern and Southern dynasties and the Sui , Tang , and Song dynasties (6th–10th centuries CE). It can be divided into an early period, reflected by
8712-413: The entries it contains, leading to the accuracy of its content being questioned. Baidu Baike does not limit entries based on notability. In 2011, an online poll involving 561 Baidu Baike users showed that the main criticisms of users on Baidu Baike included "insufficient review and proliferation of wrong content", "too much plagiarism and copy-pasting and too little original work", "information contained on
8833-414: The entries" and "lack of authority and credibility". Baidu 10 Mythical Creatures was a humorous hoax and internet meme originating from Baidu Baike. Chinese language Chinese ( simplified Chinese : 汉语 ; traditional Chinese : 漢語 ; pinyin : Hànyǔ ; lit. ' Han language' or 中文 ; Zhōngwén ; 'Chinese writing') is a group of languages spoken natively by
8954-486: The entry corresponding to the search term, its link will usually be ranked at the top of the search page. Baidu Baike has been criticised for its censorship , copyright violations , commercialist practices and abundance of unsourced information . Only registered users can edit the articles. In April 2024, Baidu announced that the dedicated Baidu Baike app, but not Baidu Baike itself, was to be shut down in June. Baidu Baike
9075-594: The ethnic Han Chinese majority and many minority ethnic groups in China , as well as by various communities of the Chinese diaspora . Approximately 1.35 billion people, or 17% of the global population, speak a variety of Chinese as their first language . Chinese languages form the Sinitic branch of the Sino-Tibetan language family. The spoken varieties of Chinese are usually considered by native speakers to be dialects of
9196-413: The first one, 十 , normally appears in monosyllabic form in spoken Mandarin; the rest are normally used in the polysyllabic forms of respectively. In each, the homophone was disambiguated by the addition of another morpheme, typically either a near-synonym or some sort of generic word (e.g. 'head', 'thing'), the purpose of which is to indicate which of the possible meanings of the other, homophonic syllable
9317-485: The first results. The Chinese government has cut off access to the Chinese Misplaced Pages for residents of mainland China since 2019. In March 2021, Chinese netizens claimed that South Korean netizens changed their entries related to Chinese history on a large scale through the historical version comparison function of Baidu Baike. Baidu Baike stipulates that the historical version function is only available to users of
9438-491: The form of a word), to indicate a word's function within a sentence. In other words, Chinese has very few grammatical inflections —it possesses no tenses , no voices , no grammatical number , and only a few articles . They make heavy use of grammatical particles to indicate aspect and mood . In Mandarin, this involves the use of particles such as 了 ; le ; ' PFV ', 还 ; 還 ; hái ; 'still', and 已经 ; 已經 ; yǐjīng ; 'already'. Chinese has
9559-550: The government of the People's Republic of China, with Singapore officially adopting them in 1976. Traditional characters are used in Taiwan, Hong Kong, Macau, and among Chinese-speaking communities overseas . Linguists classify all varieties of Chinese as part of the Sino-Tibetan language family , together with Burmese , Tibetan and many other languages spoken in the Himalayas and the Southeast Asian Massif . Although
9680-581: The language of administration and scholarship, a position it would retain until the late 19th century in Korea and (to a lesser extent) Japan, and the early 20th century in Vietnam. Scholars from different lands could communicate, albeit only in writing, using Literary Chinese. Although they used Chinese solely for written communication, each country had its own tradition of reading texts aloud using what are known as Sino-Xenic pronunciations . Chinese words with these pronunciations were also extensively imported into
9801-531: The largest number of entries in the world of any Chinese-language online encyclopedia. Baidu officially stated that the Baidu Encyclopedia also serves as the information storage space provided by Baidu for netizens. Baidu Baike advocates "equality, collaboration, sharing, and freedom" in spirit. It combines Baidu Baike with its search engines. When searching using the Baidu search engine, if Baidu Baike has included
9922-516: The late 19th century. Today Japanese is written with a composite script using both Chinese characters called kanji , and kana. Korean is written exclusively with hangul in North Korea, although knowledge of the supplementary Chinese characters called hanja is still required, and hanja are increasingly rarely used in South Korea. As a result of its historical colonization by France, Vietnamese now uses
10043-459: The license at all, [...] That might be the biggest copyright violation we have. We have others." Users of the Chinese Misplaced Pages created a list of entries allegedly infringing Misplaced Pages's copyright. The Wikimedia Foundation decided not to pursue any legal action. In response to criticism, Baidu stressed that Baike is a platform for user-generated content . Baidu Baike users often fail to list
10164-455: The more conservative modern varieties, usually found in the south, have largely monosyllabic words , especially with basic vocabulary. However, most nouns, adjectives, and verbs in modern Mandarin are disyllabic. A significant cause of this is phonetic erosion : sound changes over time have steadily reduced the number of possible syllables in the language's inventory. In modern Mandarin, there are only around 1,200 possible syllables, including
10285-425: The mutual unintelligibility between them is too great. However, calling major Chinese branches "languages" would also be wrong under the same criterion, since a branch such as Wu, itself contains many mutually unintelligible varieties, and could not be properly called a single language. There are also viewpoints pointing out that linguists often ignore mutual intelligibility when varieties share intelligibility with
10406-583: The nasal sonorant consonants /m/ and /ŋ/ can stand alone as their own syllable. In Mandarin much more than in other spoken varieties, most syllables tend to be open syllables, meaning they have no coda (assuming that a final glide is not analyzed as a coda), but syllables that do have codas are restricted to nasals /m/ , /n/ , /ŋ/ , the retroflex approximant /ɻ/ , and voiceless stops /p/ , /t/ , /k/ , or /ʔ/ . Some varieties allow most of these codas, whereas others, such as Standard Chinese, are limited to only /n/ , /ŋ/ , and /ɻ/ . The number of sounds in
10527-668: The next three years. In early 2018, partnerships were expanded to cover 1,000 Spanish cities and tourist sites, including the Camino de Santiago , the Sagrada Família and the Prado Museum . Baidu Baike has been accused by some critics of censorship and by the former chair of the Wikimedia Foundation of copyright violation. Additionally, Baidu Baike has also been accused of commercialization . Due to unsourced information in Baidu Baike, there are also concerns over its correctness. Critics of
10648-478: The next time it is refreshed. As with the recent changes page, recent edits that appear on the watchlist contain clickable links for easy review of the article history and specific changes made. There is also the capability to review all edits made by any particular user. In this way, if an edit is identified as problematic, it is possible to check the user's other edits for issues. MediaWiki allows one to link to specific versions of articles. This has been useful to
10769-470: The number in Chinese Misplaced Pages . During the conference WWW 2008 of the World Wide Web Consortium , Baidu 's William Chang said, "There is, in fact, no reason for China to use Misplaced Pages ... It's very natural for China to make its own products." When searching with the search engine Baidu, the link to the corresponding entry in Baidu Baike, if it exists, will be put as the first result or one of
10890-538: The other varieties within the same branch (e.g. Southern Min). There are, however, transitional areas where varieties from different branches share enough features for some limited intelligibility, including New Xiang with Southwestern Mandarin , Xuanzhou Wu Chinese with Lower Yangtze Mandarin , Jin with Central Plains Mandarin and certain divergent dialects of Hakka with Gan . All varieties of Chinese are tonal at least to some degree, and are largely analytic . The earliest attested written Chinese consists of
11011-553: The page as it now exists following the first user's page save. MediaWiki's user interface has been localized in many different languages. A language for the wiki content itself can also be set, to be sent in the "Content-Language" HTTP header and "lang" HTML attribute . VisualEditor has its own integrated wikitext editing interface known as 2017 wikitext editor, the older editing interface is known as 2010 wikitext editor. MediaWiki has an extensible web API ( application programming interface ) that provides direct, high-level access to
11132-498: The public. This censorship has attracted criticism. In 2013, Citizen Lab released a report saying that censorship is known to take place on Baidu Baike but "identifying outright instances or patterns in censorship can be difficult due to the (mostly) user-generated nature and oversight of the content." In 2007, Florence Devouard , then Chair of the Board of Trustees of the Wikimedia Foundation, said that "They [Baidu Baike] do not respect
11253-490: The purpose of creating an encyclopedia, where accuracy in titles is important. MediaWiki uses an extensible lightweight wiki markup designed to be easier to use and learn than HTML . Tools exist for converting content such as tables between MediaWiki markup and HTML. Efforts have been made to create a MediaWiki markup spec, but a consensus seems to have been reached that Wikicode requires context-sensitive grammar rules. The following side-by-side comparison illustrates
11374-404: The rate of change varies immensely. Generally, mountainous South China exhibits more linguistic diversity than the North China Plain . Until the late 20th century, Chinese emigrants to Southeast Asia and North America came from southeast coastal areas, where Min, Hakka, and Yue dialects were spoken. Specifically, most Chinese immigrants to North America until the mid-20th century spoke Taishanese ,
11495-552: The related subject dropping . Although the grammars of the spoken varieties share many traits, they do possess differences. The entire Chinese character corpus since antiquity comprises well over 50,000 characters, of which only roughly 10,000 are in use and only about 3,000 are frequently used in Chinese media and newspapers. However, Chinese characters should not be confused with Chinese words. Because most Chinese words are made up of two or more characters, there are many more Chinese words than characters. A more accurate equivalent for
11616-509: The relationship was first proposed in the early 19th century and is now broadly accepted, reconstruction of Sino-Tibetan is much less developed than that of families such as Indo-European or Austroasiatic . Difficulties have included the great diversity of the languages, the lack of inflection in many of them, and the effects of language contact. In addition, many of the smaller languages are spoken in mountainous areas that are difficult to reach and are often also sensitive border zones. Without
11737-489: The same title. For instance, a page titled " [[The Terminator]] ", in the default namespace, could describe the 1984 movie starring Arnold Schwarzenegger , while a page titled " [[User:The Terminator]] " could be a profile describing a user who chooses this name as a pseudonym. More commonly, each namespace has an associated " Talk: " namespace, which can be used to discuss its contents, such as " User talk: " or " Template talk: ". The purpose of having discussion pages
11858-415: The same topic in other editions of Misplaced Pages. This was superseded by the launch of Wikidata. Page tabs are displayed at the top of pages. These tabs allow users to perform actions or view pages that are related to the current page. The available default actions include viewing, editing, and discussing the current page. The specific tabs displayed depend on whether the user is logged into the wiki and whether
11979-458: The same way as namespaces. A set of interwiki prefixes can be configured to cause, for instance, a page title of wikiquote:Jimbo Wales to direct the user to the Jimbo Wales article on Wikiquote . Unlike internal wikilinks, interwiki links lack page existence detection functionality, and accordingly there is no way to tell whether a blue interwiki link is broken or not. Interlanguage links are
12100-698: The scientific community, in that expert peer reviewers could analyse articles, improve them and provide links to the trusted version of that article. Navigation through the wiki is largely through internal wikilinks. MediaWiki's wikilinks implement page existence detection, in which a link is colored blue if the target page exists on the local wiki and red if it does not. If a user clicks on a red link, they are prompted to create an article with that title. Page existence detection makes it practical for users to create "wikified" articles—that is, articles containing links to other pertinent subjects—without those other articles being yet in existence. Interwiki links function much
12221-505: The six official languages of the United Nations . Standard Chinese is based on the Beijing dialect of Mandarin and was first officially adopted in the 1930s. The language is written primarily using a logography of Chinese characters , largely shared by readers who may otherwise speak mutually unintelligible varieties. Since the 1950s, the use of simplified characters has been promoted by
12342-557: The slightly later Menggu Ziyun , this dictionary describes a language with many of the features characteristic of modern Mandarin dialects. Up to the early 20th century, most Chinese people only spoke their local variety. Thus, as a practical measure, officials of the Ming and Qing dynasties carried out the administration of the empire using a common language based on Mandarin varieties , known as 官话 ; 官話 ; Guānhuà ; 'language of officials'. For most of this period, this language
12463-401: The small navigation links that show up in the sidebar in most MediaWiki skins that connect an article with related articles in other languages within the same Wiki family. This can provide language-specific communities connected by a larger context, with all wikis on the same server or each on its own server. Previously, Misplaced Pages used interlanguage links to link an article to other articles on
12584-420: The source and mark the original author or the copyright statement of the original website when reprinting content from copyrighted websites or official publications, or ignore its license terms, causing infringement of the copyright of others. For example, Misplaced Pages entries use Copyleft 's free content licensing terms, while Baidu Baike’s copyright license is not free. Baidu Baike’s use of Misplaced Pages’s entry content
12705-479: The strong emphasis on multilingualism in the Wikimedia projects, internationalization and localization has received significant attention by developers. The user interface has been fully or partially translated into more than 400 languages on translatewiki.net , and can be further customized by site administrators (the entire interface is editable through the wiki). Several extensions, most notably those collected in
12826-609: The tonal distinctions, compared with about 5,000 in Vietnamese (still a largely monosyllabic language), and over 8,000 in English. Most modern varieties tend to form new words through polysyllabic compounds . In some cases, monosyllabic words have become disyllabic formed from different characters without the use of compounding, as in 窟窿 ; kūlong from 孔 ; kǒng ; this is especially common in Jin varieties. This phonological collapse has led to
12947-547: The traditional Western notion of a word. A Chinese cí can consist of more than one character–morpheme, usually two, but there can be three or more. Examples of Chinese words of more than two syllables include 汉堡包 ; 漢堡包 ; hànbǎobāo ; 'hamburger', 守门员 ; 守門員 ; shǒuményuán ; 'goalkeeper', and 电子邮件 ; 電子郵件 ; diànzǐyóujiàn ; 'e-mail'. All varieties of modern Chinese are analytic languages : they depend on syntax (word order and sentence structure), rather than inflectional morphology (changes in
13068-512: The use of tones in Chinese is the application of the four tones of Standard Chinese, along with the neutral tone, to the syllable ma . The tones are exemplified by the following five Chinese words: In contrast, Standard Cantonese has six tones. Historically, finals that end in a stop consonant were considered to be " checked tones " and thus counted separately for a total of nine tones. However, they are considered to be duplicates in modern linguistics and are no longer counted as such: Chinese
13189-489: The user has sysop privileges on the wiki. For instance, the ability to move a page or add it to one's watchlist is usually restricted to logged-in users. The site administrator can add or remove tabs by using JavaScript or installing extensions. Each page has an associated history page from which the user can access every version of the page that has ever existed and generate diffs between two versions of his choice. Users' contributions are displayed not only here, but also via
13310-452: The words in newspapers, and 60% of the words in science magazines. Vietnam, Korea, and Japan each developed writing systems for their own languages, initially based on Chinese characters , but later replaced with the hangul alphabet for Korean and supplemented with kana syllabaries for Japanese, while Vietnamese continued to be written with the complex chữ Nôm script. However, these were limited to popular literature until
13431-508: The written language used throughout China changed comparatively little, crystallizing into a prestige form known as Classical or Literary Chinese . Literature written distinctly in the Classical form began to emerge during the Spring and Autumn period . Its use in writing remained nearly universal until the late 19th century, culminating with the widespread adoption of written vernacular Chinese with
13552-453: Was a koiné based on dialects spoken in the Nanjing area, though not identical to any single dialect. By the middle of the 19th century, the Beijing dialect had become dominant and was essential for any business with the imperial court. In the 1930s, a standard national language ( 国语 ; 國語 ; Guóyǔ ), was adopted. After much dispute between proponents of northern and southern dialects and an abortive attempt at an artificial pronunciation,
13673-515: Was also written in PHP, with a MySQL backend, and kept the basic interface of the phase II software, but with the added functionality of a wider scalability . The "phase III" software went live on Misplaced Pages in July 2002. The Wikimedia Foundation was announced on June 20, 2003. In July, Misplaced Pages contributor Daniel Mayer suggested the name "MediaWiki" for the software, as a play on "Wikimedia". The MediaWiki name
13794-427: Was chosen to represent MediaWiki rather than Misplaced Pages, with the second place logo being used for the Wikimedia Foundation. The double square brackets ( [[ ]] ) symbolize the syntax MediaWiki uses for creating hyperlinks to other wiki pages; while the sunflower represents the diversity of content on Misplaced Pages, its constant growth, and the wilderness. Later, Brooke Vibber , the chief technical officer of
13915-734: Was founded by Robin Li in April 2006, following the Chinese government's decision to censor Misplaced Pages in 2005. The beta version of Baidu Baike was launched on 20 April 2006. 5 to 20 April was used for a period of internal testing. According to the serial number in the entry address bar , the first 10 entries were: Baidu Baike, "Entries", an edit experiment (a sand table page, which has been deleted), Mántou (steamed buns), orchid cultivation (deleted due to typos), Yandang Mountain , Lingfeng, Dalongqiu, Wudafusong and Red Food. After 20 days, it had more than 300,000 registered users and more than 100,000 articles, surpassing
14036-512: Was gradually phased in, beginning in August 2003. The name has frequently caused confusion due to its (intentional) similarity to the "Wikimedia" name (which itself is similar to "Misplaced Pages"). The old product logo was created by Erik Möller , using a flower photograph taken by Florence Nibart-Devouard , and was originally submitted to the logo contest for a new Misplaced Pages logo , held from July 20 to August 27, 2003. The logo came in third place, and
14157-481: Was hidden by their written form. Often different compounds for the same concept were in circulation for some time before a winner emerged, and sometimes the final choice differed between countries. The proportion of vocabulary of Chinese origin thus tends to be greater in technical, abstract, or formal language. For example, in Japan, Sino-Japanese words account for about 35% of the words in entertainment magazines, over half
14278-545: Was not a Wiki. Baidu Baike does not use MediaWiki . The editors are divided into 15 levels based on their experience points. In Baidu Baike's articles, headings, bolding text styles and hyperlinks are supported. Each heading can be listened to ( 播报 ; lit. ' broadcast ' ) separately. The references that are used are listed at the bottom of each page. The site supports editing, commenting, printing articles and viewing an article's history. Users can access multiple editing functions, including: The copyright policy
14399-493: Was the new Meta Misplaced Pages on November 9, 2001. There was a desire to have it implemented immediately on the English-language Misplaced Pages. However, Manske was apprehensive about any potential bugs harming the nascent website during the period of the final exams he had to complete immediately prior to Christmas; this led to the launch on the English-language Misplaced Pages being delayed until January 25, 2002. The software
14520-456: Was the use of " free links " instead of CamelCase . When MediaWiki was created, it was typical for wikis to require text like "WorldWideWeb" to create a link to a page about the World Wide Web ; links in MediaWiki, on the other hand, are created by surrounding words with double square brackets, and any spaces between them are left intact, e.g. [[World Wide Web]] . This change was logical for
14641-486: Was then, gradually, deployed on all the Misplaced Pages language sites of that time. This software was referred to as "the PHP script" and as "phase II", with the name "phase I", retroactively given to the use of UseModWiki. Increasing usage soon caused load problems to arise again, and soon after, another rewrite of the software began; this time being done by Lee Daniel Crocker , which became known as "phase III". This new software
#520479