2014年9月21日 星期日

Hu Shih in The Google Ngram Viewer

Graph these comma-separated phrases:
  case-insensitive
between  and  from the corpus  with smoothing of .   
Replaced 胡適 with 胡适 to match how we processed the books.
19001910192019301940195019601970198019902000(click on line/label for focus)0.000000%0.000020%0.000040%0.000060%0.000080%0.000100%0.000120%0.000140%0.000160%0.000180%0.000200%0.000220%0.000240%胡适1951‪胡适‬0.0000364582%




The Google Ngram Viewer is an online phrase-usage graphing tool originally developed by Google, inspired by a prototype (called "Bookworm") created by Jean-Baptiste Michel and Erez Aiden from Harvard and Yuan Shen from MIT. It charts the yearly count of selected n-grams (letter combinations)[n] or words and phrases,[1][2] as found in over 5.2 million books digitized by Google Inc (up to 2012).[3][4] The words or phrases (or ngrams) are matched by case-sensitive spelling, comparing exact uppercase letters,[2] and plotted on the graph if found in 40 or more books during each year (of the requested year-range).[5] The Ngram tool was released in mid-December 2010[1][3] and now supports searches for parts of speech and wildcards.
The word-search database was created by Google Books and was based originally on 5.2 million books published between 1500 and 2008. Collectively, the corpus contained over 500 billion words[6] in American English, British English, French, German, Spanish, Russian, Hebrew, and Chinese.[1] Italian words are counted by their use in other languages. A user of the Ngram tool has the option to select among the source languages for the word-search operations.[7]
Researchers have analyzed the Google Ngram database of books written in American or British English discovering interesting results. Amongst them, they found correlations between the emotional output and significant events in the 20th century such as the World War II.[8]

Google BooksNgram Viewer

Graph these comma-separated phrases:
  case-insensitive
between  and  from the corpus  with smoothing of .   
19001910192019301940195019601970198019902000(click on line/label for focus)0.0000000%0.0000050%0.0000100%0.0000150%0.0000200%0.0000250%0.0000300%0.0000350%0.0000400%0.0000450%0.0000500%Hu Shih1942‪Hu Shih‬0.0000238138%

沒有留言:

張貼留言