Content
Gnuhoo borrowed the basic outline for its initial ontology from Usenet. In 1998, Rich Skrenta said, "I took a long list of groups and hand-edited them into a hierarchy." For example, the topic covered by the comp.ai.alife newsgroup was represented by the category Computers/AI/Artificial_Life. The original divisions were for Adult, Arts, Business, Computers, Games, Health, Home, News, Recreation, Reference, Regional, Science, Shopping, Society and Sports. While these fifteen top-level categories have remained intact, the ontology of second- and lower-level categories has undergone a gradual evolution; significant changes are initiated by discussion among editors and then implemented when consensus has been reached.
In July 1998, the directory became multilingual with the addition of the World top-level category. The remainder of the directory lists only English language sites. By May 2005, seventy-five languages were represented. The growth rate of the non-English components of the directory has been greater than the English component since 2002. While the English component of the directory held almost 75% of the sites in 2003, the World level grew to over 1.5 million sites as of May 2005, forming roughly one-third of the directory. The ontology in non-English categories generally mirrors that of the English directory, although exceptions which reflect language differences are quite common.
Several of the top-level categories have unique characteristics. The Adult category is not present on the directory homepage but it is fully available in the RDF dump that ODP provides. While the bulk of the directory is categorized primarily by topic, the Regional category is categorized primarily by region. This has led many to view ODP as two parallel directories: Regional and Topical.
On November 14, 2000, a special directory within the Open Directory was created for people under 18 years of age. Key factors distinguishing this "Kids and Teens" area from the main directory are:
- stricter guidelines which limit the listing of sites to those which are targeted or "appropriate" for people under 18 years of age;
- category names as well as site descriptions use vocabulary which is "age appropriate";
- age tags on each listing distinguish content appropriate for kids (age 12 and under), teens (13 to 15 years old) and mature teens (16 to 18 years old);
- Kids and Teens content is available as a separate RDF dump;
- editing permissions are such that the community is parallel to that of the Open Directory.
By May 2005, this portion of the Open Directory included over 32,000 site listings.
Since early 2004, the whole site has been in UTF-8 encoding. Prior to this, the encoding used to be ISO 8859-1 for English language categories and a language-dependent character set for other languages. The RDF dumps have been encoded in UTF-8 since early 2000.
Read more about this topic: Open Directory Project
Famous quotes containing the word content:
“To me style is just the outside of content, and content the inside of style, like the outside and the inside of the human bodyboth go together, they cant be separated.”
—Jean-Luc Godard (b. 1930)
“You can hardly convince a man of an error in a life-time, but must content yourself with the reflection that the progress of science is slow. If he is not convinced, his grandchildren may be. The geologists tell us that it took one hundred years to prove that fossils are organic, and one hundred and fifty more to prove that they are not to be referred to the Noachian deluge.”
—Henry David Thoreau (18171862)
“It is still not enough for language to have clarity and content ... it must also have a goal and an imperative. Otherwise from language we descend to chatter, from chatter to babble and from babble to confusion.”
—René Daumal (19081944)