Matches in DBpedia 2014 for { <http://dbpedia.org/resource/Heaps'_law> ?p ?o. }
Showing items 1 to 25 of 25 with 100 items per page.
- Heaps'_law abstract "In linguistics, Heaps' law (also called Herdan's law) is an empirical law which describes the number of distinct words in a document (or set of documents) as a function of the document length (the so-called type-token relation). It can be formulated as $V_R(n) = K n^{\beta}$, where $V_R$ is the number of distinct words in an instance text of size $n$. $K$ and $\beta$ are free parameters determined empirically. With English text corpora, typically $K$ is between 10 and 100, and $\beta$ is between 0.4 and 0.6. The law is frequently attributed to Harold Stanley Heaps, but was originally discovered by Gustav Herdan (1960). Under mild assumptions, the Herdan–Heaps law is asymptotically equivalent to Zipf's law concerning the frequencies of individual words within a text. This is a consequence of the fact that the type-token relation (in general) of a homogeneous text can be derived from the distribution of its types. Heaps' law means that as more instance text is gathered, there will be diminishing returns in terms of discovery of the full vocabulary from which the distinct terms are drawn. Heaps' law also applies to situations in which the "vocabulary" is just some set of distinct types which are attributes of some collection of objects. For example, the objects could be people, and the types could be the country of origin of each person. If persons are selected randomly (that is, without regard to country of origin), then Heaps' law says we will quickly have representatives from most countries (in proportion to their population), but it will become increasingly difficult to cover the entire set of countries by continuing this method of sampling.".
- Heaps'_law thumbnail Heaps_law_plot.png?width=300.
- Heaps'_law wikiPageID "436287".
- Heaps'_law wikiPageRevisionID "565071150".
- Heaps'_law hasPhotoCollection Heaps'_law.
- Heaps'_law id "3431".
- Heaps'_law title "Heaps' law".
- Heaps'_law subject Category:Computational_linguistics.
- Heaps'_law subject Category:Empirical_laws.
- Heaps'_law subject Category:Statistical_laws.
- Heaps'_law type Abstraction100002137.
- Heaps'_law type Collection107951464.
- Heaps'_law type EmpiricalLaws.
- Heaps'_law type Group100031264.
- Heaps'_law type Law108441203.
- Heaps'_law type StatisticalLaws.
- Heaps'_law comment "In linguistics, Heaps' law (also called Herdan's law) is an empirical law which describes the number of distinct words in a document (or set of documents) as a function of the document length (the so-called type-token relation). It can be formulated as $V_R(n) = K n^{\beta}$, where $V_R$ is the number of distinct words in an instance text of size $n$. $K$ and $\beta$ are free parameters determined empirically.".
- Heaps'_law label "Heaps' law".
- Heaps'_law sameAs m.028bdr.
- Heaps'_law sameAs Q5691531.
- Heaps'_law sameAs Q5691531.
- Heaps'_law sameAs Heaps'_law.
- Heaps'_law wasDerivedFrom Heaps'_law?oldid=565071150.
- Heaps'_law depiction Heaps_law_plot.png.
- Heaps'_law isPrimaryTopicOf Heaps'_law.
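
The relation described in the abstract, with the number of distinct words growing as K times n to the power β, can be illustrated with a minimal Python sketch. The function names `heaps_vocabulary_size` and `type_token_curve` are hypothetical, and the defaults K = 44 and β = 0.5 are assumed values picked from inside the ranges the abstract quotes for English corpora, not fitted parameters.

```python
def heaps_vocabulary_size(n, k=44.0, beta=0.5):
    """Predicted number of distinct words for a text of n tokens: K * n**beta.

    k and beta are illustrative assumptions, not values from any real corpus.
    """
    if n < 0:
        raise ValueError("n must be non-negative")
    return k * n ** beta


def type_token_curve(tokens):
    """Observed type-token relation: (tokens read, distinct types seen) per step."""
    seen = set()
    curve = []
    for count, token in enumerate(tokens, start=1):
        seen.add(token)
        curve.append((count, len(seen)))
    return curve


if __name__ == "__main__":
    # Diminishing returns: with beta < 1, doubling the text size
    # less than doubles the predicted vocabulary.
    for n in (1_000, 2_000, 4_000):
        print(n, round(heaps_vocabulary_size(n)))

    # Observed curve for a toy token sequence.
    print(type_token_curve("a b a c b".split()))
```

Fitting K and β to a real corpus would typically mean regressing the logarithm of the observed type counts from `type_token_curve` on the logarithm of the token counts; that step is left out of this sketch.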