ner_download_modeldata {crfsuite} | R Documentation |
Download training data for doing Named Entity Recognition (NER)
ner_download_modeldata(type = c("conll2002-nl", "conll2002-es", "GermanNER", "wikiner-de-wp2", "wikiner-de-wp3", "wikiner-en-wp2", "wikiner-en-wp3", "wikiner-es-wp2", "wikiner-es-wp3", "wikiner-fr-wp2", "wikiner-fr-wp3", "wikiner-it-wp2", "wikiner-it-wp3", "wikiner-nl-wp2", "wikiner-nl-wp3", "wikiner-pl-wp3", "wikiner-pt-wp3", "wikiner-ru-wp2", "wikiner-ru-wp3"), docs = -Inf)
type |
a character string with the type of data to download. See the function usage for all possible values. These data will be downloaded from either:
Please visit the information on these repositories first before you use these data in any commercial product. |
docs |
integer indicating how many documents to sample from the data (only used for data from the NLTK repository). This is only used to reduce CRAN R CMD check training time in the examples of this R package. |
a data.frame with training data for a Named Entity Recognition task or an object of try-error in case of failure of downloading the data
## Not run: x <- ner_download_modeldata("conll2002-nl") x <- ner_download_modeldata("conll2002-es") x <- ner_download_modeldata("GermanNER") x <- ner_download_modeldata("wikiner-en-wp2") x <- ner_download_modeldata("wikiner-nl-wp3") x <- ner_download_modeldata("wikiner-fr-wp3") ## End(Not run) ## reduce number of docs x <- ner_download_modeldata("conll2002-es", docs = 10)