Type: | Package |
Title: | Unine Light Stemmer |
Version: | 0.2.0 |
Maintainer: | Michaël Benesty <michael@benesty.fr> |
Description: | Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' https://lucene.apache.org/solr/ or 'ElasticSearch' https://www.elastic.co/fr/products/elasticsearch. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10%3C944::AID-ASI9%3E3.3.CO;2-H>. |
URL: | https://github.com/pommedeterresautee/unine, https://pommedeterresautee.github.io/unine/, http://members.unine.ch/jacques.savoy/clef/ |
BugReports: | https://github.com/pommedeterresautee/unine/issues |
License: | MIT + file LICENSE |
Encoding: | UTF-8 |
LazyData: | true |
LinkingTo: | Rcpp |
Imports: | Rcpp (≥ 1.0.0), methods |
SystemRequirements: | C++11 |
RoxygenNote: | 6.1.1 |
NeedsCompilation: | yes |
Suggests: | testthat, covr, stringi |
Packaged: | 2019-04-13 22:02:47 UTC; geantvert |
Author: | Michaël Benesty [aut, cre, cph], Jacques Savoy [cph] |
Repository: | CRAN |
Date/Publication: | 2019-04-13 22:24:09 UTC |
unine: Unine Light Stemmer
Description
Implementation of "light" stemmers for French, German, Italian, Spanish, Portuguese, Finnish, Swedish. They are based on the same work as the "light" stemmers found in 'SolR' <https://lucene.apache.org/solr/> or 'ElasticSearch' <https://www.elastic.co/fr/products/elasticsearch>. A "light" stemmer consists in removing inflections only for noun and adjectives. Indexing verbs for these languages is not of primary importance compared to nouns and adjectives. The stemming procedure for French is described in (Savoy, 1999) <doi:10.1002/(SICI)1097-4571(1999)50:10
Author(s)
Maintainer: Michaël Benesty michael@benesty.fr [copyright holder]
Other contributors:
Jacques Savoy Jacques.Savoy@unine.ch [copyright holder]
See Also
Useful links:
Report bugs at https://github.com/pommedeterresautee/unine/issues
Stem Finnish words
Description
Stemmer for Finnish words
Usage
finnish_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
finnish_stemmer(c("taivas"))
Finnish stop list
Description
Finnish stop list
Usage
finnish_stoplist
Format
character containing the list of words
Source
Examples
data("finnish_stoplist")
Stem French words
Description
Stemmer for French words
Usage
french_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
french_stemmer(c("tester", "testament", "clients"))
French stop list
Description
French stop list
Usage
french_stoplist
Format
character containing the list of words
Source
Examples
data("french_stoplist")
Stem German words
Description
Stemmer for German words
Usage
german_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
german_stemmer(c("kinder"))
German stop list
Description
German stop list
Usage
german_stoplist
Format
character containing the list of words
Source
Examples
data("german_stoplist")
Stem Italian words
Description
Stemmer for Italian words
Usage
italian_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
italian_stemmer(c("arrivederci"))
Italian stop list
Description
Italian stop list
Usage
italian_stoplist
Format
character containing the list of words
Source
Examples
data("italian_stoplist")
Stem Portuguese words
Description
Stemmer for Portuguese words
Usage
portuguese_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
portuguese_stemmer(c("adeus"))
Portuguese stop list
Description
Portuguese stop list
Usage
portuguese_stoplist
Format
character containing the list of words
Source
Examples
data("portuguese_stoplist")
Stem Spanish words
Description
Stemmer for Spanish words
Usage
spanish_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
spanish_stemmer(c("perros"))
Spanish stop list
Description
Spanish stop list
Usage
spanish_stoplist
Format
character containing the list of words
Source
Examples
data("spanish_stoplist")
Stem Swedish words
Description
Stemmer for Swedish words
Usage
swedish_stemmer(words)
Arguments
words |
a character containing the original words. |
Value
character with stemmed words.
Examples
swedish_stemmer(c("stiga"))
Swedish stop list
Description
Swedish stop list
Usage
swedish_stoplist
Format
character containing the list of words
Source
Examples
data("swedish_stoplist")