get_keywords.Rd
Given an URL to a bookdown chapter, extract the keywords
get_keywords(url, min_occurrence = 4, udmodel = NULL)
a url to a bookdown chapter e.g. "https://jhudatascience.org/Documentation_and_Usability/what-does-good-documentation-look-like.html"
A numeric number specifying the minimum number of times a keyword should appear for it to stay in the list. Default is 4.
A udmodel passed in for keyword determination. Will be obtained using
udpipe::udpipe_download_model(language = "english")
if its not given.
a data frame of keywords
# Declare chapter URL
url <- "https://jhudatascience.org/Documentation_and_Usability/other-helpful-features.html"
keywords_df <- get_keywords(url)
#> Downloading udpipe model from https://raw.githubusercontent.com/jwijffels/udpipe.models.ud.2.5/master/inst/udpipe-ud-2.5-191206/english-ewt-ud-2.5-191206.udpipe to /Users/candacesavonen/Desktop/GitRepos/cow/docs/reference/english-ewt-ud-2.5-191206.udpipe
#> - This model has been trained on version 2.5 of data from https://universaldependencies.org
#> - The model is distributed under the CC-BY-SA-NC license: https://creativecommons.org/licenses/by-nc-sa/4.0
#> - Visit https://github.com/jwijffels/udpipe.models.ud.2.5 for model license details.
#> - For a list of all models and their licenses (most models you can download with this package have either a CC-BY-SA or a CC-BY-SA-NC license) read the documentation at ?udpipe_download_model. For building your own models: visit the documentation by typing vignette('udpipe-train', package = 'udpipe')
#> Downloading finished, model stored at '/Users/candacesavonen/Desktop/GitRepos/cow/docs/reference/english-ewt-ud-2.5-191206.udpipe'