Package: ojsr 0.1.5
ojsr: Crawler and Data Scraper for Open Journal System ('OJS')
Crawler for 'OJS' pages and scraper for meta-data from articles. You can crawl 'OJS' archives, issues, articles, galleys, and search results. You can scrape articles metadata from their head tag in html, or from Open Archives Initiative ('OAI') records. Most of these functions rely on 'OJS' routing conventions (<https://docs.pkp.sfu.ca/dev/documentation/en/architecture-routes>).
Authors:
ojsr_0.1.5.tar.gz
ojsr_0.1.5.zip(r-4.5)ojsr_0.1.5.zip(r-4.4)ojsr_0.1.5.zip(r-4.3)
ojsr_0.1.5.tgz(r-4.4-any)ojsr_0.1.5.tgz(r-4.3-any)
ojsr_0.1.5.tar.gz(r-4.5-noble)ojsr_0.1.5.tar.gz(r-4.4-noble)
ojsr_0.1.5.tgz(r-4.4-emscripten)ojsr_0.1.5.tgz(r-4.3-emscripten)
ojsr.pdf |ojsr.html✨
ojsr/json (API)
NEWS
# Install 'ojsr' in R: |
install.packages('ojsr', repos = c('https://gastonbecerra.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/gastonbecerra/ojsr/issues
Last updated 10 days agofrom:6127e0fd95. Checks:OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 13 2024 |
R-4.5-win | OK | Nov 13 2024 |
R-4.5-linux | OK | Nov 13 2024 |
R-4.4-win | OK | Nov 13 2024 |
R-4.4-mac | OK | Nov 13 2024 |
R-4.3-win | OK | Nov 13 2024 |
R-4.3-mac | OK | Nov 13 2024 |
Exports:get_articles_from_issueget_articles_from_searchget_galleys_from_articleget_html_meta_from_articleget_issues_from_archiveget_oai_meta_from_articleparse_base_urlparse_oai_url
Dependencies:askpassbitopsclicpp11curldplyrfansigenericsgluehttrjsonlitelifecyclemagrittrmimeopensslpillarpkgconfigpurrrR6RcppRCurlrlangrvestselectrstringistringrsystibbletidyrtidyselecttriebeardurltoolsutf8vctrswithrxml2
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Scraping articles URLs from the ToC of OJS issues | get_articles_from_issue |
Scraping OJS search results for a given criteria to retrieve articles’ URL | get_articles_from_search |
Scraping galleys URLs from OJS articles | get_galleys_from_article |
Scraping metadata from the OJS articles HTML | get_html_meta_from_article |
Scraping issues’ URLs from the OJS issues archive | get_issues_from_archive |
Retrieving OAI records for OJS articles | get_oai_meta_from_article |
Parses urls against OJS routing conventions and retrieves the base url | parse_base_url |
Parses urls against OJS routing conventions and retrieves the OAI url | parse_oai_url |