This project converts and cleans the scientific metadata from Swepub into Python objects
GPL-3.0 License
Convert Wikidata and Wikipedia raw files to filterable formats with a focus of marking Wikidata ...
Project that aims to sentenize all the open data of Riksdagen and other sources to create an easi...
The purpose of this script is to get all the senses for all the words in a SRT-file from Wikidata
This tool helps scrape DOIs from https://orcid.org/ and curate them using Scholia
An ML API to compute similarity scores between meta information about sentence examples.
Open Access PDF harvester, metadata aggregator and full-text ingester
Biomedical Entity Linking Benchmark
Dumb web to disk tool; html, markdown / md / text, epub
arxiv_miner is a toolkit for mining research papers on CS ArXiv.
Data repository for pretrained NLP models and NLP corpora.
Local relational access to openly-available publication data sets
Semantic Publishing Workflow support
This repository is to support contributions for tools for the Project CodeNet dataset hosted in DAX
Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code
Recommendation engine for scholarly articles