Turbocharge a PubMed literature rather than clicking and clicking and clicking on Google Scholar
AGPL-3.0 License
Turbocharge a PubMed literature search with the command, icite
, rather than clicking and clicking and clicking on Google Scholar "Cited by N" links.
This open-source project is part of a peer-reviewed commentary that was invited by the editors of Research Synthesis Methods. Please Cite if you use pmidcite in your research or literature search.
Contact: [email protected]
PubMed contains peer-reviewed research papers
in biomedicine, biochemistry, chemistry, behavioral science, and other life sciences.
Citation data is downloaded
each time icite
is run
from the National Institutes of Health (NIH) and includes:
$ icite -H 26032263
25
citations, 10
references, and 4
authors.74
th percentile in column %
) compared to its peers.This paper is performing well (74
th percentile) compared to its peers (column %
).
The NIH percentile grouping (column G
) helps to
highlight the better performing papers in groups 2
, 3
, and 4
by
sorting the citing papers by group first, then publication year.
The sort places the lower performing papers in groups 0
or 1
at the back.
New papers appear at the beginning of a sorted list, no matter how many citations they have to better facilitate researchers in finding the latest discoveries.
The grouping of papers by NIH percentile grouping is a novel feature created by dvklopfenstein for this project.
Also known as following a paper's Cited by links or Forward snowballing
icite -H; icite 26032263 --load_citations | sort -k6 -r
or
icite -H; icite 26032263 -c | sort -k6 -r
Also known as following links to a paper's references or Backward snowballing
$ icite -H; icite 26032263 --load_references | sort -k6 -r
or
$ icite -H; icite 26032263 -r | sort -k6 -r
Create a file containing numerous PMIDs annotated with icite info
$ icite 30022098 -c -o goatools_cites.txt
WROTE: goatools_cites.txt
Count the number of lines in the file
$ wc -l goatools_cites.txt
468 goatools_cites.txt
Summarize the papers in "goatools_cites.txt"
$ sumpaps goatools_cites.txt
i=026.9% 4=003.0% 3=018.9% 2=028.8% 1=015.9% 0=006.5% 6 years:2018-2024 465 papers goatools_cites.txt
i
) to top-performing(4
), great(3
), very good(2
), and overlooked(1
and 0
)$ icite -i pmid-HIVANDDNAm-set.txt -o pmid-HIVANDDNAm-icite.txt
$ grep TOP pmid-HIVANDDNAm-icite.txt | sort -k6
A Command-Line Interface (CLI) can be preferable to a Graphical User Interface (GUI) because:
Researchers who use Linux or Mac already work from the command line. Researchers who use Windows can get that Linux-like command line feeling while still running native Windows programs by downloading Cygwin from https://www.cygwin.com/ [1].
In 2013, Boeker et al. [6] recommended that a scientific search interface contain five integrated search criteria. PubMed implements all five, while Google did not in 2013 or today.
Google's highly popular implementation of the forward citation search through their ubiquitous "Cited by N" links is a "Better" experience than the PubMed's "forward citation search" implementation.
But if your research is in the health sciences and you are amenable to working from the command line, you can use PubMed in your browser plus citation data downloaded from the NIH using the command-line using pmidcite. The NIH's citation data includes a paper's ranking among its co-citation network.
PubMed is a search interface and toolset used to access over 30.5 million article records from databases such as:
To install from PyPI
$ pip3 install pmidcite
To install locally
$ git clone https://github.com/dvklopfenstein/pmidcite.git
$ cd ./pmidcite
$ pip3 install .
Save your literature search in a GitHub repo.
Add a .pmidciterc init file to a non-git managed directory, such as home (~)
$ icite --generate-rcfile | tee ~/.pmidciterc
[pmidcite]
email = [email protected]
# To download PubMed search results, get an NCBI API key here:
# https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities
apikey = MY_LONG_HEX_NCBI_API_KEY
tool = my_scripts
$ export PMIDCITECONF=~/.pmidciterc
Do not version manage the .pmidciterc
using a tool such as GitHub because it
contains your personal email and your private NCBI API key.
To download PubMed abstracts and PubMed search results using NCBI's E-Utils, get an NCBI API key using these instructions: https://ncbiinsights.ncbi.nlm.nih.gov/2017/11/02/new-api-keys-for-the-e-utilities
Set the apikey
value in the config file: ~/.pmidciterc
See the contributing guide for detailed instructions on how to get started contributing to the pmidcite project.
email: [email protected] https://orcid.org/0000-0003-0161-7603
If you use pmidcite in your research or literature search, please cite paper 1 (pmidcite) and paper 3 (NIH citation data).
Please also consider reading and citing Gusenbauer's response (paper 2) about improving search for all during the information avalanche of these times:
The pmidcite paper: Commentary to Gusenbauer and Haddaway 2020: Evaluating Retrieval Qualities of PubMed and Google Scholar Klopfenstein DV and Dampier W 2020 | Research Synthesis Methods | PMID: 33031632 | DOI: 10.1002/jrsm.1456 | pdf
Gusenbauer's response to the pmidcite paper: What every Researcher should know about Searching – Clarified Concepts, Search Advice, and an Agenda to improve Finding in Academia Gusenbauer M and Haddaway N 2020 | Research Synthesis Methods | PMID: 33031639 | DOI: 10.1002/jrsm.1457 | pdf
The NIH citation data used by pmidcite -- Scientific Influence, Translation, and Citation counts: The NIH Open Citation Collection: A public access, broad coverage resource Hutchins BI ... Santangelo GM 2019 | PLoS Biology | PMID: 31600197 | DOI: 10.1371/journal.pbio.3000385
Please consider reading and citing the paper [4] which inspired the creation of pmidcite [1] and the authors' response to our paper [2]:
Mentioned in this README are also these outstanding contributions:
Relative Citation Ratio (RCR): A New Metric That Uses Citation Rates to Measure Influence at the Article Level Hutchins BI, Xin Yuan, Anderson JM, and Santangelo, George M. 2016 | PLoS Biology | PMID: 27599104 | DOI: 10.1371/journal.pbio.1002541
Google Scholar as replacement for systematic literature searches: good relative recall and precision are not enough Boeker M et al. 2013 | BMC Medical Research Methodology | PMID: 24160679 | DOI: 10.1186/1471-2288-13-131
Best Match: New relevance search for PubMed Fiorini N ... Lu Zhiyong 2018 | PLoS Biology | PMID: 30153250 | DOI: 10.1371/journal.pbio.2005343
[email protected] https://orcid.org/0000-0003-0161-7603
Copyright (C) 2019-present pmidcite, DV Klopfenstein, PhD. All rights reserved.