scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

UNLICENSE License

Downloads
49.2K
Stars
1.4K
Committers
40

Bot releases are hidden (Show)

scholarly - v1.7.11 Latest Release

Published by arunkannawadi almost 2 years ago

Improvements

  • Handle redirects from using approximate or outdated scholar_id values in search_author_id query (#469)

Bugfix

  • ScraperAPI premium parameters are now respected
scholarly - v1.7.10

Published by arunkannawadi almost 2 years ago

Bugfix

  • Handle any gap years when filling citation counts per year (#481)

Improvement

  • Use a default, fixed user agent if fake-useragent cannot be imported.
scholarly - v1.7.10-alpha

Published by arunkannawadi almost 2 years ago

Improvement

Handle errors that may come from old version of fake-useragent.

scholarly - v1.7.9

Published by arunkannawadi almost 2 years ago

scholarly - v1.7.8

Published by arunkannawadi almost 2 years ago

Bugfix

Changes in v1.7.7 made the library incompatible with ScraperAPI method. This patch fixes this issue (https://github.com/scholarly-python-package/scholarly/issues/477)

scholarly - v1.7.7

Published by arunkannawadi almost 2 years ago

Improvements

Change the underlying library from requests to httpx. This largely solves #465 for the time being at least.

scholarly - v1.7.6

Published by arunkannawadi almost 2 years ago

#Bugfix

scholarly - v1.7.5

Published by arunkannawadi almost 2 years ago

Bugfix

  • fix sameSite cookie issue (#196)
  • version limit free-proxy in setup.py
scholarly - v1.7.4

Published by arunkannawadi almost 2 years ago

Bugfixes

  • includes a hotfix for backward incompatibility for one of the dependencies
  • minor reorganization to unit tests
scholarly - v1.7.3

Published by arunkannawadi almost 2 years ago

Improvement

  • search_citedby can fetch more than 1000+ citations. The improved implementation gets around Google Scholar's limitation of displaying only 1000 citations. Thanks to @jjshoots for noticing this problem and raising a PR. #449 #446
  • url_related_articles will be populated with the more useful link when more than one are available on Google Scholar.

Bugfix

  • search_keyword and search_keywords can take labels that are composed of multiple words. #437
scholarly - v1.7.2

Published by arunkannawadi about 2 years ago

Enhancement

  • Skip checking if ScraperAPI works by pinging a test URL. Instead, this is inferred from ScraperAPI's account API.

Bugfix

  • Changes introduced in v1.7.1 are reverted, since they increased the changes of getting blocked by Google Scholar.
scholarly - v1.7.1

Published by arunkannawadi over 2 years ago

Enhancement

  • Reduce the usage of premium proxies by up to a factor of 2.
scholarly - v1.7.0

Published by arunkannawadi over 2 years ago

Features

  • Add a new citation entry to pub fetched from an author profile with formatted citation entry #423.

Bugfixes

  • Fix pprint failures on Windows #413.
  • Thoroughly handle 1000 or more publications that are available (or not) according to public access mandates #414.
  • Fix errors in download_mandates_csv that may occassionally occur for agencies without a policy link #413.
scholarly - v1.6.3

Published by arunkannawadi over 2 years ago

Bugfix

search_pubs method did not respect include_last_year, which is now fixed.

Improvements

Unit tests involving funding agency mandates are a bit more robust.

scholarly - v1.6.2

Published by ipeirotis over 2 years ago

scholarly - v1.6.1

Published by arunkannawadi over 2 years ago

Bugfix

Handle 1000 or more publications that are available (or not) according to public access mandates

Improvement

Fetch 20+ coauthors without requiring geckodriver/chrome-driver to be installed.

scholarly - v1.6.0

Published by arunkannawadi over 2 years ago

Features

  • Download table of funding agencies as a CSV file with URL to the funding mandates included
  • Downlad top-ranking journals in general, under sub-categories and in different languages as a CSV file

Bugfixes

#392
#394

scholarly - v1.5.1

Published by arunkannawadi almost 3 years ago

Features

Support chromium (chrome-driver) as an alternative to geckodriver #387

Improvements

  • Firefox/Geckodriver operates in headless mode
  • Increase test coverage to include all public APIs
  • Clean up legacy code and improve coding styles
  • Remove the use of deprecated functions in dependency packages

Bugfix

  • Stop attempting to reuse a closed webdriver
scholarly - v1.5.0

Published by arunkannawadi almost 3 years ago

Features

  • Fetch the public access mandates information from a Scholar profile and mark the publications whether or not they satisfy the open-access mandate.
  • Fetch an author's organization identifer from their Scholar profile
  • Search for all authors affiliated with an organization
  • Fetch homepage URL from a Scholar profile

Enhancements

  • Make FreeProxies more robust
  • Stop the misleading traceback error message #313

Bugfixes

  • Fix bug in exception handling #366
scholarly - v1.5.0-alpha

Published by arunkannawadi almost 3 years ago

Features

  • Fetch the public access mandates information from a Scholar profile and mark the publications whether or not they satisfy the open-access mandate.
  • Fetch an author's organization identifer from their Scholar profile
  • Search for all authors affiliated with an organization
  • Fetch homepage URL from a Scholar profile

Enhancements

  • Make FreeProxies more robust
  • Stop the misleading traceback error message #313

Bugfixes

  • Fix bug in exception handling #366