chembl-downloader

Write reproducible code for getting and processing ChEMBL

MIT License

Stars
61
chembl-downloader - v0.4.5 Latest Release

Published by cthoyt 9 months ago

What's Changed

New Contributors

Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.4.4...v0.4.5

chembl-downloader - v0.4.4

Published by cthoyt 9 months ago

chembl-downloader - v0.4.3

Published by cthoyt about 1 year ago

What's Changed

New Contributors

Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.4.2...v0.4.3

chembl-downloader - v0.4.2

Published by cthoyt about 1 year ago

chembl-downloader - New notebooks and uniprot mapping function

Published by cthoyt almost 2 years ago

What's Changed

Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.4.0...v0.4.1

chembl-downloader - 🧪⚗️🧪 Improve project skeleton and other tasty things

Published by cthoyt almost 2 years ago

This PR does several things:

  1. Removes dependency on bioversions and just implements the code locally
  2. Adds a CLI for generating a statistics table for all versions of ChEMBL
  3. Add proper project skeleton (documentation, unit tests, code quality assurance, CI)
  4. Improve SQLite loading in case you delete the compressed data

Notebooks

  1. Adds notebook about drug indications
  2. Adds notebook about mapping to ChEBI
chembl-downloader - Add monomer library functionality 🧪🧪🧪

Published by cthoyt over 2 years ago

This release adds two new functions:

  1. chembl_downloader.download_monomer_library which gets this file https://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/latest/chembl_30_monomer_library.xml for whatever version you specify
  2. chembl_downloader.get_monomer_library_root which does the same as the downloader but also parses the XML for you

Thanks to @iwatobipen and his recent blog post for inspiring this.

chembl-downloader - Get fingerprints and chemical representations 🧪⚗️🧪

Published by cthoyt almost 3 years ago

New Functions

  • chembl_downloader.download_fps downloads the pre-computed Morgan fingerprint file
  • chembl_downloader.download_chemreps downloads the chembl-smiles-inchi-inchikey map
  • chembl_downloader.get_chemreps_df builds on chembl_downloader.download_chemreps and loads them in a pandas dataframe

Misc

  • Add isort to code quality checking
  • Enable many functions with return_version to make a tuple with the version, which is useful if you're having it infer the latest version.
chembl-downloader - v0.1.3

Published by cthoyt almost 3 years ago

This release adds the get_substructure_library() for automating the generation of an RDKit substructure library as described in Greg Landrum's RDKit blog post, Some new features in the SubstructLibrary. The following example shows how it can be used to accomplish some of the first tasks presented in the post:

from rdkit import Chem

import chembl_downloader

library = chembl_downloader.get_substructure_library()
query = Chem.MolFromSmarts('[O,N]=C-c:1:c:c:n:c:c:1')
matches = library.GetMatches(query)

Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.1.2...v0.1.3

chembl-downloader - v0.1.2

Published by cthoyt almost 3 years ago

  • Add get_assay_sql() function

Full Changelog: https://github.com/cthoyt/chembl-downloader/compare/v0.1.1...v0.1.2

chembl-downloader -

Published by cthoyt about 3 years ago

Add more top-level imports for download_sdf(), download_sqlite(), and latest()

chembl-downloader -

Published by cthoyt about 3 years ago

  • rename download() to download_extract_sqlite() to make room for other download functions
  • added supplier() function for loading the SDF dump through RDKit
chembl-downloader -

Published by cthoyt about 3 years ago

  • Update pandas backend for query() function
  • Improve CLI
chembl-downloader -

Published by cthoyt about 3 years ago

Add query() function for automatically generating pandas DataFrames from a given SQL query

chembl-downloader -

Published by cthoyt about 3 years ago

  • Fix bug when version not given
  • Fix bug where different chembl versions' different folder structures causes problem
chembl-downloader -

Published by cthoyt about 3 years ago

Initial release has a download(), connect(), and cursor() function.