molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.

The construction, management and analysis of large in silico molecular libraries is critical in many areas of modern chemistry. Herein, we introduce the MOLecular LIibrary toolkit, “molli”, which is a Python 3 cheminformatics module that provides a streamlined interface for manipulating large in sil...

Full description

Bibliographic Details
Main Authors: Shved, Alexander S., Ocampo, Blake E., Burlova, Elena S., Olen, Casey L., Rinehart, N. Ian, Denmark, Scott E.
Format: Other/Unknown Material
Language:unknown
Published: American Chemical Society (ACS) 2024
Subjects:
Online Access:http://dx.doi.org/10.26434/chemrxiv-2024-1cl1f-v2
https://chemrxiv.org/engage/api-gateway/chemrxiv/assets/orp/resource/item/66b687e8c9c6a5c07aabd1a2/original/molli-a-general-purpose-python-toolkit-for-combinatorial-small-molecule-library-generation-manipulation-and-feature-extraction.pdf
id cracsoc:10.26434/chemrxiv-2024-1cl1f-v2
record_format openpolar
spelling cracsoc:10.26434/chemrxiv-2024-1cl1f-v2 2024-09-15T18:28:56+00:00 molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction. Shved, Alexander S. Ocampo, Blake E. Burlova, Elena S. Olen, Casey L. Rinehart, N. Ian Denmark, Scott E. 2024 http://dx.doi.org/10.26434/chemrxiv-2024-1cl1f-v2 https://chemrxiv.org/engage/api-gateway/chemrxiv/assets/orp/resource/item/66b687e8c9c6a5c07aabd1a2/original/molli-a-general-purpose-python-toolkit-for-combinatorial-small-molecule-library-generation-manipulation-and-feature-extraction.pdf unknown American Chemical Society (ACS) https://creativecommons.org/licenses/by/4.0/ posted-content 2024 cracsoc https://doi.org/10.26434/chemrxiv-2024-1cl1f-v2 2024-08-15T04:08:21Z The construction, management and analysis of large in silico molecular libraries is critical in many areas of modern chemistry. Herein, we introduce the MOLecular LIibrary toolkit, “molli”, which is a Python 3 cheminformatics module that provides a streamlined interface for manipulating large in silico libraries. Three-dimensional, combinatorial molecule libraries can be expanded directly from two-dimensional chemical structure fragments stored in CDXML files with high stereochemical fidelity. Geometry optimization, property calculation, and conformer generation are executed by interfacing with widely used computational chemistry programs such as OpenBabel, RDKit, ORCA, NWChem, and xTB/CREST. Conformer-dependent grid-based feature calculators provide numerical representation, and interface to robust three-dimensional visualization tools that provide comprehensive images to enhance human understanding of libraries with thousands of members. The package includes a command-line interface in addition to Python classes to streamline frequently used workflows. Parallel performance is benchmarked on various hardware platforms, and common workflows are demonstrated for different tasks ranging from optimized grid-based descriptor calculation on catalyst libraries to an NMR chemical shift prediction workflow from CDXML files. Other/Unknown Material Orca ACS Publications
institution Open Polar
collection ACS Publications
op_collection_id cracsoc
language unknown
description The construction, management and analysis of large in silico molecular libraries is critical in many areas of modern chemistry. Herein, we introduce the MOLecular LIibrary toolkit, “molli”, which is a Python 3 cheminformatics module that provides a streamlined interface for manipulating large in silico libraries. Three-dimensional, combinatorial molecule libraries can be expanded directly from two-dimensional chemical structure fragments stored in CDXML files with high stereochemical fidelity. Geometry optimization, property calculation, and conformer generation are executed by interfacing with widely used computational chemistry programs such as OpenBabel, RDKit, ORCA, NWChem, and xTB/CREST. Conformer-dependent grid-based feature calculators provide numerical representation, and interface to robust three-dimensional visualization tools that provide comprehensive images to enhance human understanding of libraries with thousands of members. The package includes a command-line interface in addition to Python classes to streamline frequently used workflows. Parallel performance is benchmarked on various hardware platforms, and common workflows are demonstrated for different tasks ranging from optimized grid-based descriptor calculation on catalyst libraries to an NMR chemical shift prediction workflow from CDXML files.
format Other/Unknown Material
author Shved, Alexander S.
Ocampo, Blake E.
Burlova, Elena S.
Olen, Casey L.
Rinehart, N. Ian
Denmark, Scott E.
spellingShingle Shved, Alexander S.
Ocampo, Blake E.
Burlova, Elena S.
Olen, Casey L.
Rinehart, N. Ian
Denmark, Scott E.
molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
author_facet Shved, Alexander S.
Ocampo, Blake E.
Burlova, Elena S.
Olen, Casey L.
Rinehart, N. Ian
Denmark, Scott E.
author_sort Shved, Alexander S.
title molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
title_short molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
title_full molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
title_fullStr molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
title_full_unstemmed molli: A General-Purpose Python Toolkit for Combinatorial Small Molecule Library Generation, Manipulation, and Feature Extraction.
title_sort molli: a general-purpose python toolkit for combinatorial small molecule library generation, manipulation, and feature extraction.
publisher American Chemical Society (ACS)
publishDate 2024
url http://dx.doi.org/10.26434/chemrxiv-2024-1cl1f-v2
https://chemrxiv.org/engage/api-gateway/chemrxiv/assets/orp/resource/item/66b687e8c9c6a5c07aabd1a2/original/molli-a-general-purpose-python-toolkit-for-combinatorial-small-molecule-library-generation-manipulation-and-feature-extraction.pdf
genre Orca
genre_facet Orca
op_rights https://creativecommons.org/licenses/by/4.0/
op_doi https://doi.org/10.26434/chemrxiv-2024-1cl1f-v2
_version_ 1810470365675126784