question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Difference in InChIKey values, from SMILES, when comparing to RDKit

See original GitHub issue

If I run the following with RDKit (2021.09.2):

from rdkit import Chem    

m1 = Chem.MolFromSmiles("N1C[C@@]2(CC[C@H](C)CC2)CC1")  
print(Chem.MolToInchiKey(m1))    

m2 = Chem.MolFromSmiles("N1C[C@]2(CC[C@H](C)CC2)CC1")  
print(Chem.MolToInchiKey(m2)) 

I obtain:

SGJOFPWLAFWQEP-AOOOYVTPSA-N
SGJOFPWLAFWQEP-MGCOHNPYSA-N

If I do something similar with OpenFF’s to_inchikey() method (openff-toolkit 0.10.1)

from openff.toolkit.topology import Molecule    

m1 = Molecule.from_smiles("N1C[C@@]2(CC[C@H](C)CC2)CC1")  
print(m1.to_inchikey())      
 
m2 = Molecule.from_smiles("N1C[C@]2(CC[C@H](C)CC2)CC1")   
print(m2.to_inchikey()) 

I obtain:

SGJOFPWLAFWQEP-UHFFFAOYSA-N
SGJOFPWLAFWQEP-UHFFFAOYSA-N

Any ideas why? I am missing something?

Many thanks,

Mark

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:5 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
j-wagscommented, Dec 8, 2021

Per the discussion above, it looks like a new feature in RDKit CAN handle this.

I’ll start poking around in OpenEye as well. One hint is the yellow warning in the Omega docs that ring stereo may not be respected.

0reactions
j-wagscommented, Nov 24, 2021

I’ve started a discussion on the RDKit board here: https://github.com/rdkit/rdkit/discussions/4731

Read more comments on GitHub >

github_iconTop Results From Across the Web

RDKit Cookbook — The RDKit 2022.09.1 documentation
A simpler way to add atom indices is to adjust the IPythonConsole properties. This produces a similar image to the example above, the...
Read more >
[Rdkit-discuss] comparing two or more tables of molecules
Unique Smiles or InChI String comparisons don't appear to work presumable because there are different but equivalent structures, e.g. explicit vs non-explicit ...
Read more >
My RDKit Cheatsheet - Xinhao Li
RDKit molecule enable several features to handle molecules: drawing, computing fingerprints/properties, molecular curation etc. smiles = 'COC(=O) ...
Read more >
organic chemistry - Pubchem, InChI, SMILES, and uniqueness
The different protonation states of the same compound will have InChIKeys differing only by the protonation indicator (unless both states have a ...
Read more >
An open source chemical structure curation pipeline using RDKit
In order to maintain the quality of the final database and to easily compare and integrate data on the same compound from different...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found