Sharing chemical knowledge between human and machine

Research team develops AI tool that translates chemical structures into machine-readable codes

Chemists have long relied on structural formulae to understand the composition and arrangement of chemical compounds. These formulae provide insights into reactions between molecules, synthesis of complex compounds, and potential therapeutic effects of natural substances. However, while these visual representations are intuitive for humans, they’re not easily processed by software. To bridge this gap, a team led by Prof. Christoph Steinbeck and Prof. Achim Zielesny has developed an AI tool called “DECIMER” (Deep Learning for Chemical Image Recognition).

DECIMER transforms images of chemical structural formulae into machine-readable code. This open-source platform automatically identifies and classifies images in scientific articles, converting recognized structural formulae into machine-readable structure codes. For instance, the caffeine molecule’s structural formula becomes the code CN1C=NC2=C1C(=O)N(C(=O)N2C)C. This code can be uploaded into databases and linked with additional molecule information.

The AI tool employs modern AI methods, similar to those used in Large Language Models like ChatGPT. To train DECIMER, the researchers utilized existing machine-readable databases, generating around 450 million structural formulae for training data. Companies and researchers are already using DECIMER to convert structural formulae from patent specifications into databases.

The inspiration for DECIMER arose from the development of AI methods for the ancient Asian board game Go. The chemists were intrigued by the capabilities of AI during the famous Go tournament between human champion Lee Sedol and the AI software “AlphaGo.” Realizing the potential of AI, the researchers aimed to apply these methods to solve complex problems in their field.

The team aspires to make chemical literature from the 1950s onwards machine-readable, preserving knowledge and sharing it with the scientific community. This effort aligns with Prof. Steinbeck’s role as the coordinator of Germany’s National Research Data Infrastructure for Chemistry.

Posted in

Aihub Team

Leave a Comment





AI in Agriculture

AI in Agriculture

The Future of Intelligent Content Management, Semantic AI, and Content Impact

The Future of Intelligent Content Management, Semantic AI, and Content Impact

The Future of Enterprise Content in the Era of AI

The Future of Enterprise Content in the Era of AI

The Art of the Practical - Making AI Real

The Art of the Practical – Making AI Real

AI: Making Data Protection Simpler

AI: Making Data Protection Simpler

Will Generative AI Aid Instead of Replace Workers?

Will Generative AI Aid Instead of Replace Workers?

UK: AI’s Impact on Workplace Safety

UK: AI’s Impact on Workplace Safety

Stay Abreast of Laws Restricting AI in the Workplace

Stay Abreast of Laws Restricting AI in the Workplace

Oracle introduces generative AI capabilities to support HR functions and productivity

Oracle introduces generative AI capabilities to support HR functions and productivity

Discovering hidden talent: How AI-powered talent marketplaces benefit employers

Discovering hidden talent: How AI-powered talent marketplaces benefit employers

Understanding Machine Learning Algorithms

Understanding Machine Learning Algorithms

Understanding Generative Adversarial Networks (GANs)

Understanding Generative Adversarial Networks (GANs)

The Impact of AI on the Job Market and Future of Work

The Impact of AI on the Job Market and Future of Work

The Basics of Artificial Intelligence

The Basics of Artificial Intelligence

Reinforcement Learning: Training AI Agents to Make Decisions

Reinforcement Learning: Training AI Agents to Make Decisions

Natural Language Processing Unleashing the Power of Text

Natural Language Processing Unleashing the Power of Text

How AI is Transforming Industries

How AI is Transforming Industries

Exploring Neural Networks and Deep Learning

Exploring Neural Networks and Deep Learning

Ethical Considerations in Artificial Intelligence

Ethical Considerations in Artificial Intelligence

Computer Vision and Image Recognition in AI

Computer Vision and Image Recognition in AI

ARTIFICIAL INTELLIGENCE IN LOGISTICS

ARTIFICIAL INTELLIGENCE IN LOGISTICS

On Artificial Intelligence - A European approach to excellence and trust

On Artificial Intelligence – A European approach to excellence and trust

AI in Healthcare Advancements and Applications

AI in Healthcare Advancements and Applications

AI in Financial Services: Opportunities and Challenges

AI in Financial Services: Opportunities and Challenges

AI in Customer Service: Improving User Experience

AI in Customer Service: Improving User Experience

AI and Robotics: Synergies and Applications

AI and Robotics: Synergies and Applications

AI and Data Science: Bridging the Gap

AI and Data Science: Bridging the Gap

Top 10 emerging AI and ML uses in data centres

Top 10 emerging AI and ML uses in data centres

Piero Molino, Predibase: On low-code machine learning and LLMs

Piero Molino, Predibase: On low-code machine learning and LLMs

OpenAI’s first global office will be in London

OpenAI’s first global office will be in London