MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
November 23, 2024 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Yifan Wu, Min Zeng, Yang Li, Yang Zhang, Min Li
arXiv ID
2411.15500
Category
cs.ET: Emerging Technologies
Cross-listed
cs.CL
Citations
3
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Most current molecular language models transfer the masked language model or image-text generation model from natural language processing to molecular field. However, molecules are not solely characterized by atom/bond symbols; they encapsulate important physical/chemical properties. Moreover, normal language models bring grammar rules that are irrelevant for understanding molecules. In this study, we propose a novel physicochemical knowledge-guided molecular meta language framework MolMetaLM. We design a molecule-specialized meta language paradigm, formatted as multiple <S,P,O> (subject, predicate, object) knowledge triples sharing the same S (i.e., molecule) to enhance learning the semantic relationships between physicochemical knowledge and molecules. By introducing different molecular knowledge and noises, the meta language paradigm generates tens of thousands of pretraining tasks. By recovering the token/sequence/order-level noises, MolMetaLM exhibits proficiency in large-scale benchmark evaluations involving property prediction, molecule generation, conformation inference, and molecular optimization. Through MolMetaLM, we offer a new insight for designing language models.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Emerging Technologies
π
π
The Cartographer
R.I.P.
π»
Ghosted
In-memory hyperdimensional computing
R.I.P.
π»
Ghosted
Magnetic skyrmion-based synaptic devices
R.I.P.
π»
Ghosted
DNA-Based Storage: Trends and Methods
π
π
The Cartographer
Neuro-memristive Circuits for Edge Computing: A review
R.I.P.
π»
Ghosted
4K-Memristor Analog-Grade Passive Crossbar Circuit
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted