Loading Events

« All Events

:

Large language models for scientific discovery in molecular property prediction – Aqsa Awan

May 23 @ 2:00 pm - 4:00 pm KST

https://www.ibs.re.kr, 55 Expo-ro Yuseong-gu
Daejeon, Daejeon 34126 Korea, Republic of
+ Google Map

Speaker

Aqsa Awan
KAIST

In this talk, we discuss the paper “Large language models for scientific discovery in molecular property prediction” by Yizhen Zheng et.al., nature machine intelligence, 2025.

Abstract

Large language models (LLMs) are a form of artificial intelligence system encapsulating vast knowledge in the form of natural language. These systems are adept at numerous complex tasks including creative writing, storytelling, translation, question-answering, summarization and computer code generation. Although LLMs have seen initial applications in natural sciences, their potential for driving scientific discovery remains largely unexplored. In this work, we introduce LLM4SD, a framework designed to harness LLMs for driving scientific discovery in molecular property prediction by synthesizing knowledge from literature and inferring knowledge from scientific data. LLMs synthesize knowledge by extracting established information from scientific literature, such as molecular weight being key to predicting solubility. For inference, LLMs identify patterns in molecular data, particularly in Simplified Molecular Input Line Entry System-encoded structures, such as halogen-containing molecules being more likely to cross the blood–brain barrier. This information is presented as interpretable knowledge, enabling the transformation of molecules into feature vectors. By using these features with interpretable models such as random forest, LLM4SD can outperform the current state of the art across a range of benchmark tasks for predicting molecular properties. We foresee it providing interpretable and potentially new insights, aiding scientific discovery in molecular property prediction.

Details

Date:
May 23
Time:
2:00 pm - 4:00 pm KST
Event Category:

Organizer

Jae Kyoung Kim
Email
jaekkim@kaist.ac.kr

Venue

B232 Seminar Room, IBS
55 Expo-ro Yuseong-gu
Daejeon, Daejeon 34126 Korea, Republic of
+ Google Map
View Venue Website
IBS 의생명수학그룹 Biomedical Mathematics Group
기초과학연구원 수리및계산과학연구단 의생명수학그룹
대전 유성구 엑스포로 55 (우) 34126
IBS Biomedical Mathematics Group (BIMAG)
Institute for Basic Science (IBS)
55 Expo-ro Yuseong-gu Daejeon 34126 South Korea
Copyright © IBS 2021. All rights reserved.