Towards Molecule Generation with Heterogeneous States via Reinforcement Learning

Shi, Fangzhou

Access status:

Open Access

Field	Value	Language
dc.contributor.author	Shi, Fangzhou
dc.date.accessioned	2020-05-21
dc.date.available	2020-05-21
dc.date.issued	2020-01-01
dc.identifier.uri	https://hdl.handle.net/2123/22335
dc.description.abstract	De novo molecular design and generation are frequently prescribed in the field of chemistry and biology, for it plays a critical role in maintaining the prosperity of the chemical industry and benefiting the drug discovery. Nowadays, many significant problems in this field are based on the philosophy of designing molecular structures towards specific desired properties. This research is very meaningful in both medical and AI fields, which can benefits novel drug discovery for some diseases. However, It remains a challenging task due to the large size of chemical space. In recent years, reinforcement learning-based methods leverage graphs to represent molecules and generate molecules as a decision making process. However, this vanilla graph representation may neglect the intrinsic context information with molecules and limits the generation performance accordingly. In this paper, we propose to augment the original graph states with the SMILES context vectors. As a result, SMILES representations are easily processed by a simple language model such that the general semantic features of a molecule can be extracted; and the graph representations perform better in handling the topology relationship of each atom. Moreover, we propose a framework that combines supervised learning and reinforcement learning algorithm to take a solid consideration of these two heterogeneous state representations of a molecule, which can fuse the information from both of them and extract more comprehensive features so that more sophisticated decisions can be made by the policy network. Our model also introduces two attention mechanisms, i.e., action-attention, and graph-attention, to further improve the performance. We conduct our experiments on a practical dataset, ZINC, and the experiment results demonstrate that our framework can outperform other baselines in the learning performance of molecule generation and chemical property optimization.	en_AU
dc.rights	The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.	en_AU
dc.subject	molecule generation	en_AU
dc.subject	graph representation	en_AU
dc.subject	property optimization	en_AU
dc.subject	SMILES representation	en_AU
dc.subject	attention mechanism	en_AU
dc.subject	reinforcement learning	en_AU
dc.title	Towards Molecule Generation with Heterogeneous States via Reinforcement Learning	en_AU
dc.type	Thesis	en_AU
dc.type.thesis	Masters by Research	en_AU
usyd.faculty	Faculty of Engineering, School of Computer Science	en_AU
usyd.degree	Master of Philosophy M.Phil	en_AU
usyd.awardinginst	The University of Sydney	en_AU