Towards Molecule Generation with Heterogeneous States via Reinforcement Learning
Field | Value | Language |
dc.contributor.author | Shi, Fangzhou | |
dc.date.accessioned | 2020-05-21 | |
dc.date.available | 2020-05-21 | |
dc.date.issued | 2020-01-01 | |
dc.identifier.uri | https://hdl.handle.net/2123/22335 | |
dc.description.abstract | De novo molecular design and generation are frequently prescribed in the field of chemistry and biology, for it plays a critical role in maintaining the prosperity of the chemical industry and benefiting the drug discovery. Nowadays, many significant problems in this field are based on the philosophy of designing molecular structures towards specific desired properties. This research is very meaningful in both medical and AI fields, which can benefits novel drug discovery for some diseases. However, It remains a challenging task due to the large size of chemical space. In recent years, reinforcement learning-based methods leverage graphs to represent molecules and generate molecules as a decision making process. However, this vanilla graph representation may neglect the intrinsic context information with molecules and limits the generation performance accordingly. In this paper, we propose to augment the original graph states with the SMILES context vectors. As a result, SMILES representations are easily processed by a simple language model such that the general semantic features of a molecule can be extracted; and the graph representations perform better in handling the topology relationship of each atom. Moreover, we propose a framework that combines supervised learning and reinforcement learning algorithm to take a solid consideration of these two heterogeneous state representations of a molecule, which can fuse the information from both of them and extract more comprehensive features so that more sophisticated decisions can be made by the policy network. Our model also introduces two attention mechanisms, i.e., action-attention, and graph-attention, to further improve the performance. We conduct our experiments on a practical dataset, ZINC, and the experiment results demonstrate that our framework can outperform other baselines in the learning performance of molecule generation and chemical property optimization. | en_AU |
dc.rights | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en_AU |
dc.subject | molecule generation | en_AU |
dc.subject | graph representation | en_AU |
dc.subject | property optimization | en_AU |
dc.subject | SMILES representation | en_AU |
dc.subject | attention mechanism | en_AU |
dc.subject | reinforcement learning | en_AU |
dc.title | Towards Molecule Generation with Heterogeneous States via Reinforcement Learning | en_AU |
dc.type | Thesis | en_AU |
dc.type.thesis | Masters by Research | en_AU |
usyd.faculty | Faculty of Engineering, School of Computer Science | en_AU |
usyd.degree | Master of Philosophy M.Phil | en_AU |
usyd.awardinginst | The University of Sydney | en_AU |
Associated file/s
Associated collections