Browsing by Author "Prasad, Ayush"

  • Prasad, Ayush (2024)
    Machine learning is increasingly being applied to model molecular data in scientific fields such as drug discovery, materials science, and atmospheric science. However, the high dimensionality of molecular features poses challenges when machine learning algorithms are applied directly. Dimensionality reduction methods can help reduce the feature space and create new informative features. In this thesis, we first review current methods for representing molecules for machine learning. We then discuss the importance of evaluating dimensionality reduction visualizations, and review and propose metrics for doing so. We present Gradient Boosting Mapping (GBMAP), a supervised dimensionality reduction method. Through experiments on benchmark datasets and the GeckoQ molecular dataset, we demonstrate that low-dimensional embeddings created by GBMAP can be used as features to significantly improve the performance of simpler, interpretable machine learning models.