ChemAICluster

A Clustering and Visualization Tool for Molecular Data

Introduction

ChemAICluster is a tool designed to perform clustering on molecular data based on SMILES strings. This tool uses various clustering algorithms like k-means and hierarchical clustering to group molecules based on their similarity. It also provides visualizations such as cluster plots and dendrograms.

How to Use

  1. Upload File: Prepare your dataset in CSV format. The CSV file should have at least two columns: an ID column and a SMILES string column.
  2. Select Clustering Algorithm: Choose between k-means and hierarchical clustering. You will also have the option to specify the number of clusters.
  3. Submit: Click the 'Submit' button to perform clustering. The app will process the data and return the cluster assignments, which can be downloaded as a CSV file.
  4. Visualization: After clustering, you will be able to view the 2D cluster plot (PCA) and dendrogram (if hierarchical clustering is selected).

Features

Output

Once the clustering is complete, you will be able to download the following:

References

Contact

If you have any questions or need support, feel free to contact us at [email protected].