Skip to main content

Working with CSV files

Calculating molecular data from uploading a CSV file

Format of uploaded CSV

The CSV to be uploaded should contain one column of SMILES.

Example of a CSV in table format

C
O=C=O
CCO
C1CCCCC1

Calculating molecular data from a CSV file is done by first selecting the descriptors one wants to calculate, then clicking the "Upload CSV" button. This will automatically calculate the contents of the CSV provided it is in the correct format.

Downloading the results

After providing the input field with one or many smiles, the user can select the descriptors to be calculated. Pressing the "Download CSV" button will download a CSV file containing the SMILES and the selected descriptors.

When the user inputs the following molecules and select the checkboxes for Number of Atoms, Molecular Weight, Polar Surface Area, and clogP, the system will produce the following CSV:

Input: CCN(C)C(=O)C, c1ccccc1Cl, C1COCCO1

SMILESNumber of C atomsNumber of Cl atomsNumber of H atomsNumber of N atomsNumber of O atomsNumber of atoms totalMolecularWeightPSAclogP
CCN(C)C(=O)C50111118101.08406397220.310.4846
c1ccccc1Cl6150012112.007977840.02.34
C1COCCO1408021488.05242949618.460.0332

The name of the downloaded CSV will contain the name of the current RDKit version: data_RDKit_[current-version].csv

Error handling

By selecting the "Exclude Invalid SMILES from result" option, any invalid SMILES will be omitted from the CSV output. If this option is not selected, a new column named "Error" will be added to the CSV to highlight the invalid SMILES entries.