| | | This software calculates a large number of molecular structure descriptors from the connection table of the submitted structure. The descriptors calculated fall into several categories. The outline below gives a listing of all the descriptors calculated by the current version. The list is organized by index sub-type. Additional information is available by clicking on the topic of interest. | | | | SIR Index Definition Categories | # | 1.0 Molecule Processing Information | | The first 8 columns of output from give information about the structure file that has been submitted for each compound. This information includes a sequential ID from processing. Several processing flags are also set by Molconn. These are different from the Molconn error messages that are discussed in section 5. Processing flags indicate that some change was made in the submitted structure before the indices were calculated and the flags serve to notify the user of the change. | | | | Molecule Processing Information Indices | | Index Name | Index Definition | | | | id | Sequential processing number supplied by Molconn | | | DiscAtom | An O- atom has been converted to OH because "Neutralize Structures" has been selected. | | | MultMole | An O- atom has been converted to OH because "Neutralize Structures" has been selected. | | | AromFlag | A ring encoded aromatic by lowercase smiles or by "4" bonds in a Mol file has been changes to non-aromatic because it does not comply with the simple Huckle rule. | | | NHp1Flag | An >NH+ atom has been converted to >N because "Neutralize Structures" has been selected. | | | NHp2Flag | An >NH2+ atom has been converted to >NH because "Neutralize Structures" has been selected. | | | NHp3FLAG | An NH3+ atom has been converted to NH2 because "Neutralize Structures" has been selected. | | | ONegFLAG | An O- atom has been converted to OH because "Neutralize Structures" has been selected. | | # | The Structure Information Representation (SIR) SUM indices may be used in conjunction with the formula weight (fw) to identify duplicates in a data set. The proceedure for identifying duplicates is given at the bottom of this section. | | Index Name | Index Definition | | | | SUMEST | Sum of E-State indices for the molecule | | | SUMCHI | Sum of Chi indices for the molecule | | | SUMKAPPA | Sum of Kappa indices for the molecule | | | Process for Identifying Duplicate Structures | | 1. | Process a set of structures to generate SUM descriptors. | | | **Note: process using "Neutralize" to have the ionized or protonated rendering be identified as a duplicate of the neutral rendering. | | 2. | Round fw, SUMEST, SUMCHI and SUMKAPPA to three decimal places | | | The output are single percision real numbers, and values for the SUM indices need to be rounded before diff evaluation as there may be non-significant differences beyond the fourth decimal place. | | 3. | Sort the data in ascending order on "fw" then "SUMEST" then "SUMCHI" | | Duplicate compounds will have the same value (to three decimal places) for all four descriptors. Compounds that have the same value for fw and SUMEST, but slightly different values for SUMCHI may be isomers. | # | 5.3 Elementary Structure Information | | Some elementary structure information indices are calculated for each molecule that is processed. This information includes the formula weight and counts of features such as rings, rotatable bonds and elements. | | List of Elementary Structure Information Indices | | Index Name | Index Definition | | | | | | nvx | Number of graph vertices (count of non-hydrogen atoms) | | | nelem | Number of different non-hydrogen elements | | | nrbond | Count of rotatable bonds | | | nrings | Count of individual rings (naphthalene has 2 rings) | | | ncirc | Count of ring circuits (naphthalene has 3 circuits) | | | numHBd | Count of hydrogen bond donating atoms | | | numwHBd | Count of weak hydrogen bond donating atoms ( CH X) | | | numHBa | Count of hydrogen bond accepting atoms | | | nasH | Count of hydrogen atoms | | | nasB | Count of boron atoms | | | nasC | Count of carbon atoms | | | nasN | Count of nitrogen atoms | | | nasO | Count of oxygen atoms | | | nasF | Count of fluorine atoms | | | nasSi | Count of silicon atoms | | | nasP | Count of phosphorous atoms | | | nasS | Count of sulfur atoms | | | nasCl | Count of chlorine atoms | | | nasBr | Count of bromine atoms | | | nasI | Count of iodine atoms | | | Nalk | Count of alkali and alkaline earth metal atoms | | | Noth | Count of atoms not in the above element list | | # | 5.4 Electrotopological State Indices | | The Electrotopological State or E-State structure indices are Structure-Information representation descriptors that encode the electron accessibility of atoms in a molecule. This encodes both the electron distribution and local topology at each atom in atom level and bond level indices. The E-State is based on an intrinsic state for each atom that is modified by all other atoms in the molecule. The atom and bond level indices are grouped in various ways to form atom-type, hydrogen atom-type, functional-group type, group-type, internal hydrogen bonding and bond type indices. Additional information about each type of index, including a comprehensive list of available indices, may be viewed by clicking on the corresponding link in the outline below. | | | | List of Electrotopological State Indices | | Information about the Bond-Type E-State indiecs is given in section 5.6. | | 5.4.1 Atom Level E-State and HE-State Indices | | The Atom-Level E-State, or "S", quantifies the build up or depletion of valence electrons modified by the steric accessibility as the electron accessibility at the atom and is calculated for each atom in the molecule. In this form, the E-State may be used as an atom-level descriptor. The formalism is extended to hydrogen atoms with the hydrogen atom level index "HS". The HS index is used most often with hydrogen atoms that have been explicitly enumerated in the structures. | | Atom Level E-State and HE-State Indices | | | | S1 | Atom Level E-State for atom 1 | | | HS1 | Hydrogen Atom Level HE-State for hydrogen atom 1 | | | | This atom level description can be used to compare the electron accessibility of topologically equivalent positions within structures with a common scaffold. The example below shows two penicillin analogues. The atom level E-State values for atoms 1-16 may be used as structure descriptors because these atom positions are topologically equivalent in both structures. The atom 5 position can be used despite the fact that structure B has no atom at the 5 position. For structure B, s5 will be set to s5 = 0. The 17 and 18 positions cannot be used because of the ambiguity of the numbering at these positions (more than one position in the structure could be numbered 17). | | | | 5.4.2 Global E-State Indices | | Global E-State indices have a value for every, or nearly every, molecule processed and tend to be based on whole molecule calculations. For this reason, global descriptors are often useful in providing broad classification information such as lipophilicity and polarity. Global E-State descriptors tend to focus on the electronic characteristics of a compound rather than size, shape and branching attributes which are encoded in the molecular connectivity indices. | | | | List of Global E-State Indices | | Index | Index Definition | Information Encoded | | | Hmax | Highest Atom Level HE-State | Site of the most polar hydrogen atom | | | Gmax | Highest Atom Level E-State | Possible site of electrophilic attack | | | Hmin | Lowest Atom Level HE-State | Site of the least polar hydrogen atom | | | Gmin | Lowest Atom Level HE-State | Possible site of nucleophilic attack | | | sumI | Sum of intrinsic state values | Heteroatom content | | | sumDELI | Sum of perterbation terms | Measure of non-hydrocarbon character | | | tets2 | Total topological E-State index | High discrimination uniqueness number | | | Qv | E-state polarity index | Whole Molecule Polarity | | | EPSA | E-State Polar Surface Area | E-State version of the Ertl descriptor | | | TPSA | Topological Polar Surface Area | Topological area of specific polar atoms in the molecule as defined and calculated by the Ertl method [1] | | | sumHet | Sum of heteroatom E-State | Sum of the total atom level E-State associated with heteroatoms. | | | sumCar | Sum of carbon atom E-State | Sum of the total atom level E-State associated with carbon atoms. | | | rvalHet | Ratio of heteroatom E-State | Ratio of the total atom level E-State associated with heteroatoms to the total atom level E-State for moleclue | | | rvalCar | Ratio of carbon atom E-State | Ratio of the total atom level E-State associated with carbon atoms to the total atom level E-State for moleclue | | | SumBI | Sum of bond intrinsic states | Bond I-values calculated by the standard mean of atom I-values | | | SumBIG | Sum of bond intrinsic states (G) | Bond I-values calculated by the geometric mean of atom I-values | | | SmDelBI | Sum of the absolute value of bond perturbations | Bond I-values calculated by the standard mean of atom I-values | | | SmDelBIG | Sum of the absolute value of bond perturbations (G) | Bond I-values calculated by the geometric mean of atom I-values | | References: 1. P. Ertl, B. Rohde, P. Selzer "Fast Calculation of Molecular Polar Surface Area as a Sum of Fragment-based Contributions and Its Application to the Prediction of Drug Transport Properties" J.Med.Chem.(2000), 43, 371 4- 3717. | # | 5.4.3 Atom-Type and Hydrogen Atom-Type E-State Indices | | The Atom-Type E-State indices are an extension of the Atom-Level E-State, similar to group additive schemes, in which an index value appears for each atom type in the molecule. The atom-type indices are based on the valence state (SssO is all sp3 divalent oxygen atoms) of the atom being described, irrespective of the functional groups that the atom may be associated with. The "Atom-Level" E-State discissed in section 5.4.1 quantifies the build up or depletion of valence electrons modified by the steric accessibility as the electron accessibility at the atom. The "Atom-Type" E-State qualifies the kinds of interactions that the specified electron accessibility may participate in. Because atoms of a given type may be expected to engage in similar interactions across a heterogenous group of structures, they provide the basis for application to a wide range of problems in which the E-State formalism is used without the need for atom-level superposition. | | | | List of Atom-Type E-State Indices | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | | SsCH3 | CH3 carbon atoms | | | SssCH2 | CH2 carbon atoms | | | SsssCH | >CH carbon atoms | | | SssssC | >C<</font> carbon atoms | | | | | SdsCH | =CH carbon atoms | | | SdssC | =C<</font> carbon atoms | | | | | StCH | CH carbon atoms | | | StsC | C carbon atoms | | | SaaCH | carbon atoms | | | SaasC | carbon atoms | | | SaaaC | carbon atoms | | | SsNH3p | NH3+ nitrogen atoms ("Neutralize Structures" is off) | | | SsNH2 | NH2 nitrogen atoms | | | SssNH2p | >NH2+ nitrogen atoms ("Neutralize Structures" is off) | | | SssNH | NH nitrogen atoms | | | SsssNHp | >NH+ nitrogen atoms ("Neutralize" is off) | | | SsssN | >N nitrogen atoms | | | | | SdsN | =N nitrogen atoms | | | SaaNH | nitrogen atoms (pyrole nitrogen) | | | SaaN | nitrogen atoms | | | SaasN | nitrogen atoms (substituted pyrole) | | | SssssNp | >N+<</font> nitrogen atoms | | | SdssNp | =N+<</font> nitrogen atoms | | | SaasNp | nitrogen atoms | | | SddsN | nitrogen atoms | | | StdN | N= nitrogen atoms (azide) | | | SdssN | =N<</font> nitrogen atoms (non-protonated N-oxide) | | | SsOH | OH oxygen atoms | | | SsOm | O- oxygen atoms ("Neutralize Structures" is off) | | | SssO | >O oxygen atoms (divalent sp3) | | | SdO | =O oxygen atoms (valence sp2) | | | SaaO | oxygen atoms (furan, oxazole, etc.) | | | SsF | F flourine atoms | | | SsCl | Cl chlorine atoms | | | SsBr | Br bromine atoms | | | SsI | I iodine atoms | | | SsSH | SH sulfur atoms | | | SssS | S sulfur atoms | | | | | SdssS | =S<</font> sulfur atoms | | | SdsSp | =S+ sulfur atoms (methylene blue etc.) | | | SaaS | sulfur atoms | | | SssssssS | sulfur atoms | | | SddssS | sulfur atoms | | | SsssP | P<</font> phosphorous atoms | | | SdsssP | phosphorous atoms | | | SsssssP | phosphorous atoms | | | List of Hydrogen Atom-Type E-State Indices | | Index Name | The sum of the hydrogen atom level HE-State values for all hydrogen atoms in the molecule of the specified type | | | SHvin | hydrogen atoms on vinyl carbon groups | | | SHtvin | hydrogen atoms on terminal vinyl saturated carbon groups | | | SHavin | hydrogen atoms on saturated vinyl group on an aromatic carbon | | | SHtCH | CH hydrogen atoms | | | SHsOH | OH hydrogen atoms | | | SHsNH2 | NH2 hydrogen atoms | | | SHssNH | >NH hydrogen atoms | | | | | SHsSH | SH hydrogen atoms | | # | 5.4.4 Functional-Group Type E-State Indices | | The Atom-Type E-State indices are an extension of the Atom-Level E-State, similar to group additive schemes, in which an index value appears for each atom type in the molecule. Some atom types are based on valence state (SssO is all divalent oxygen atoms) and some are based on organic functional groups (SEther is all ether oxygen atoms). The "Atom-Level" E-State discissed in section 5.3.1 quantifies the build up or depletion of valence electrons modified by the steric accessibility as the electron accessibility at the atom. The "Atom-Type" E-State qualifies the kinds of interactions that the specified electron accessibility may participate in. Because these atom types and functional groups are expected to engage in similar interactions across a heterogenous group of structures, they provide the basis for application to a wide range of problems in which the E-State formalism is used without the need for superposition. | | | | List of Functional-Group Type E-State Indices | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | | Ssp3NH2 | NH2 nitrogen bonded to an sp3 carbon atom | | | Ssp2NH2 | NH2 nitrogen bonded to an sp2 carbon atom | | | Ssp3NH | NH nitrogen bonded to 2 sp3 carbon atoms | | | Ssp2NH | NH nitrogen bonded to 2 sp2 carbon atoms | | | Ssp3N | N<</font> nitrogen bonded to 3 sp3 carbon atoms | | | Ssp2N | N<</font> nitrogen bonded to 3 sp2 carbon atoms | | | SAzide | N=N N azide nitrogen atoms (other rendering N=N+=N-) | | | SAzirid2 | NH nitrogen atoms in a 3-membered ring | | | SAzirid3 | N<</font> nitrogen atoms in a 3-membered ring | | | SGuano | sum of =NH nitrogen and =C carbon in guanodine groups | | | SBenzam | sum of =NH nitrogen and =C carbon in benzamadine groups | | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | SKetone | =O oxygen atoms in ketone groups | | | SEther | O oxygen atoms in ether groups | | | SOxirane | O oxygen atoms in oxirane groups (3-membered ring) | | | SOxetane | O oxygen atoms in oxetane groups (4-membered ring) | | | SEster | O oxygen and carbonyl carbon atoms in ester groups | | | Ssp3OH | OH oxygen atoms bonded to sp3 carbon | | | Ssp2OH | OH oxygen atoms bonded to sp2 carbon | | | SAnhyde | =O oxygen and =C<</font> carbon atoms in acid anhydride groups | | | | Oxygen-Nitrogen Functional Groups | | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | SNoxide | N-oxide nitrogen and dative bond oxygen O | | | SPNoxide | pyridine N-oxide nitrogen and dative bond oxygen O | | | SNitro | nitro group nitrogen atom and oxygen atoms | | | SNitrate | nitrate group O oxygen atoms | | | SOxime | nitrogen and oxygen atoms in oxime groups R C=N OH | | | SNdO | nitrogen and oxygen in terminal N-oxide groups R C N=O | | | SCarbacd | nitrogen and OH oxygen atoms in carbamaic acid groups | | | SCarbam | nitrogen and >O oxygen atoms in carbamate groups | | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | SCar | carbonyl carbon atoms in carboxylic acid groups | | | SSul | sulfur and =O oxygen atoms in sulfuric acid groups | | | SPhe | carbon atoms adjacent to the OH in phenol groups | | | SGP4OH | >P= phosphorous atoms in phosphoric acid groups (1 OH) | | | SGP3OH | >P= phosphorous atoms in phosphorous acid groups (1 OH) | | | SGP4OH2 | phosphorous atoms in phosphoric acid groups (2 OH) | | | SGP3OH2 | phosphorous atoms in phosphorous acid groups (2 OH) | | | | Aniline Functional Groups | | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | SArNHAr | NH secondary di-aniline nitrogen atoms (2 alpha aromatics) | | | SArNAr | >N tertiary di-aniline nitrogen atoms (2 alpha aromatics) | | | SArNH3p | NH3+ primary aniline nitrogen atoms ("Neutralize" off) | | | SArNH2 | NH2 primary aniline nitrogen atoms | | | SArNH2p | >NH2+ secondary aniline nitrogen atoms ("Neutralize" off) | | | SArNH | >NH secondary aniline nitrogen atoms | | | SArNHp | >NH+ tertiary aniline nitrogen atoms ("Neutralize" off) | | | SArN | >N tertiary aniline nitrogen atoms | | | SArNp | >N+<</font> quaternary aniline nitrogen atoms | | | | Urea, Thiourea, Amide, Thioamide and Sulfonamide | | | Index Name | The sum of the atom level E-State values for all atoms in the molecule of the specified type | | | SUrea22 | NH (C=O) NH urea nitrogen atoms (2°, 2°) | | | SUrea23 | NH (C=O) N<</font> urea nitrogen atoms (2°, 3°) | | | SUrea33 | >N (C=O) N<</font> urea nitrogen atoms (3°, 3°) | | | StUrea2 | NH (C=O) NH2 terminal urea nitrogen atoms (2°, 1°) | | | StUrea3 | >N (C=O) NH2 terminal urea nitrogen atoms (3°, 1°) | | | SSUrea22 | NH (C=S) NH thiourea nitrogen atoms (2°, 2°) | | | SSUrea23 | NH (C=S) N<</font> thiourea nitrogen atoms (2°, 3°) | | | SSUrea33 | >N (C=S) N<</font> thiourea nitrogen atoms (3°, 3°) | | | StSUrea2 | NH (C=S) NH2 terminal thiourea nitrogen atoms (2°, 1°) | | | StSUrea2 | >N (C=S) NH2 terminal thiourea nitrogen atoms (3°, 1°) | | | SAmide1 | R (C=O) NH2 primary amide nitrogen atoms | | | SAmide2 | R (C=O) NH secondary amide nitrogen atoms | | | SAmide3 | R (C=O) N<</font> tertiary amide nitrogen atoms | | | SSAmide1 | R (C=S) NH2 primary thioamide nitrogen atoms | | | SSAmide2 | R (C=S) NH secondary thioamide nitrogen atoms | | | SSAmide3 | R (C=S) N<</font> tertiary thioamide nitrogen atoms | | | SSOAmid1 | R (O=S=O) NH2 primary sulfonamide nitrogen atoms | | | SSOAmid2 | R (O=S=O) NH secondary sulfonamide nitrogen atoms | | | SSOAmid3 | R (O=S=O) N<</font> tertiary sulfonamide nitrogen atoms | | | SN1SO2N2 | NH (O=S=O) NH2 di-sulfonamide nitrogen atoms (1°, 2°) | | | SN1SO2N3 | >N (O=S=O) NH2 di-sulfonamide nitrogen atoms (1°, 3°) | | | SN2SO2N2 | NH (O=S=O) NH di-sulfonamide nitrogen atoms (2°, 2°) | | | SN2SO2N3 | >N (O=S=O) NH di-sulfonamide nitrogen atoms (2°, 3°) | | | SN3SO2N3 | >N (O=S=O) NH<</font> di-sulfonamide nitrogen atoms (3°, 3°) | | # | 5.4.5 Group-Type and Hydrogen Group-Type E-State Indices | | Group types allow for combining the contributions of related atom types or functional groups into a single broad-scope parameter, allowing structure features to appear in a model in more than one form to reflect multiple modes of interaction. The use of Group-Types allows for information about under-represented structure features to be included in a model in cases where low population level might otherwise result in the feature being overlooked by the feature selection algorithm. Group-Types can be especially useful when modeling small datasets where the total number of input parameters that can be used is limited by statistical concerns. | | | | List of Group-Type E-State Indices | | Index Name | The sum of the atom level E-State values for all atoms in the molecule that are part of the indicated groups | | | SHBa | all hydrogen bond accepting atoms | | | SCarom | all aromatic carbon atoms | | | ScnjTCH | all conjugated terminal carbon atoms (=CH2 and CH) | | | SotArom | all heteroaromatic atoms, excluding pyrrole | | | SotAromH | all heteroaromatic atoms | | | SANoxide | all N-oxide nitrogen and oxygen atoms | | | SsFCl | all F flourine and Cl chlorine atoms | | | SsBrI | all Br bromine and I iodine atoms | | | SallNp | quaternary amine, pyridinium and animinum nitrogen atoms | | | SallAzir | SAzirid2 + SAzirid3 (2° and 3° aziridine nitrogen atoms) | | | SallOxan | SOxirane + SOxetane (sp3 oxygen in a 3 or 4 membered ring) | | | Ssp3NOfr | sp3 nitrogen or oxygen free radical system (SallAzir + SallOxan) | | | sumUrea | all nitrogen atoms in urea groups (1°, 2°, 3°) | | | sumSUrea | all nitrogen atoms in thiourea groups (1°, 2°, 3°) | | | sumAmide | nitrogen atoms in amide groups (1°, 2°, 3°) | | | sumSAmid | nitrogen atoms in thioamide groups (1°, 2°, 3°) | | | sumSO2Am | nitrogen atoms in sulfonamide groups (1°, 2°, 3°) | | | List of Hydrogen Group-Type E-State Indices | | Index Name | The sum of the hydrogen atom level HE-State values for all hydrogen atoms in the molecule of the specified type | | | SHother | hydrogen atoms in all CHX groups. | | | SHCsats | hydrogen atoms on sp3 carbons bonded to other sp3 carbons | | | SHCsatu | hydrogen atoms on sp3 carbons bonded to sp2 carbons | | | SHarom | hydrogen atoms on any aromatic carbon | | | SHHBd | hydrogen atoms on hydrogen bond donating atoms | | | SHwHBd | hydrogen atoms in all Halogen CHX groups (weak donors) | | # | 5.4.6 Internal Hydrogen Bonding E-State Indices | | The E-State hydrogen bonding indices have been designed to encode the potential for pairs of heteroatoms within a molecule to engage in internal hydrogen bonding. The index is constructed by calculating the product of the atom-level E-State for the acceptor atom and the atom-level hydrogen HE-State of the hydrogen atom on the donor. SHBint indices are enumerated for donor and acceptor paris that are separated by a given count of bonds. The SHBint3, SHBint4, SHBint5 and THBint3, THBint4, THBint5 indices represent atom pairs that can for stable ring systems as a result of internal hydrogen bonds. A screening criteria has been applied to these pairs to remove sets where an internal hydrogen bond would not be possible (non-compliant), such as donors and acceptors that are meta substituted neighbors on a planar ring. Although the SHBint2 and SHBint7-SHBint10 indices do not represent atom pairs that can form internal hydrogen bonds, these indices have shown up as important in studying properties such as "drug likeness". The SHBint(n) indices give the largest product of E-State and HE-State of the donor and acceptor pairs separated by n bonds. The SHBI(n) indices give the cumulative sum of E-State and HE-State products for all donor and acceptor pairs separated by n bonds. | | List of Internal Hydrogen Bonding E-State Indices | | Index Name | Product of E-State and HE-State or the count of donor and acceptor pairs separated by a given count of bonds | | | | SHBint2 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 2 skeletal bonds | | | SHBint3 | Largest product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 3 skeletal bonds | | | SHBint4 | Largest product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 4 skeletal bonds | | | SHBint5 | Largest product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 5 skeletal bonds | | | SHBint6 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | SHBint7 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 7 skeletal bonds | | | SHBint8 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 8 skeletal bonds | | | SHBint9 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 9 skeletal bonds | | | SHBint10 | Largest product of E-State and HE-State from all acceptor and donor pairs separated by 10 skeletal bonds | | | THBint2 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 2 skeletal bonds | | | THBint3 | Sum of the product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 3 skeletal bonds | | | THBint4 | Sum of the product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 4 skeletal bonds | | | THBint5 | Sum of the product of E-State and HE-State from all internal hydrogen bond compliant acceptor and donor pairs separated by 5 skeletal bonds | | | THBint6 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | THBint7 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | THBint8 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | THBint9 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | THBint10 | Sum of the product of E-State and HE-State from all acceptor and donor pairs separated by 6 skeletal bonds | | | THB345 | THBint3 + THBint4 + THBint5 | | | NHBint2 | Count of acceptor and donor pairs separated by 2 skeletal bonds | | | NHBint3 | Count of acceptor and donor pairs separated by 3 skeletal bonds | | | NHBint4 | Count of acceptor and donor pairs separated by 4 skeletal bonds | | | NHBint5 | Count of acceptor and donor pairs separated by 5 skeletal bonds | | | NHBint6 | Count of acceptor and donor pairs separated by 6 skeletal bonds | | | NHBint7 | Count of acceptor and donor pairs separated by 7 skeletal bonds | | | NHBint8 | Count of acceptor and donor pairs separated by 8 skeletal bonds | | | NHBint9 | Count of acceptor and donor pairs separated by 9 skeletal bonds | | | NHBint10 | Count of acceptor and donor pairs separated by 10 skeletal bonds | | |
| # | 5.2 Molecular Connectivity and Kappa Shape Indices | | The molecular connectivity and kappa shape indices provide information about the kind of skeletal structure that a compound is based on. The low order chi indices are related to molecular size, volume and surface area. The higher order chi indices encode information about skeletal branching patterns, ring substitution and fused ring architecture. | | List of Simple Chi Molecular Connectivity Indices | | Index Name | Index Definition | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | xpc4 | Simple Chi path cluster 4 | | | | | | | | | | | | | | | | | xch10 | Simple Chi chain 10 | | | List of Valence Chi Molecular Connectivity Indices | | Index Name | Index Definition | | | | | | | | | | | | | | | | | | | | | | | | xvp10 | Valence Chi path 10 | | | xvc3 | Valence Chi cluster 3 | | | xvc4 | Valence Chi cluster 4 | | | xvpc4 | Valence Chi path cluster 4 | | | xvch3 | Valence Chi chain 3 | | | xvch4 | Valence Chi chain 4 | | | xvch5 | Valence Chi chain 5 | | | xvch6 | Valence Chi chain 6 | | | xvch7 | Valence Chi chain 7 | | | xvch8 | Valence Chi chain 8 | | | xvch9 | Valence Chi chain 9 | | | xvch10 | Valence Chi chain 10 | | | List of Simple Difference Chi Molecular Connectivity Indices | | Index Name | Index Definition | | | | dx0 | Simple difference Chi 0 | | | dx1 | Simple difference Chi 1 | | | dx2 | Simple difference Chi 2 | | | dxp3 | Simple difference Chi path 3 | | | dxp4 | Simple difference Chi path 4 | | | dxp5 | Simple difference Chi path 5 | | | dxp6 | Simple difference Chi path 6 | | | dxp7 | Simple difference Chi path 7 | | | dxp8 | Simple difference Chi path 8 | | | dxp9 | Simple difference Chi path 9 | | | dxp10 | Simple difference Chi path 10 | | | List of Valence Difference Chi Molecular Connectivity Indices | | Index Name | Index Definition | | | | dxv0 | Valence difference Chi 0 | | | dxv1 | Valence difference Chi 1 | | | dxv2 | Valence difference Chi 2 | | | dxvp3 | Valence difference Chi path 3 | | | dxvp4 | Valence difference Chi path 4 | | | dxvp5 | Valence difference Chi path 5 | | | dxvp6 | Valence difference Chi path 6 | | | dxvp7 | Valence difference Chi path 7 | | | dxvp8 | Valence difference Chi path 8 | | | dxvp9 | Valence difference Chi path 9 | | | dxvp10 | Valence difference Chi path 10 | | | List of Kappa Shape Indices | | Index Name | Index Definition | | | | nclass | Number of symmetry classes | | | k0 | Kappa alpha zero index | | | | | | | | | phia | phi kappa alpha molecular flexibility index | | | totop | Total topological molecular connectivity uniqueness index | | | knotp | xc3 minus xpc4 gives xc3 subgraph distance from xpc4 | | | knotpv | xvc3 minus xvpc4 gives xvc3 subgraph distance from xvpc4 | | # | 5.5 Bond-Type E-State Indices | | The Bond-Type indices are an extension of the E-State formalism where a bond is characterized using the two atoms that define the bond. The intrinsic state value for the bond is the geometric mean of the intrinsic state values for the two atoms in the bond. As in the atom-level formalism, the actual bond E-State value is defined as the perturbed intrinsic state of the bond, that is the intrinsic state plus the sum of perturbations by all other bonds in the molecule. The bond-level values are summed for all identical bonds in the molecule to form the Bond-Type indices. | | | | List of Bond-Type E-State Categories | # | 5.5.1 Carbon to Carbon Bond-Type Indices | | This page gives the list of Bond-Type E-State indices for bonds between carbon atoms and other carbon atoms. | | | | List of Carbon to Carbon Bond-Type E-State Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1C1C1 | single | CH3 atoms and CH3 atoms | | | e1C1C2 | single | >CH2 atoms and CH3 atoms | | | e1C1C3 | single | >CH atoms and CH3 atoms | | | e1C1C4 | single | >CH<</font> atoms and CH3 atoms | | | e1C1C2d | single | =CH atoms and CH3 atoms | | | e1C1C3d | single | =C<</font> atoms and CH3 atoms | | | | | e1C1C3a | single | atoms and CH3 atoms | | | e1C2C2 | single | >CH2 atoms and >CH2 atoms | | | e1C2C3 | single | >CH atoms and >CH2 atoms | | | e1C2C4 | single | >C<</font> atoms and >CH2 atoms | | | e1C2C2d | single | =CH atoms and >CH2 atoms | | | e1C2C3ds | single | =C<</font> atoms and >CH2 atoms | | | e1C2C2t | single | C atoms and >CH2 atoms | | | e1C2C3aa | single | atoms and >CH2 atoms | | | e1C3C3 | single | >CH atoms and >CH atoms | | | e1C3C4 | single | >C<</font> atoms and >CH atoms | | | e1C2C3sd | single | =CH atoms and >CH atoms | | | e1C3C3d | single | =C<</font> atoms and >CH atoms | | | | | e1C3C3as | single | atoms and >CH atoms | | | e1C4C4 | single | >C<</font> atoms and >C<</font> atoms | | | e1C2C4d | single | =CH atoms and >C<</font> atoms | | | e1C3C4d | single | =C<</font> atoms and >C<</font> atoms | | | e1C2C4t | single | C atoms and >C<</font> atoms | | | e1C3C4a | single | atoms and >C<</font> atoms | | | e2C1C1 | double | =CH2 atoms and =CH2 atoms | | | e2C1C2s | double | =CH atoms and =CH2 atoms | | | e2C1C3s | double | =C<</font> atoms and =CH2 atoms | | | e2C1C2d | double | =C= atoms and =CH2 atoms | | | e1C2C2dd | single | =CH atoms and =CH atoms | | | e2C2C2ss | double | =CH atoms and =CH atoms | | | e1C2C3d | single | =C<</font> atoms and =CH atoms | | | e2C2C3s | double | =C<</font> atoms and =CH atoms | | | e2C2C2ds | double | =C= atoms and =CH atoms | | | | | e1C2C3a | single | atoms and =CH atoms | | | e1C3C3dd | single | =C<</font> atoms and =C<</font> atoms | | | e2C3C3s | double | =C<</font> atoms and =C<</font> atoms | | | e2C2C3dd | double | =C= atoms and =C<</font> atoms | | | e1C2C3dt | single | C atoms and =C<</font> atoms | | | e1C3C3aa | single | atoms and =C<</font> atoms | | | e2C2C2d | double | =C= atoms and =C= atoms | | | e3C1C1 | triple | CH atoms and CH atoms | | | | | | | | | | | eaC2C2a | aromatic | atoms and atoms | | | eaC2C3s | aromatic | atoms and atoms | | | eaC2C3a | aromatic | atoms and atoms | | | e1C3C3a | single | atoms and atoms | | | eaC3C3s | aromatic | atoms and atoms | | | eaC3C3as | aromatic | atoms and atoms | | | eaC3C3a | aromatic | atoms and atoms | | # | 5.5.1 Carbon to Nitrogen, Onium and Onium Pseudoion Bonds | | This page gives the list of Bond-Type E-State indices for bonds between carbon atoms and nitrogen, onuim and onium pseudoion atoms. | | | | List of Carbon to Nitrogen Bond Indices | | List of Carbon to Onium Bond Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1C1N4p | single | >N+<</font> atoms and CH3 atoms | | | e1C2N4p | single | >N+<</font> atoms and >CH2 atoms | | | e1C3N4p | single | >N+<</font> atoms and >CH atoms | | | e1C4N4p | single | >N+<</font> atoms and >C<</font> atoms | | | e1C2N4dp | single | >N+<</font> atoms and =CH atoms | | | e1C2N4tp | single | >N+<</font> atoms and C atoms | | | e1C3N4ap | single | >N+<</font> atoms and atoms | | | List of Carbon to Onium Pseudoion Bond Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1C1N1p | single | NH3+ atoms and CH3 atoms | | | e1C1N2p | single | >NH2+ atoms and CH3 atoms | | | e1C1N3p | single | >NH+ atoms and CH3 atoms | | | e1C2N1p | single | NH3+ atoms and >CH2 atoms | | | e1C2N2p | single | >NH2+ atoms and >CH2 atoms | | | e1C2N3p | single | >NH+ atoms and >CH2 atoms | | | e1C3N1p | single | NH3+ atoms and >CH atoms | | | e1C3N2p | single | >NH2+ atoms and >CH atoms | | | e1C3N3p | single | >NH+ atoms and >CH atoms | | | e1C4N1p | single | NH3+ atoms and >C<</font> atoms | | | e1C4N2p | single | >NH2+ atoms and >C<</font> atoms | | | e1C4N3p | single | >NH+ atoms and >C<</font> atoms | | | e1C2N1dp | single | NH3+ atoms and =CH atoms | | | e1C2N2dp | single | >NH2+ atoms and =CH atoms | | | e1C2N3dp | single | >NH+ atoms and =CH atoms | | | e1C3N1dp | single | NH3+ atoms and =C<</font> atoms | | | e1C3N2dp | single | >NH2+ atoms and =C<</font> atoms | | | e1C3N3dp | single | >NH+ atoms and =C<</font> atoms | | | | | e1C2N2tp | single | >NH2+ atoms and C atoms | | | | | e1C3N1ap | single | NH3+ atoms and atoms | | | e1C3N2ap | single | >NH2+ atoms and atoms | | # | 5.5.3 Carbon to Oxygen and Halogen Bond-Type Indices | | This page gives the list of Bond-Type E-State indices for bonds between carbon atoms and oxygen and halogen atoms. | | | | List of Carbon to Oxygen Bond-Type Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1C1O1 | single | OH atoms and CH3 atoms | | | e1C1O2 | single | >O atoms and CH3 atoms | | | e1C2O1 | single | OH atoms and >CH2 atoms | | | e1C2O2 | single | >O atoms and >CH2 atoms | | | e1C3O1 | single | OH atoms and >CH atoms | | | e1C3O2 | single | >O atoms and >CH atoms | | | e1C4O1 | single | OH atoms and >C<</font> atoms | | | e1C4O2 | single | >O atoms and >C<</font> atoms | | | e2C1O1 | double | =O atoms and =CH2 atoms | | | e1C2O1d | single | OH atoms and =CH atoms | | | e2C2O1 | double | =O atoms and =CH atoms | | | e1C2O2d | single | >O atoms and =CH atoms | | | e1C3O1d | single | OH atoms =C<</font> atoms | | | e2C3O1s | double | =O atoms and =C<</font> atoms | | | e1C3O2d | single | >O atoms and =C<</font> atoms | | | e2C2O1dd | double | =O atoms and =C= atoms | | | | | e1C2O2t | single | >O atoms and C atoms | | | eaC2O2a | aromatic | atoms and atoms | | | e1C3O1a | single | OH atoms and atoms | | | e1C3O2a | single | >O atoms and atoms | | | eaC3O2s | aromatic | atoms and atoms | | | eaC3O2a | aromatic | atoms and atoms | | | List of Carbon to Halogen Bond-Type Indices | # | 5.5.4 Carbon to Phosphorous and Sulfur | | This page gives the list of Bond-Type E-State indices for bonds between carbon atoms and phosphorous and sulfur atoms. | | | | List of Carbon to Phosphorous Bond Indices | | List of Carbon Sulfur Bond Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1C1S1 | single | SH atoms and CH3 atoms | | | e1C1S2 | single | >S atoms and CH3 atoms | | | e1C1S6 | single | atoms and CH3 atoms | | | e1C1S4dd | single | atoms and CH3 atoms | | | e1C1S3d | single | =S<</font> atoms and CH3 atoms | | | e1C2S1 | single | SH atoms and >CH2 atoms | | | e1C2S2 | single | >S atoms and >CH2 atoms | | | e1C2S6 | single | atoms and >CH2 atoms | | | e1C2S3dd | single | atoms and >CH2 atoms | | | e1C2S3d | single | =S<</font> atoms and >CH2 atoms | | | e1C3S1 | single | SH atoms and >CH atoms | | | e1C3S2 | single | >S atoms and >CH atoms | | | e1C3S6 | single | atoms and >CH atoms | | | e1C3S4s | single | atoms and >CH atoms | | | e1C3S3s | single | =S<</font> atoms and >CH atoms | | | e1C4S1 | single | SH atoms and >C<</font> atoms | | | e1C4S2 | single | >S atoms and >C<</font> atoms | | | e1C4S6 | single | atoms and >C<</font> atoms | | | e1C4S4dd | single | atoms and >C<</font> atoms | | | e1C4S3d | single | =S<</font> atoms and >C<</font> atoms | | | e2C1S1 | double | =S atoms and =CH2 atoms | | | e2C1S3s | double | =S<</font> atoms and =CH2 atoms | | | e1C2S1d | single | SH atoms and =CH atoms | | | e1C2S2d | single | >S atoms and =CH atoms | | | e1C2S6d | single | atoms and =CH atoms | | | e1C2S4dd | single | atoms and =CH atoms | | | e2C2S1s | double | =S atoms and =CH atoms | | | e1C2S3d | double | =S<</font> atoms and =CH atoms | | | e2C2S3s | double | =S<</font> atoms and =CH atoms | | | e1C3S1d | single | SH atoms and =C<</font> atoms | | | e1C3S2s | single | >S atoms and =C<</font> atoms | | | e1C3S6 | single | atoms and =C<</font> atoms | | | e2C3S4sd | double | atoms and =C<</font> atoms | | | e1C3S4dd | single | atoms and =C<</font> atoms | | | e2C3S1s | double | =S atoms and =C<</font> atoms | | | e1C3S3d | double | =S<</font> atoms and =C<</font> atoms | | | e2C3S3s | double | =S<</font> atoms and =C<</font> atoms | | | e2C2S1 | double | =S atoms and =C= atoms | | | e2C2S3s | double | =S<</font> atoms and =C= atoms | | | e2C2S4ds | double | atoms and =C= atoms | | | | | e1C2S2t | single | >S atoms and C atoms | | | | | e1C2S3dt | single | =S<</font> atoms and C atoms | | | | | eaC2S2a | aromatic | atoms and atoms | | | e1C3S1a | single | SH atoms and atoms | | | e1C3S2a | single | >S atoms and atoms | | | e1C3S6aa | single | atoms and atoms | | | e1C3S3da | single | =S<</font> atoms and atoms | | | e1C3S4da | single | atoms and atoms | | | eaC3S2s | aromatic | atoms and atoms | | | eaC3S2a | aromatic | atoms and atoms | | # | 5.5.5 Nitrogen to Heteroatom Bonds | | This page gives the list of Bond-Type E-State indices for bonds between nitrogen atoms and other heteroatoms. | | | | List of Nitrogen to Nitrogen, Onium, and Onium pseudoion Bond Indices | | List of Oxygen to Nitrogen, Onium and Onium Pseudoion Bonds Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1N1O2 | single | >O atoms and NH2 atoms | | | e1N2O1 | single | OH atoms and >NH atoms | | | e1N2O2 | single | >O atoms and >NH atoms | | | e1N3O1 | single | OH atoms and >N atoms | | | e1N3O2 | single | >O atoms and >N atoms | | | e1N3O2dd | single | >O atoms and atoms | | | e2N3O1s | double | =O atoms and atoms | | | e1N2O1d | single | OH atoms and =N atoms | | | e1N2O2d | single | >O atoms and =N atoms | | | e2N2O1s | single | =O atoms and =N atoms | | | eaN2O2 | aromatic | atoms and atoms | | | eaN2O2a | aromatic | atoms and atoms | | | e1N3O2aa | single | >O atoms and atoms | | | eaN3O2s | aromatic | atoms and atoms | | | e1N1O2p | single | >O atoms and NH3+ atoms | | | e1N2O2p | single | >O atoms and >NH2+ atoms | | | List of Sulfur to Nitrogen, Onium and Onium Pseudoion Bonds Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1N1S4dd | single | =S<</font> atoms and NH2 atoms | | | e1N1S3d | single | atoms and NH2 atoms | | | e1N2S3d | single | =S<</font> atoms and >NH atoms | | | e1N2S4dd | single | atoms and >NH atoms | | | e1N3S2 | single | >S atoms and >N atoms | | | e1N3S3d | single | =S<</font> atoms and >N atoms | | | e1N3S4dd | single | atoms and >N atoms | | | e2N1S3s | double | =S<</font> atoms and =NH atoms | | | e2N1S4s | double | atoms and =NH atoms | | | e1N2S2 | single | >S atoms and =N atoms | | | e2N2S1 | double | =S atoms and =N atoms | | | e1N2S3ds | single | =S<</font> atoms and =N atoms | | | e2N2S3s | double | =S<</font> atoms and =N atoms | | | e1N2S4s | single | atoms and =N atoms | | | e2N2S4s | double | atoms and =N atoms | | | eaN2S2 | aromatic | atoms and atoms | | | eaN2S2a | aromatic | atoms and atoms | | | e1N4S2p | single | >S atoms and >N+<</font> atoms | | | e1N4S3dp | single | =S<</font> atoms and >N+<</font> atoms | | | e1N4S4dp | single | atoms and >N+<</font> atoms | | | e1N1S2p | single | >S atoms and NH3+ atoms | | | e1N1S3dp | single | =S<</font> atoms and NH3+ atoms | | | e1N1S4dp | single | atoms and NH3+ atoms | | | e1N2S2p | single | >S atoms and >NH2+ atoms | | | e1N2S3dp | single | =S<</font> atoms and >NH2+ atoms | | | e1N2S4dp | single | atoms and >NH2+ atoms | | | e1N3S2p | single | >S atoms and >NH+ atoms | | | e1N3S3dp | single | =S<</font> atoms and >NH+ atoms | | | e1N3S4dp | single | atoms and >NH+ atoms | | | List of Phosphorous to Nitrogen, Onium and Onium Pseudoion Indices | | List of Halogen to Nitrogen Bond Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1N3F | single | F atoms and >N atoms | | | e1N2Fd | single | F atoms and =N atoms | | # | 5.5.4 Additional Heteroatom Bonds | | This page gives the list of Bond-Type E-State indices for bonds between heteroatoms not involving nitrogen. | | | | List of Sulfur to Sulfer Bond Indices | | Index Name | bond order | Sum of the E-State values for all bonds connecting atoms of the specified atom types | | | e1S1S1 | single | SH atoms and SH atoms | | | e1S1S2 | single | >S atoms and SH atoms | | | e1S1S3d | single | =S<</font> atoms and SH atoms | | | e1S1S4dd | single | atoms and SH atoms | | | e1S2S2 | single | >S atoms atoms and >S atoms | | | e1S2S3d | single | =S<</font> atoms and >S atoms | | | & |
|
|