Seq2Feature: a comprehensive web-based feature extraction tool

Bioinformatics. 2019 Nov 1;35(22):4797-4799. doi: 10.1093/bioinformatics/btz432.

Abstract

Motivation: Machine learning techniques require various descriptors from protein and nucleic acid sequences to understand/predict their structure and function as well as distinguishing between disease and neutral mutations. Hence, availability of a feature extraction tool is necessary to bridge the gap.

Results: We developed a comprehensive web-based tool, Seq2Feature, which computes 252 protein and 41 DNA sequence-based descriptors. These features include physicochemical, energetic and conformational properties of proteins, mutation matrices and contact potentials as well as nucleotide composition, physicochemical and conformational properties of DNA. We propose that Seq2Feature could serve as an effective tool for extracting protein and DNA sequence-based features as applicable inputs to machine learning algorithms.

Availability and implementation: https://www.iitm.ac.in/bioinfo/SBFE/index.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Internet
  • Machine Learning
  • Proteins
  • Software*

Substances

  • Proteins