mlpy is an integrated toolkit for predictive modeling using Python and NumPy.
mlpy provides high-level procedures for building Data Analysis Protocols (DAPs) for common predictive modeling tasks. A DAP implements a modeling workflow and carries out the operations needed at each stage of that workflow.
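To make the idea of a protocol concrete, here is a minimal sketch of a workflow with four stages: resampling, training, prediction and error evaluation. It is written in plain Python so that it runs without mlpy installed. The helper names (kfold_indices, fit_centroids, predict, run_protocol) and the nearest-centroid classifier are assumptions chosen for illustration and are not part of the mlpy API.

```python
import random

def kfold_indices(n_samples, k, seed=0):
    """Split range(n_samples) into k disjoint test folds (hypothetical helper)."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def fit_centroids(X, y):
    """Learn one mean vector per class (a nearest-centroid classifier)."""
    centroids = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return centroids

def predict(centroids, x):
    """Return the label whose centroid is closest to x (squared Euclidean)."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: dist(centroids[label]))

def run_protocol(X, y, k=3, seed=0):
    """Resample, train, predict and evaluate; return the mean test error."""
    fold_errors = []
    for test_idx in kfold_indices(len(X), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = fit_centroids([X[i] for i in train_idx],
                              [y[i] for i in train_idx])
        wrong = sum(predict(model, X[i]) != y[i] for i in test_idx)
        fold_errors.append(wrong / len(test_idx))
    return sum(fold_errors) / k

# Toy data: two well-separated classes in two dimensions.
X = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [5.0, 5.1], [5.2, 4.9], [4.8, 5.0]]
y = [0, 0, 0, 1, 1, 1]
cv_error = run_protocol(X, y, k=3)
```

Each stage is a separate function, so a different classifier or resampling scheme can be swapped in without touching the rest of the protocol.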
mlpy provides a suite of tools for feature weighting and ranking, data resampling, error evaluation and experiment landscaping.
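As an illustration of feature weighting and ranking, the sketch below scores each feature of a two-class data set with a signal-to-noise style statistic (absolute difference of class means divided by the sum of class standard deviations) and then sorts the features by that score. The function names and the choice of statistic are assumptions made for this example and are not taken from mlpy.

```python
from statistics import mean, stdev

def feature_weights(X, y):
    """Weight each feature by |mean0 - mean1| / (std0 + std1) for labels 0/1."""
    weights = []
    for column in zip(*X):
        a = [v for v, t in zip(column, y) if t == 0]
        b = [v for v, t in zip(column, y) if t == 1]
        gap = abs(mean(a) - mean(b))
        spread = stdev(a) + stdev(b)
        if spread > 0:
            weights.append(gap / spread)
        else:
            # No within-class variation: perfect separation if the means differ.
            weights.append(float("inf") if gap > 0 else 0.0)
    return weights

def rank_features(weights):
    """Return feature indices ordered from highest to lowest weight."""
    return sorted(range(len(weights)), key=lambda j: weights[j], reverse=True)

# Toy data: only the second feature separates the two classes clearly.
X = [[1.0, 10.0, 0.5], [1.2, 11.0, 0.4], [0.9, 9.0, 0.6],
     [1.1, 20.0, 0.5], [1.0, 21.0, 0.7], [1.3, 19.0, 0.3]]
y = [0, 0, 0, 1, 1, 1]
weights = feature_weights(X, y)
ranking = rank_features(weights)
```

The ranking can then be used to keep only the top-scoring features before a model is trained.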
mlpy is aimed at numerical analysts, bioinformaticians, chemists, chemometricians, data scientists, statisticians and other data analysts working with data sets where machine learning methods can play a key role.
mlpy Features
mlpy is written in Python and builds on the NumPy package. The code is released under the GNU General Public License.
mlpy provides the following features:
High-level language commands that can perform many common data analysis tasks.
Tools for feature weighting and ranking, data resampling, error evaluation and experiment landscaping.
Support for embedding standard S3ML code in Python files.
Easy integration with C and Fortran code through Cython and F2PY.
mlpy is cross-platform: it runs on any platform that provides Python and NumPy, including Linux and Mac OS X.
mlpy Open Source License:
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.


mlpy provides routines and data structures to perform N-dimensional data exploration, data processing and machine learning. It integrates PyTables with NumPy and has an extremely modular design that allows development of user extensions for mlpy.
mlpy provides simple interfaces for many widely used data mining algorithms, including Linear Discriminant Analysis, Naïve Bayes, Neural Networks, Random Forests and Support Vector Machines. Support Vector Machines can be applied to both classification and regression problems.
mlpy provides a set of data preprocessing techniques, including feature weighting, data resampling and normalization. It can also generate new features from the original dataset.
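One common form of normalization is standardization, where each feature is shifted to zero mean and scaled to unit standard deviation using statistics taken from the training data only. The sketch below shows that idea in plain Python. The names fit_scaler and transform are hypothetical and do not describe mlpy's own interface.

```python
from statistics import mean, pstdev

def fit_scaler(X):
    """Compute the per-column mean and standard deviation of the training data."""
    columns = list(zip(*X))
    means = [mean(c) for c in columns]
    # A zero deviation is replaced by 1.0 so constant columns map to 0
    # instead of causing a division by zero.
    stds = [pstdev(c) or 1.0 for c in columns]
    return means, stds

def transform(X, means, stds):
    """Apply previously fitted statistics to any data set."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in X]

X_train = [[1.0, 100.0], [2.0, 200.0], [3.0, 300.0]]
X_test = [[2.0, 400.0]]

means, stds = fit_scaler(X_train)
Z_train = transform(X_train, means, stds)
# The test data reuses the training statistics so no information leaks from it.
Z_test = transform(X_test, means, stds)
```

Fitting on the training set and reusing the same statistics on the test set keeps the error estimate honest when normalization is one stage of a larger workflow.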
mlpy provides Python interfaces for the BioPython suite of tools (e.g. BLAST) for biological sequence analysis. It includes support for FastTree, SSU-rRNA and 16S-rRNA, k-mer counting and similar approaches.
mlpy provides a SciPy module that implements many common and popular numerical computation procedures.
mlpy provides support for Python’s dynamic import of extensions.
mlpy provides a complete, fine-grained and extensible error reporting system.
mlpy has extensive cross-platform compatibility.
mlpy is licensed under the GNU General Public License.


What’s New

mlpy uses numerical methods optimized for machine learning and is designed to scale to large datasets. It provides functions for feature extraction and dimensionality reduction, cross-validation, resampling, statistical inference, feature weighting, model selection, model visualization, and model optimization.
mlpy supports Python 2.7 and 3.x together with NumPy versions 1.6 through 1.15, including matrix, array, group, and SciPy sparse matrix support.

