Projects that are tagged with large scale.


Logo Indefinite Core Vector Machine 0.1

by fmschleif - January 5, 2018, 22:35:38 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 8444 views, 1589 downloads, 0 subscriptions

About: Armadillo/C++ implementation of the Indefinite Core Vector Machine

Changes:

Some tiny errors in the Nystroem demo scripts - should be ok now Initial Announcement on mloss.org.


Logo bufferkdtree 1.3

by fgieseke - October 20, 2017, 11:39:59 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 6639 views, 2015 downloads, 0 subscriptions

About: The bufferkdtree package is a Python library that aims at accelerating nearest neighbor computations using both k-d trees and modern many-core devices such as graphics processing units (GPUs).

Changes:

Initial Announcement on mloss.org.


Logo Universal Java Matrix Package 0.3.0

by arndt - July 31, 2015, 14:23:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 28657 views, 5473 downloads, 0 subscriptions

About: The Universal Java Matrix Package (UJMP) is a data processing tool for Java. Unlike JAMA and Colt, it supports multi-threading and is therefore much faster on current hardware. It does not only support matrices with double values, but instead handles every type of data as a matrix through a common interface, e.g. CSV files, Excel files, images, WAVE audio files, tables in SQL data bases, and much more.

Changes:

Updated to version 0.3.0


Logo JMLR SHOGUN 4.0.0

by sonne - February 5, 2015, 09:09:37 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 215766 views, 34940 downloads, 0 subscriptions

Rating Whole StarWhole StarWhole StarEmpty StarEmpty Star
(based on 6 votes)

About: The SHOGUN machine learning toolbox's focus is on large scale learning methods with focus on Support Vector Machines (SVM), providing interfaces to python, octave, matlab, r and the command line.

Changes:

This release features the work of our 8 GSoC 2014 students [student; mentors]:

  • OpenCV Integration and Computer Vision Applications [Abhijeet Kislay; Kevin Hughes]
  • Large-Scale Multi-Label Classification [Abinash Panda; Thoralf Klein]
  • Large-scale structured prediction with approximate inference [Jiaolong Xu; Shell Hu]
  • Essential Deep Learning Modules [Khaled Nasr; Sergey Lisitsyn, Theofanis Karaletsos]
  • Fundamental Machine Learning: decision trees, kernel density estimation [Parijat Mazumdar ; Fernando Iglesias]
  • Shogun Missionary & Shogun in Education [Saurabh Mahindre; Heiko Strathmann]
  • Testing and Measuring Variable Interactions With Kernels [Soumyajit De; Dino Sejdinovic, Heiko Strathmann]
  • Variational Learning for Gaussian Processes [Wu Lin; Heiko Strathmann, Emtiyaz Khan]

It also contains several cleanups and bugfixes:

Features

  • New Shogun project description [Heiko Strathmann]
  • ID3 algorithm for decision tree learning [Parijat Mazumdar]
  • New modes for PCA matrix factorizations: SVD & EVD, in-place or reallocating [Parijat Mazumdar]
  • Add Neural Networks with linear, logistic and softmax neurons [Khaled Nasr]
  • Add kernel multiclass strategy examples in multiclass notebook [Saurabh Mahindre]
  • Add decision trees notebook containing examples for ID3 algorithm [Parijat Mazumdar]
  • Add sudoku recognizer ipython notebook [Alejandro Hernandez]
  • Add in-place subsets on features, labels, and custom kernels [Heiko Strathmann]
  • Add Principal Component Analysis notebook [Abhijeet Kislay]
  • Add Multiple Kernel Learning notebook [Saurabh Mahindre]
  • Add Multi-Label classes to enable Multi-Label classification [Thoralf Klein]
  • Add rectified linear neurons, dropout and max-norm regularization to neural networks [Khaled Nasr]
  • Add C4.5 algorithm for multiclass classification using decision trees [Parijat Mazumdar]
  • Add support for arbitrary acyclic graph-structured neural networks [Khaled Nasr]
  • Add CART algorithm for classification and regression using decision trees [Parijat Mazumdar]
  • Add CHAID algorithm for multiclass classification and regression using decision trees [Parijat Mazumdar]
  • Add Convolutional Neural Networks [Khaled Nasr]
  • Add Random Forests algorithm for ensemble learning using CART [Parijat Mazumdar]
  • Add Restricted Botlzmann Machines [Khaled Nasr]
  • Add Stochastic Gradient Boosting algorithm for ensemble learning [Parijat Mazumdar]
  • Add Deep contractive and denoising autoencoders [Khaled Nasr]
  • Add Deep belief networks [Khaled Nasr]

Bugfixes

  • Fix reference counting bugs in CList when reference counting is on [Heiko Strathmann, Thoralf Klein, lambday]
  • Fix memory problem in PCA::apply_to_feature_matrix [Parijat Mazumdar]
  • Fix crash in LeastAngleRegression for the case D greater than N [Parijat Mazumdar]
  • Fix memory violations in bundle method solvers [Thoralf Klein]
  • Fix fail in library_mldatahdf5.cpp example when http://mldata.org is not working properly [Parijat Mazumdar]
  • Fix memory leaks in Vowpal Wabbit, LibSVMFile and KernelPCA [Thoralf Klein]
  • Fix memory and control flow issues discovered by Coverity [Thoralf Klein]
  • Fix R modular interface SWIG typemap (Requires SWIG >= 2.0.5) [Matt Huska]

Cleanup and API Changes

  • PCA now depends on Eigen3 instead of LAPACK [Parijat Mazumdar]
  • Removing redundant and fixing implicit imports [Thoralf Klein]
  • Hide many methods from SWIG, reducing compile memory by 500MiB [Heiko Strathmann, Fernando Iglesias, Thoralf Klein]

Logo UniverSVM 1.22

by fabee - October 16, 2012, 11:24:12 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 42410 views, 6743 downloads, 0 subscriptions

About: The UniverSVM is a SVM implementation written in C/C++. Its functionality comprises large scale transduction via CCCP optimization, sparse solutions via CCCP optimization and data-dependent [...]

Changes:

Minor changes: fix bug on set_alphas_b0 function (thanks to Ferdinand Kaiser - [email protected])


Logo Linear SVM with general regularization 1.0

by rflamary - October 5, 2012, 15:34:21 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 12161 views, 3175 downloads, 0 subscriptions

About: This package is an implementation of a linear svm solver with a wide class of regularizations on the svm weight vector (l1, l2, mixed norm l1-lq, adaptive lasso). We provide solvers for the classical single task svm problem and for multi-task with joint feature selection or similarity promoting term.

Changes:

Initial Announcement on mloss.org.


Logo SGD 2.0

by leonbottou - October 11, 2011, 20:59:41 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 25985 views, 4240 downloads, 0 subscriptions

Rating Whole StarWhole StarWhole StarWhole StarEmpty Star
(based on 2 votes)

About: The SGD-2.0 package contains implementations of the SGD and ASGD algorithms for linear SVMs and linear CRFs.

Changes:

Version 2.0 features ASGD.


Logo LASVM 1.1

by leonbottou - August 3, 2009, 15:50:30 CET [ Project Homepage BibTeX Download ] 20540 views, 4121 downloads, 0 subscriptions

About: Reference implementation of the LASVM online and active SVM algorithms as described in the JMLR paper. The interesting bit is a small C library that implements the LASVM process and reprocess [...]

Changes:

Minor bug fix


Logo LibSGDQN 1.1

by antojne - July 2, 2009, 15:02:44 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 18570 views, 3781 downloads, 0 subscriptions

About: LibSGDQN proposes an implementation of SGD-QN, a carefully designed quasi-Newton stochastic gradient descent solver for linear SVMs.

Changes:

small bug fix (thx nicolas ;)


Logo Aleph 0.6

by jiria - January 12, 2009, 20:52:12 CET [ Project Homepage BibTeX Download ] 18010 views, 4419 downloads, 0 subscriptions

About: Aleph is both a multi-platform machine learning framework aimed at simplicity and performance, and a library of selected state-of-the-art algorithms.

Changes:

Initial Announcement on mloss.org.


Logo Disco 0.1

by tuulos - October 6, 2008, 11:14:48 CET [ Project Homepage BibTeX Download ] 14879 views, 2999 downloads, 0 subscriptions

About: Disco is an open-source implementation of the [Map-Reduce framework](http://en.wikipedia.org/wiki/MapReduce) for distributed computing. As the original framework, Disco supports parallel [...]

Changes:

Initial Announcement on mloss.org.


Logo Sleipnir 1.0

by chuttenh - June 30, 2008, 03:22:19 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 13501 views, 3054 downloads, 0 subscriptions

About: The Sleipnir C++ library implements a variety of machine learning and data manipulation algorithms focusing on heterogeneous data integration and efficiency for large biological data collections.

Changes:

Initial Announcement on mloss.org.


Logo mSplicer 0.3

by sonne - May 18, 2008, 13:07:40 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 13285 views, 2763 downloads, 0 subscriptions

Rating Whole StarWhole StarWhole StarWhole StarEmpty Star
(based on 2 votes)

About: For modern biology, precise genome annotations are of prime importance as they allow the accurate definition of genic regions. We employ state of the art machine learning methods to assay and [...]

Changes:

Initial Announcement on mloss.org.


Logo GPDT Gradient Projection Decomposition Technique 1.01

by sezaza - December 21, 2007, 20:10:43 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 17063 views, 3837 downloads, 0 subscriptions

Rating Whole StarWhole StarWhole StarWhole StarEmpty Star
(based on 1 vote)

About: This is a C++ software designed to train large-scale SVMs for binary classification. The algorithm is also implemented in parallel (**PGPDT**) for distributed memory, strictly coupled multiprocessor [...]

Changes:

Initial Announcement on mloss.org.


Logo RapidMiner 4.0

by ingomierswa - November 16, 2007, 02:31:48 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 29177 views, 5458 downloads, 0 comments, 0 subscriptions

Rating Whole StarWhole StarWhole StarWhole StarWhole Star
(based on 5 votes)

About: RapidMiner (formerly YALE) is one of the most widely used open-source data mining suites and software solutions due to its leading-edge technologies and its functional range. Applications of [...]

Changes:

Initial Announcement on mloss.org.