This documentation is for scikit-learn version 0.11-gitOther versions

Citing

If you use the software, please consider citing scikit-learn.

This page

8.24.2.1. sklearn.svm.sparse.SVC

class sklearn.svm.sparse.SVC(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001, cache_size=200, scale_C=False)

SVC for sparse matrices (csr).

See sklearn.svm.SVC for a complete list of parameters

Notes

For best results, this accepts a matrix in csr format (scipy.sparse.csr), but should be able to convert from any array-like object (including other sparse representations).

Examples

>>> import numpy as np
>>> X = np.array([[-1, -1], [-2, -1], [1, 1], [2, 1]])
>>> y = np.array([1, 1, 2, 2])
>>> from sklearn.svm.sparse import SVC
>>> clf = SVC()
>>> clf.fit(X, y)
SVC(C=1.0, cache_size=200, coef0=0.0, degree=3, gamma=0.5, kernel='rbf',
  probability=False, scale_C=False, shrinking=True, tol=0.001)
>>> print clf.predict([[-0.8, -1]])
[ 1.]

Methods

fit(X, y[, class_weight, sample_weight]) Fit the SVM model according to the given training data and parameters.
predict(T) This function does classification or regression on an array of
predict_log_proba(T) Compute the log likehoods each possible outcomes of samples in T.
predict_proba(X) This function does classification or regression on a test vector X
score(X, y) Returns the mean accuracy on the given test data and labels.
set_params(**params) Set the parameters of the estimator.
__init__(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001, cache_size=200, scale_C=False)
fit(X, y, class_weight=None, sample_weight=None)

Fit the SVM model according to the given training data and parameters.

Parameters :

X : sparse matrix, shape = [n_samples, n_features]

Training vectors, where n_samples is the number of samples and n_features is the number of features.

y : array-like, shape = [n_samples]

Target values (integers in classification, real numbers in regression)

class_weight : {dict, ‘auto’}, optional

Weights associated with classes in the form {class_label : weight}. If not given, all classes are supposed to have weight one.

The ‘auto’ mode uses the values of y to automatically adjust weights inversely proportional to class frequencies.

sample_weight : array-like, shape = [n_samples], optional

Weights applied to individual samples (1. for unweighted).

Returns :

self : object

Returns an instance of self.

Notes

For maximum effiency, use a sparse matrix in csr format (scipy.sparse.csr_matrix)

predict(T)

This function does classification or regression on an array of test vectors T.

For a classification model, the predicted class for each sample in T is returned. For a regression model, the function value of T calculated is returned.

For an one-class model, +1 or -1 is returned.

Parameters :T : scipy.sparse.csr, shape = [n_samples, n_features]
Returns :C : array, shape = [n_samples]
predict_log_proba(T)

Compute the log likehoods each possible outcomes of samples in T.

The model need to have probability information computed at training time: fit with attribute probability set to True.

Parameters :

T : array-like, shape = [n_samples, n_features]

Returns :

T : array-like, shape = [n_samples, n_classes]

Returns the log-probabilities of the sample for each class in the model, where classes are ordered by arithmetical order.

Notes

The probability model is created using cross validation, so the results can be slightly different than those obtained by predict. Also, it will meaningless results on very small datasets.

predict_proba(X)

This function does classification or regression on a test vector X given a model with probability information.

Parameters :

X : scipy.sparse.csr, shape = [n_samples, n_features]

Returns :

X : array-like, shape = [n_samples, n_classes]

Returns the probability of the sample for each class in the model, where classes are ordered by arithmetical order.

Notes

The probability model is created using cross validation, so the results can be slightly different than those obtained by predict. Also, it will meaningless results on very small datasets.

score(X, y)

Returns the mean accuracy on the given test data and labels.

Parameters :

X : array-like, shape = [n_samples, n_features]

Training set.

y : array-like, shape = [n_samples]

Labels for X.

Returns :

z : float

set_params(**params)

Set the parameters of the estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Returns :self :