9.1.8.1. sklearn.svm.sparse.SVC

class sklearn.svm.sparse.SVC(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001)

SVC for sparse matrices (csr).

See sklearn.svm.SVC for a complete list of parameters.

Notes

For best results, pass a matrix in CSR format (scipy.sparse.csr_matrix). Any other array-like object, including other sparse representations, should be converted automatically.
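The conversion mentioned in this note can also be done explicitly before fitting. The following is a minimal sketch, assuming `numpy` and `scipy` are installed; the variable names are illustrative and not part of this class.

```python
import numpy as np
import scipy.sparse as sp

# Dense training data (same values as in the Examples section).
X_dense = np.array([[-1, -1], [-2, -1], [1, 1], [2, 1]], dtype=np.float64)

# Dense array -> CSR.
X_csr = sp.csr_matrix(X_dense)

# Another sparse representation (LIL here) -> CSR.
X_lil = sp.lil_matrix(X_dense)
X_from_lil = X_lil.tocsr()

print(X_csr.format, X_csr.shape)
```

Either `X_csr` or `X_from_lil` could then be passed as `X` to `fit`.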

Examples

>>> import numpy as np
>>> X = np.array([[-1, -1], [-2, -1], [1, 1], [2, 1]])
>>> y = np.array([1, 1, 2, 2])
>>> from sklearn.svm.sparse import SVC
>>> clf = SVC()
>>> clf.fit(X, y)
SVC(C=1.0, coef0=0.0, degree=3, gamma=0.5, kernel='rbf', probability=False,
  shrinking=True, tol=0.001)
>>> print(clf.predict([[-0.8, -1]]))
[ 1.]

Methods

`decision_function`(X)	Distance of the samples X to the separating hyperplane.
`fit`(X, y[, class_weight, sample_weight, ...])	Fit the SVM model according to the given training data and parameters.
`predict`(T)	Perform classification or regression on an array of test vectors T.
`predict_log_proba`(T)	Compute the log likelihoods of each possible outcome for the samples in T.
`predict_proba`(X)	Compute the probability of each possible outcome for the samples in X.
`score`(X, y)	Returns the mean accuracy on the given test data and labels.
`set_params`(**params)	Set the parameters of the estimator.

__init__(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001)

decision_function(X)

Distance of the samples X to the separating hyperplane.

Parameters :

X : array-like, shape = [n_samples, n_features]

Returns :

X : array-like, shape = [n_samples, n_class * (n_class-1) / 2]

Returns the decision function of the sample for each class in the model.
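The second dimension of the returned array, n_class * (n_class-1) / 2, is the number of unordered pairs of classes, which is what a one-vs-one scheme produces. The sketch below only illustrates that count; `one_vs_one_pairs` is a hypothetical helper, and the pair ordering it shows is an assumption, not something this page specifies.

```python
from itertools import combinations

def one_vs_one_pairs(classes):
    """Hypothetical helper: list the class pairs of a one-vs-one scheme."""
    return list(combinations(sorted(classes), 2))

n_class = 4
pairs = one_vs_one_pairs([1, 2, 3, 4])

# One decision value per pair of classes.
print(len(pairs), pairs)
```

With four classes this gives six pairs, so the returned array would have six columns.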

fit(X, y, class_weight=None, sample_weight=[], cache_size=100.0)

Fit the SVM model according to the given training data and parameters.

Parameters :

X : sparse matrix, shape = [n_samples, n_features]

Training vectors, where n_samples is the number of samples and n_features is the number of features.

y : array-like, shape = [n_samples]

Target values (integers in classification, real numbers in regression)

class_weight : {dict, ‘auto’}, optional

Weights associated with classes in the form {class_label : weight}. If not given, all classes are assumed to have weight one.

The ‘auto’ mode uses the values of y to automatically adjust weights inversely proportional to class frequencies.

sample_weight : array-like, shape = [n_samples], optional

Weights applied to individual samples (1. for unweighted).

Returns :

self : object

Returns an instance of self.

Notes

For maximum efficiency, use a sparse matrix in csr format (scipy.sparse.csr_matrix).
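To make the ‘auto’ mode of class_weight more concrete, here is a minimal sketch of weights that are inversely proportional to class frequencies. The helper `inverse_frequency_weights` and the normalization n_samples / (n_classes * count) are assumptions chosen for illustration; this page does not state the exact scaling the library uses.

```python
from collections import Counter

def inverse_frequency_weights(y):
    """Hypothetical helper: weight each class by n_samples / (n_classes * count)."""
    counts = Counter(y)
    n_samples = len(y)
    n_classes = len(counts)
    return {label: float(n_samples) / (n_classes * count)
            for label, count in counts.items()}

# Class 1 appears three times, class 2 only once.
y = [1, 1, 1, 2]
weights = inverse_frequency_weights(y)
print(weights)
```

The result has the {class_label : weight} form described for class_weight, with the rarer class receiving the larger weight.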

predict(T)

Perform classification or regression on an array of test vectors T.

For a classification model, the predicted class for each sample in T is returned. For a regression model, the computed function value for each sample in T is returned.

For a one-class model, +1 or -1 is returned.

Parameters :

T : scipy.sparse.csr, shape = [n_samples, n_features]

Returns :

C : array, shape = [n_samples]

predict_log_proba(T)

Compute the log likelihoods of each possible outcome for the samples in T.

The model needs to have probability information computed at training time: fit with the probability parameter set to True.

Parameters :

T : array-like, shape = [n_samples, n_features]

Returns :

T : array-like, shape = [n_samples, n_classes]

Returns the log-probabilities of the sample for each class in the model, where classes are ordered by arithmetical order.

Notes

The probability model is created using cross validation, so the results can be slightly different from those obtained by predict. Also, it will produce meaningless results on very small datasets.
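As a sketch of how log-probabilities relate to probabilities, assuming the natural logarithm is used: the values below are made up for illustration and are not output from this class.

```python
import math

# Made-up class probabilities for two samples and three classes.
proba = [[0.7, 0.2, 0.1],
         [0.1, 0.1, 0.8]]

# The element-wise natural logarithm gives the log-probabilities.
log_proba = [[math.log(p) for p in row] for row in proba]

# Exponentiating each row recovers probabilities that sum to 1.
row_sums = [sum(math.exp(lp) for lp in row) for row in log_proba]
print(row_sums)
```

Every log-probability is at most zero, and each row has one entry per class.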

predict_proba(X)

Compute the probability of each possible outcome for the samples in X, given a model with probability information.

Parameters :

X : scipy.sparse.csr, shape = [n_samples, n_features]

Returns :

X : array-like, shape = [n_samples, n_classes]

Returns the probability of the sample for each class in the model, where classes are ordered by arithmetical order.

Notes

The probability model is created using cross validation, so the results can be slightly different from those obtained by predict. Also, it will produce meaningless results on very small datasets.

score(X, y)

Returns the mean accuracy on the given test data and labels.

Parameters :

X : array-like, shape = [n_samples, n_features]

Test samples.

y : array-like, shape = [n_samples]

Labels for X.

Returns :

z : float
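The returned float can be illustrated with a small sketch, assuming score computes the fraction of samples whose predicted label matches y (the mean accuracy, that is, one minus the error rate). `mean_accuracy` is a hypothetical helper, not a function of this class.

```python
def mean_accuracy(y_true, y_pred):
    """Hypothetical helper: fraction of predictions equal to the true labels."""
    matches = [int(t == p) for t, p in zip(y_true, y_pred)]
    return sum(matches) / float(len(matches))

y_true = [1, 1, 2, 2]
y_pred = [1, 2, 2, 2]  # one of the four predictions is wrong

z = mean_accuracy(y_true, y_pred)
print(z)  # 0.75
```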

set_params(**params)

Set the parameters of the estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Returns :	self :
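The <component>__<parameter> naming can be sketched as a split on the first double underscore. `split_param_name` is a hypothetical helper written for illustration, and `svc` and `pipe` are made-up component names; this is not the library's own implementation.

```python
def split_param_name(name):
    """Hypothetical helper: split '<component>__<parameter>' at the first '__'."""
    component, sep, parameter = name.partition("__")
    if not sep:
        # No double underscore: the parameter belongs to the estimator itself.
        return None, name
    return component, parameter

print(split_param_name("C"))       # (None, 'C')
print(split_param_name("svc__C"))  # ('svc', 'C')
```

For a doubly nested name such as `pipe__svc__C`, the remainder `svc__C` would be handed on to the `pipe` component, which can split it again.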
