9.1.8.1. sklearn.svm.sparse.SVC¶
- class sklearn.svm.sparse.SVC(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001)¶
SVC for sparse matrices (csr).
See sklearn.svm.SVC for a complete list of parameters
Notes
For best results, this accepts a matrix in csr format (scipy.sparse.csr), but should be able to convert from any array-like object (including other sparse representations).
Examples
>>> import numpy as np >>> X = np.array([[-1, -1], [-2, -1], [1, 1], [2, 1]]) >>> y = np.array([1, 1, 2, 2]) >>> from sklearn.svm.sparse import SVC >>> clf = SVC() >>> clf.fit(X, y) SVC(C=1.0, coef0=0.0, degree=3, gamma=0.5, kernel='rbf', probability=False, shrinking=True, tol=0.001) >>> print clf.predict([[-0.8, -1]]) [ 1.]
Methods
decision_function(X) Distance of the samples T to the separating hyperplane. fit(X, y[, class_weight, sample_weight, ...]) Fit the SVM model according to the given training data and predict(T) This function does classification or regression on an array of predict_log_proba(T) Compute the log likehoods each possible outcomes of samples in T. predict_proba(X) This function does classification or regression on a test vector X score(X, y) Returns the mean error rate on the given test data and labels. set_params(**params) Set the parameters of the estimator. - __init__(C=1.0, kernel='rbf', degree=3, gamma=0.0, coef0=0.0, shrinking=True, probability=False, tol=0.001)¶
- decision_function(X)¶
Distance of the samples T to the separating hyperplane.
Parameters : X : array-like, shape = [n_samples, n_features]
Returns : X : array-like, shape = [n_samples, n_class * (n_class-1) / 2]
Returns the decision function of the sample for each class in the model.
- fit(X, y, class_weight=None, sample_weight=[], cache_size=100.0)¶
Fit the SVM model according to the given training data and parameters.
Parameters : X : sparse matrix, shape = [n_samples, n_features]
Training vectors, where n_samples is the number of samples and n_features is the number of features.
y : array-like, shape = [n_samples]
Target values (integers in classification, real numbers in regression)
class_weight : {dict, ‘auto’}, optional
Weights associated with classes in the form {class_label : weight}. If not given, all classes are supposed to have weight one.
The ‘auto’ mode uses the values of y to automatically adjust weights inversely proportional to class frequencies.
sample_weight : array-like, shape = [n_samples], optional
Weights applied to individual samples (1. for unweighted).
Returns : self : object
Returns an instance of self.
Notes
For maximum effiency, use a sparse matrix in csr format (scipy.sparse.csr_matrix)
- predict(T)¶
This function does classification or regression on an array of test vectors T.
For a classification model, the predicted class for each sample in T is returned. For a regression model, the function value of T calculated is returned.
For an one-class model, +1 or -1 is returned.
Parameters : T : scipy.sparse.csr, shape = [n_samples, n_features] Returns : C : array, shape = [n_samples]
- predict_log_proba(T)¶
Compute the log likehoods each possible outcomes of samples in T.
The model need to have probability information computed at training time: fit with attribute probability set to True.
Parameters : T : array-like, shape = [n_samples, n_features]
Returns : T : array-like, shape = [n_samples, n_classes]
Returns the log-probabilities of the sample for each class in the model, where classes are ordered by arithmetical order.
Notes
The probability model is created using cross validation, so the results can be slightly different than those obtained by predict. Also, it will meaningless results on very small datasets.
- predict_proba(X)¶
This function does classification or regression on a test vector X given a model with probability information.
Parameters : X : scipy.sparse.csr, shape = [n_samples, n_features]
Returns : X : array-like, shape = [n_samples, n_classes]
Returns the probability of the sample for each class in the model, where classes are ordered by arithmetical order.
Notes
The probability model is created using cross validation, so the results can be slightly different than those obtained by predict. Also, it will meaningless results on very small datasets.
- score(X, y)¶
Returns the mean error rate on the given test data and labels.
Parameters : X : array-like, shape = [n_samples, n_features]
Training set.
y : array-like, shape = [n_samples]
Labels for X.
Returns : z : float
- set_params(**params)¶
Set the parameters of the estimator.
The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.
Returns : self :