9.2.6. sklearn.linear_model.ElasticNet¶
- class sklearn.linear_model.ElasticNet(alpha=1.0, rho=0.5, fit_intercept=True, normalize=False, precompute='auto', max_iter=1000, overwrite_X=False, tol=0.0001)¶
Linear Model trained with L1 and L2 prior as regularizer
rho=1 is the lasso penalty. Currently, rho <= 0.01 is not reliable, unless you supply your own sequence of alpha.
Parameters : alpha : float
Constant that multiplies the penalty terms. Defaults to 1.0 See the notes for the exact mathematical meaning of this parameter
rho : float
The ElasticNet mixing parameter, with 0 < rho <= 1. For rho = 0 the penalty is an L1 penalty. For rho = 1 it is an L2 penalty. For 0 < rho < 1, the penalty is a combination of L1 and L2
fit_intercept: bool :
Whether the intercept should be estimated or not. If False, the data is assumed to be already centered.
normalize : boolean, optional
If True, the regressors X are normalized
precompute : True | False | ‘auto’ | array-like
Whether to use a precomputed Gram matrix to speed up calculations. If set to ‘auto’ let us decide. The Gram matrix can also be passed as argument.
max_iter: int, optional :
The maximum number of iterations
overwrite_X : boolean, optional
If True, X will not be copied Default is False
tol: float, optional :
The tolerance for the optimization: if the updates are smaller than ‘tol’, the optimization code checks the dual gap for optimality and continues until it is smaller than tol.
Notes
To avoid unnecessary memory duplication the X argument of the fit method should be directly passed as a fortran contiguous numpy array.
The parameter rho corresponds to alpha in the glmnet R package while alpha corresponds to the lambda parameter in glmnet. More specifically, the penalty is:
alpha*rho*L1 + alpha*(1-rho)*L2
If you are interested in controlling the L1 and L2 penalty separately, keep in mind that this is equivalent to:
a*L1 + b*L2
for:
alpha = a + b and rho = a/(a+b)
Methods
fit(X, y[, Xy, coef_init]) Fit Elastic Net model with coordinate descent predict(X) Predict using the linear model score(X, y) Returns the coefficient of determination of the prediction set_params(**params) Set the parameters of the estimator. - __init__(alpha=1.0, rho=0.5, fit_intercept=True, normalize=False, precompute='auto', max_iter=1000, overwrite_X=False, tol=0.0001)¶
- fit(X, y, Xy=None, coef_init=None)¶
Fit Elastic Net model with coordinate descent
Parameters : X: ndarray, (n_samples, n_features) :
Data
y: ndarray, (n_samples) :
Target
Xy : array-like, optional
Xy = np.dot(X.T, y) that can be precomputed. It is useful only when the Gram matrix is precomuted.
coef_init: ndarray of shape n_features :
The initial coeffients to warm-start the optimization
Notes
Coordinate descent is an algorithm that considers each column of data at a time hence it will automatically convert the X input as a fortran contiguous numpy array if necessary.
To avoid memory re-allocation it is advised to allocate the initial data in memory directly using that format.
- predict(X)¶
Predict using the linear model
Parameters : X : numpy array of shape [n_samples, n_features]
Returns : C : array, shape = [n_samples]
Returns predicted values.
- score(X, y)¶
Returns the coefficient of determination of the prediction
Parameters : X : array-like, shape = [n_samples, n_features]
Training set.
y : array-like, shape = [n_samples]
Returns : z : float
- set_params(**params)¶
Set the parameters of the estimator.
The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.
Returns : self :