This documentation is for scikit-learn version 0.11-git.

8.4.1.14. sklearn.datasets.fetch_olivetti_faces

sklearn.datasets.fetch_olivetti_faces(data_home=None, shuffle=False, random_state=0, download_if_missing=True)

Loader for the Olivetti faces dataset from AT&T.
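A minimal usage sketch, assuming scikit-learn is installed and that the returned object exposes a `target` attribute holding one person id per image (an assumption, since the return value is not described on this page). The helper names `load_olivetti`, `count_people`, and `main` are hypothetical and exist only for illustration.

```python
def load_olivetti(data_home=None, allow_download=True):
    """Fetch the Olivetti faces, shuffled with a fixed seed."""
    # Imported lazily so the other helpers work without scikit-learn.
    from sklearn.datasets import fetch_olivetti_faces

    return fetch_olivetti_faces(
        data_home=data_home,                 # None selects the default cache folder
        shuffle=True,                        # avoid images of one person being grouped
        random_state=0,                      # fixed seed for a repeatable order
        download_if_missing=allow_download,  # False raises IOError when not cached
    )


def count_people(targets):
    """Return (number of images, number of distinct person ids)."""
    ids = [int(t) for t in targets]
    return len(ids), len(set(ids))


def main():
    # Try the local cache only; report instead of downloading when it is empty.
    try:
        faces = load_olivetti(allow_download=False)
    except IOError:
        print("Olivetti faces are not cached locally; allow the download to fetch them.")
        return
    # Assumption: `faces.target` holds one person id per image.
    print(count_people(faces.target))
```

Calling `main()` with the data already cached should report 400 images of 40 people, if the description in the Notes holds for the retrieved version.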

Parameters :

data_home : optional, default: None

Specify another download and cache folder for the datasets. By default all scikit-learn data is stored in subfolders of ‘~/scikit_learn_data’.

shuffle : boolean, optional, default: False

If True, the order of the dataset is shuffled so that images of the same person are not grouped together.

download_if_missing : optional, default: True

If False, raise an IOError if the data is not locally available instead of trying to download the data from the source site.

random_state : optional, integer or RandomState object, default: 0

The seed or the random number generator used to shuffle the data.
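The interaction between `shuffle` and `random_state` can be sketched with the standard library alone. This is an illustration under stated assumptions, not scikit-learn's implementation: it assumes the unshuffled order groups each person's images together (as the `shuffle` description implies), it uses Python's `random` module as a stand-in for the library's own random number generator, and the names `grouped_labels` and `shuffled_order` are hypothetical. The permutation it produces will not match the one scikit-learn generates for the same seed.

```python
import random

N_PEOPLE = 40           # individuals in the dataset, per the Notes
IMAGES_PER_PERSON = 10  # pictures of each individual, per the Notes


def grouped_labels():
    """Person ids in grouped order: ten 0s, then ten 1s, and so on."""
    return [person for person in range(N_PEOPLE) for _ in range(IMAGES_PER_PERSON)]


def shuffled_order(n_samples, seed=0):
    """Return a permutation of range(n_samples) that is fixed by the seed."""
    order = list(range(n_samples))
    random.Random(seed).shuffle(order)  # seeded generator gives a repeatable shuffle
    return order


labels = grouped_labels()
order = shuffled_order(len(labels), seed=0)

# Apply the same permutation to the labels (and, in practice, to the images)
# so that each image stays paired with its person id.
shuffled = [labels[i] for i in order]
```

Reusing the same seed reproduces the same order, which is the point of passing a fixed `random_state`; a different seed gives a different arrangement of the same 400 samples.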

Notes

This dataset consists of 10 pictures of each of 40 individuals. The original database was available from a site that is now defunct.

The version retrieved here comes in MATLAB format from the personal web page of Sam Roweis.