CovtypeNorm#
- class capymoa.datasets.CovtypeNorm[source]#
Bases:
DownloadARFFGzip
A normalized version of the classic
Covtype
classification problem.Number of instances: 581,012 (30m^2 cells)
Number of attributes: 54 (10 continuous, 44 categorical)
Number of classes: 7 (forest cover types)
Forest Covertype (or simply covtype) contains the forest cover type for 30 x 30 meter cells obtained from US Forest Service (USFS) Region 2 Resource Information System (RIS) data.
References:
Blackard,Jock. (1998). Covertype. UCI Machine Learning Repository. https://doi.org/10.24432/C50K5N.
See Also:
CovtFD
- Covtype with simulated feature driftsCovtype
- The classic covertype datasetCovtypeTiny
- A truncated version of the classic covertype dataset
- __init__(
- directory: str = PosixPath('data'),
- auto_download: bool = True,
- CLI: str | None = None,
- schema: str | None = None,
- download(working_directory: Path) Path [source]#
Download the dataset and return the path to the downloaded dataset within the working directory.
- Parameters:
working_directory – The directory to download the dataset to.
- Returns:
The path to the downloaded dataset within the working directory.
- extract(stream_archive: Path) Path [source]#
Extract the dataset from the archive and return the path to the extracted dataset.
- Parameters:
stream_archive – The path to the archive containing the dataset.
- Returns:
The path to the extracted dataset.
- next_instance() LabeledInstance | RegressionInstance [source]#
Return the next instance in the stream.
- Raises:
ValueError – If the machine learning task is neither a regression nor a classification task.
- Returns:
A labeled instances or a regression depending on the schema.