breastw {dprep}R Documentation

The Breast Wisconsin dataset

Description

This is the Breast Wisconsin dataset from the UCI Machine Learning Repository. This dataset has 699 instances, sixteen of them with missing values, 9 predictor attributes and one class attribute assuming values 1(benign tumor) and 2(malign tumor).

Usage

data(breastw)

Format

A data frame with 699 observations on the following 10 variables.

V1

Clump Thickness

V2

Uniformity of Cell Size

V3

Uniformity of Cell Shape

V4

Marginal Adhesion

V5

Single Epithelial Cell Size

V6

Bare Nuclei

V7

Bland Chromatin

V8

Normal Nucleoli

V9

Mitoses

V10

Class: 1 for benign, 2 for Malign

Details

All the features assume values in the range 1-10. The dataset contains 699 observations with 16 of them having missing values. It is recommended to impute these values been before further analysis.

Source

The UCI Machine Learning Database Repository at:

Examples

#Detecting outliers in class-1 using the LOF algorithms---
data(breastw)
ce.impute(breastw,"mean",1:9)

[Package dprep version 3.1.1 Index]