distinguish wild mushrooms with decision tree
DESCRIPTION
Distinguish Wild Mushrooms with Decision Tree. Shiqin Yan. Objective. Utilize the already existed database of the mushrooms to build a decision tree to assist the process of determine the whether the mushroom is poisonous . DataSet. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/1.jpg)
Distinguish Wild Mushrooms with Decision Tree
Shiqin Yan
![Page 2: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/2.jpg)
Objective Utilize the already existed database of the
mushrooms to build a decision tree to assist the process of determine the whether the mushroom is poisonous.
![Page 3: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/3.jpg)
DataSet Existing record drawn from the Audubon
Society Field Guide to North American Mushrooms (1981) . G. H. Lincoff (Pres. ), NewYork: Alfred A. Knopf
Number of Instances: 8124 (classified as either edible or poisonous)
Number of Attributes: 22 Training: 5416, Tuning: 1354, Testing: 1354 Missing attribute values: 2480 (denoted by
“?”), all for attribute 11
![Page 4: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/4.jpg)
Mushroom Features 1. cap-shape: bell=b, conical=c, convex=x,
flat=f, knobbed=k, sunken = s 2. cap-surface: fibrous=f, grooves=g,
scaly=y, smooth=s 3. cap-color: brown=n, buff=b, cinnamon=c,
gray=g, green=r, pink=p, purple=u, red=e, white=w, yellow=y
4. bruise?: bruises=t, no=f 5. odor: almond=a, anise=l, creosote=c,
fishy=y, foul=f …
![Page 5: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/5.jpg)
![Page 6: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/6.jpg)
Approach Mutual information to determine the features
used to split the tree.
Mutual information: Y: label, X: feature Choose feature X which maximizes I(Y;X)
![Page 7: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/7.jpg)
![Page 8: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/8.jpg)
Most informative features extracted from decision tree: odor spore-print-color habitat population
![Page 9: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/9.jpg)
Prior Research
by Wlodzislaw Duch, Department of Computer Methods, Nicholas Copernicus University
![Page 10: Distinguish Wild Mushrooms with Decision Tree](https://reader036.vdocuments.us/reader036/viewer/2022062305/568165a0550346895dd87a74/html5/thumbnails/10.jpg)
Add cross-validation to improve the accuracy
Prune the tree to avoid over-fitting
Future