Machine Learning (11) - 关于 Decision Tree 的小练习

题目

以泰坦尼克号的人员名单为草料，从多个维度值决定一个人是否会遇难。目前我们手上有一份真实的泰坦尼克号的所有人员名单，其中有一列是 Survived, 也就是此人是否遇难。现在的要求是，根据 Pclass，Fare，Age，Sex 列的值来训练模型。

正文

引入数据

import pandas as pd

df = pd.read_csv('/Users/rachel/Sites/pandas/py/ML/9_decision_tree/Exercise/titanic.csv')
df.head()

输出

Machine Learning (11) - 关于 Decision Tree 的小练习

去掉对是否生还结果没有影响的字段

df = df.drop(['PassengerId', 'Name', 'SibSp', 'Parch', 'Ticket','Cabin','Embarked'], axis = 'columns')
df.head()

输出：

Machine Learning (11) - 关于 Decision Tree 的小练习

把非数字列的值转为数字

from sklearn.preprocessing import LabelEncoder
le_sex = LabelEncoder()
df.Sex = le_sex.fit_transform(df.Sex)
df.head()

输出：

Machine Learning (11) - 关于 Decision Tree 的小练习

取出用于训练的 X 列

input = df.drop('Survived', axis = 'columns')
input[:10]

输出：

Machine Learning (11) - 关于 Decision Tree 的小练习

用平均值填充 NAN

input = input.fillna(input.Age.median())
input[:10]

输出：

Machine Learning (11) - 关于 Decision Tree 的小练习

划分训练数据和测试数据

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(input, target, test_size = 0.2)

训练模型

from sklearn import tree
model = tree.DecisionTreeClassifier()
model.fit(X_train, y_train)

model.score(X_test, y_test) // 输出: 0.7821229050279329

本作品采用《CC 协议》，转载必须注明作者和本文链接

Rachel

金牌译者 610 声望

暂无个人描述~

0 人点赞

推荐文章：

更多推荐...

置顶

[进度 100.00%] Python Masonite 4.0 中文翻译召集（Python 中的类 Laravel 框架） 15 / 20 |

公告

Python Masonite 框架中文翻译召集（Python 中的类 Laravel 框架） 24 / 25 |

博客

收集了一些各大网站 python 的登陆方式,希望对学习 python 的小白，和想写爬虫的你们有所帮助,,本项目用于研究和分享各大网站的模拟登陆方式 17 / 5 |

翻译

Python 3.7 的一些新特性 10 / 2 |

链接

快速掌握一个语言最常用的 50% 11 / 1 |

翻译

使用 Python 一步步搭建自己的区块链 22 / 1 |

讨论数量: 0

(=￣ω￣=)··· 暂无内容！

讨论应以学习和精进为目的。请勿发布不友善或者负能量的内容，与人为善，比聪明更重要！

帮助

未填写

私信

所有博文

文章归档

6年前酷帅吊炸天的 Pandas 常用操作命令汇总 7年前 Machine Learning（14） - K Fold Cross Validation 7年前 Machine Learning（16） - 关于 K Means Clustering 的练习题 7年前 Machine Learning（15） - K Means Clustering 7年前 Machine Learning（13）- Random Forest

64 整理 PC 端微信扫码支付全过程 --- easywechat + Laravel 5.8 55 基于 Laravel 和 Redis 的点赞功能设计 30 Laravel + jscroll 真的只要 10 行代码就可以实现无限加载 11 从 simplemde 写入 + inline-attachment 图片拖拽上传到 parsedown 解析 9 整理小程序登录状态维护笔记

博客标签

redis

laravel

markdown

parsedown

easywechat

SimpleMDE

rand

jscroll

小程序

Vue.js

Pandas

数据分析

微信扫码支付

无限加载

富文本编辑器

inline-attachment

登录状态维护

爬虫

scrapy

成为赞助商

Machine Learning (11) - 关于 Decision Tree 的小练习

题目

正文

引入数据

去掉对是否生还结果没有影响的字段

把非数字列的值转为数字

取出用于训练的 X 列

用平均值填充 NAN

划分训练数据和测试数据

训练模型

推荐文章：

社区赞助商

关于 LearnKu

资源推荐

服务提供商

其他信息

Machine Learning (11) - 关于 Decision Tree 的小练习

题目

正文

引入数据

去掉对是否生还结果没有影响的字段

把非数字列的值转为数字

取出用于训练的 X 列

用平均值填充 NAN

划分训练数据和测试数据

训练模型

推荐文章：

社区赞助商

关于 LearnKu

资源推荐

服务提供商

其他信息

请登录