From 8b47d373aaa0b9a1e11ab5afcdd1cd03a8153aa3 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?=E7=89=87=E5=88=BB?= Date: Sun, 25 Aug 2019 16:52:31 +0800 Subject: [PATCH] =?UTF-8?q?=E4=BF=AE=E6=94=B9=E8=BF=87=E6=9C=9F=E5=9B=BE?= =?UTF-8?q?=E7=89=87=E9=93=BE=E6=8E=A5?= MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- .../getting-started/titanic/README.md | 38 ++++++++++--------- 1 file changed, 20 insertions(+), 18 deletions(-) diff --git a/docs/Kaggle/competitions/getting-started/titanic/README.md b/docs/Kaggle/competitions/getting-started/titanic/README.md index 2e5167fd..831f2a96 100644 --- a/docs/Kaggle/competitions/getting-started/titanic/README.md +++ b/docs/Kaggle/competitions/getting-started/titanic/README.md @@ -65,16 +65,18 @@ test = pd.read_csv('%s/%s' % (root_path, 'test.csv')) | 特征 | 描述 | 值| | - | - | - | -| survival | 生存 | 0 = No, 1 = Yes | -| pclass | 票类别-社会地位 | 1 = 1st, 2 = 2nd, 3 = 3rd | -| sex | 性别 | | +| PassengerId | 乘客ID | | +| Survived | 生存 | 0 = No, 1 = Yes | +| Pclass | 票类别-社会地位 | 1 = 1st, 2 = 2nd, 3 = 3rd | +| Name | 姓名 | | +| Sex | 性别 | | | Age | 年龄 | | -| sibsp | 兄弟姐妹/配偶 | | -| parch | 父母/孩子的数量 | | -| ticket | 票号 | | -| fare | 乘客票价 | | -| cabin | 客舱号码 | | -| embarked | 登船港口 | C=Cherbourg, Q=Queenstown, S=Southampton | +| SibSp | 兄弟姐妹/配偶 | | +| Parch | 父母/孩子的数量 | | +| Ticket | 票号 | | +| Fare | 乘客票价 | | +| Cabin | 客舱号码 | | +| Embarked | 登船港口 | C=Cherbourg, Q=Queenstown, S=Southampton | ### 特征详情 @@ -82,7 +84,7 @@ test = pd.read_csv('%s/%s' % (root_path, 'test.csv')) train.head(5) ``` -![](/img/competitions/getting-started/titanic/titanic_top_5.jpg) +![](../../../img/competitions/getting-started/titanic/titanic_top_5.jpg) ```python # 返回每列列名,该列非nan值个数,以及该列类型 @@ -134,7 +136,7 @@ memory usage: 36.0+ KB train.describe() ``` -![](/img/competitions/getting-started/titanic/titanic_train_desc.jpg) +![](../../../img/competitions/getting-started/titanic/titanic_train_desc.jpg) ### 特征分析(统计学与绘图) 目的:初步了解数据之间的相关性,为构造特征工程以及模型建立做准备 @@ -237,7 +239,7 @@ a = plt.subplots(figsize=(15,9))#调整画布大小 a = sns.heatmap(train_corr, vmin=-1, vmax=1 , annot=True , square=True)#画热力图 ``` -![png](/img/competitions/getting-started/titanic/titanic_corr_analysis.png) +![png](../../../img/competitions/getting-started/titanic/titanic_corr_analysis.png) > 2)各个数据与结果的关系 @@ -292,7 +294,7 @@ train[['Pclass','Survived']].groupby(['Pclass']).mean().plot.bar() ``` -![png](/img/competitions/getting-started/titanic/titanic_pclass_bar.png) +![png](../../../img/competitions/getting-started/titanic/titanic_pclass_bar.png) * ② Sex,性别 @@ -333,7 +335,7 @@ train[['Sex','Survived']].groupby(['Sex']).mean().plot.bar() ``` -![png](/img/competitions/getting-started/titanic/titanic_sex_bar.png) +![png](../../../img/competitions/getting-started/titanic/titanic_sex_bar.png) * ③ SibSp and Parch 兄妹配偶数/父母子女数 @@ -394,7 +396,7 @@ train[['Parch','Survived']].groupby(['Parch']).mean().plot.bar() ``` -![png](/img/competitions/getting-started/titanic/titanic_parch_bar.png) +![png](../../../img/competitions/getting-started/titanic/titanic_parch_bar.png) * ④ Age年龄与生存情况的分析. @@ -409,7 +411,7 @@ g.map(plt.hist, 'Age', bins=40) ``` -![png](/img/competitions/getting-started/titanic/titanic_age_map.png) +![png](../../../img/competitions/getting-started/titanic/titanic_age_map.png) ```python train.groupby(['Age'])['Survived'].mean().plot() @@ -417,7 +419,7 @@ train.groupby(['Age'])['Survived'].mean().plot() ``` -![png](/img/competitions/getting-started/titanic/titanic_age_axes.png) +![png](../../../img/competitions/getting-started/titanic/titanic_age_axes.png) * ⑤ Embarked登港港口与生存情况的分析 @@ -429,7 +431,7 @@ sns.countplot('Embarked',hue='Survived',data=train) ``` -![png](/img/competitions/getting-started/titanic/titanic_embarked_count.png) +![png](../../../img/competitions/getting-started/titanic/titanic_embarked_count.png) * ⑥ 其他因素