Skip to content

Commit

Permalink
修改过期图片链接
Browse files Browse the repository at this point in the history
  • Loading branch information
jiangzhonglian authored Aug 25, 2019
1 parent 85a1092 commit 8b47d37
Showing 1 changed file with 20 additions and 18 deletions.
38 changes: 20 additions & 18 deletions docs/Kaggle/competitions/getting-started/titanic/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -65,24 +65,26 @@ test = pd.read_csv('%s/%s' % (root_path, 'test.csv'))

| 特征 | 描述 ||
| - | - | - |
| survival | 生存 | 0 = No, 1 = Yes |
| pclass | 票类别-社会地位 | 1 = 1st, 2 = 2nd, 3 = 3rd |
| sex | 性别 | |
| PassengerId | 乘客ID | |
| Survived | 生存 | 0 = No, 1 = Yes |
| Pclass | 票类别-社会地位 | 1 = 1st, 2 = 2nd, 3 = 3rd |
| Name | 姓名 | |
| Sex | 性别 | |
| Age | 年龄 | |
| sibsp | 兄弟姐妹/配偶 | |
| parch | 父母/孩子的数量 | |
| ticket | 票号 | |
| fare | 乘客票价 | |
| cabin | 客舱号码 | |
| embarked | 登船港口 | C=Cherbourg, Q=Queenstown, S=Southampton |
| SibSp | 兄弟姐妹/配偶 | |
| Parch | 父母/孩子的数量 | |
| Ticket | 票号 | |
| Fare | 乘客票价 | |
| Cabin | 客舱号码 | |
| Embarked | 登船港口 | C=Cherbourg, Q=Queenstown, S=Southampton |

### 特征详情

```python
train.head(5)
```

![](/img/competitions/getting-started/titanic/titanic_top_5.jpg)
![](../../../img/competitions/getting-started/titanic/titanic_top_5.jpg)

```python
# 返回每列列名,该列非nan值个数,以及该列类型
Expand Down Expand Up @@ -134,7 +136,7 @@ memory usage: 36.0+ KB
train.describe()
```

![](/img/competitions/getting-started/titanic/titanic_train_desc.jpg)
![](../../../img/competitions/getting-started/titanic/titanic_train_desc.jpg)

### 特征分析(统计学与绘图)
目的:初步了解数据之间的相关性,为构造特征工程以及模型建立做准备
Expand Down Expand Up @@ -237,7 +239,7 @@ a = plt.subplots(figsize=(15,9))#调整画布大小
a = sns.heatmap(train_corr, vmin=-1, vmax=1 , annot=True , square=True)#画热力图
```

![png](/img/competitions/getting-started/titanic/titanic_corr_analysis.png)
![png](../../../img/competitions/getting-started/titanic/titanic_corr_analysis.png)

> 2)各个数据与结果的关系
Expand Down Expand Up @@ -292,7 +294,7 @@ train[['Pclass','Survived']].groupby(['Pclass']).mean().plot.bar()
<matplotlib.axes._subplots.AxesSubplot at 0xc33fa90>
```

![png](/img/competitions/getting-started/titanic/titanic_pclass_bar.png)
![png](../../../img/competitions/getting-started/titanic/titanic_pclass_bar.png)

* ② Sex,性别

Expand Down Expand Up @@ -333,7 +335,7 @@ train[['Sex','Survived']].groupby(['Sex']).mean().plot.bar()
<matplotlib.axes._subplots.AxesSubplot at 0x105c4b630>
```

![png](/img/competitions/getting-started/titanic/titanic_sex_bar.png)
![png](../../../img/competitions/getting-started/titanic/titanic_sex_bar.png)

* ③ SibSp and Parch 兄妹配偶数/父母子女数

Expand Down Expand Up @@ -394,7 +396,7 @@ train[['Parch','Survived']].groupby(['Parch']).mean().plot.bar()
<matplotlib.axes._subplots.AxesSubplot at 0x1144385c0>
```

![png](/img/competitions/getting-started/titanic/titanic_parch_bar.png)
![png](../../../img/competitions/getting-started/titanic/titanic_parch_bar.png)

* ④ Age年龄与生存情况的分析.

Expand All @@ -409,15 +411,15 @@ g.map(plt.hist, 'Age', bins=40)
<seaborn.axisgrid.FacetGrid at 0xc5f7cf8>
```

![png](/img/competitions/getting-started/titanic/titanic_age_map.png)
![png](../../../img/competitions/getting-started/titanic/titanic_age_map.png)

```python
train.groupby(['Age'])['Survived'].mean().plot()

<matplotlib.axes._subplots.AxesSubplot at 0xc71ac18>
```

![png](/img/competitions/getting-started/titanic/titanic_age_axes.png)
![png](../../../img/competitions/getting-started/titanic/titanic_age_axes.png)

* ⑤ Embarked登港港口与生存情况的分析

Expand All @@ -429,7 +431,7 @@ sns.countplot('Embarked',hue='Survived',data=train)
<matplotlib.axes._subplots.AxesSubplot at 0xca1e5f8>
```

![png](/img/competitions/getting-started/titanic/titanic_embarked_count.png)
![png](../../../img/competitions/getting-started/titanic/titanic_embarked_count.png)

* ⑥ 其他因素

Expand Down

0 comments on commit 8b47d37

Please sign in to comment.