dl_project/titanic.ipynb

640 lines
56 KiB
Plaintext
Raw Normal View History

2024-06-11 01:18:29 +02:00
{
"cells": [
{
"cell_type": "markdown",
"source": [
"Preprocessing danych. Niektóre kolumny nie mają wpływu na predykcję i je usuwamy. Trzeba też uzyc one-hot-encodingu do kolumn będących\n",
"kategoriami."
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"import pandas as pd\n",
"\n",
"df = pd.read_csv('data/titanic.csv')\n",
"df.head()\n",
"df = df[['Survived', 'Age', 'Sex', 'Pclass']]\n",
"df = pd.get_dummies(df, columns=['Sex', 'Pclass'])\n",
"df.dropna(inplace=True)\n",
"df.head()"
],
"metadata": {
"collapsed": false,
"is_executing": true
}
},
{
"cell_type": "markdown",
"source": [
"Podzial danych na testowe i treningowe (80/20)"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 20,
"outputs": [],
"source": [
"from sklearn.model_selection import train_test_split\n",
"\n",
"X = df.drop('Survived', axis=1)\n",
"y = df['Survived']\n",
"\n",
"X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=42)"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T12:40:47.464233Z",
"start_time": "2024-04-14T12:40:47.443556Z"
}
}
},
{
"cell_type": "code",
"execution_count": 49,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"RBF: 0.6153846153846154\n"
]
}
],
"source": [
"from sklearn.svm import SVC\n",
"\n",
"model = SVC(kernel=\"rbf\", probability=True, random_state=42)\n",
"model.fit(X_train, y_train)\n",
"print(f\"RBF: {model.score(X_test, y_test)}\")"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:41:02.969628Z",
"start_time": "2024-04-14T13:41:02.806174Z"
}
}
},
{
"cell_type": "code",
"execution_count": 50,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Linear: 0.7832167832167832\n"
]
}
],
"source": [
"from sklearn.svm import SVC\n",
"\n",
"model = SVC(kernel=\"linear\", probability=True, random_state=42)\n",
"model.fit(X_train, y_train)\n",
"print(f\"Linear: {model.score(X_test, y_test)}\")"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:41:04.509561Z",
"start_time": "2024-04-14T13:41:04.220245Z"
}
}
},
{
"cell_type": "code",
"execution_count": 51,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Sigmoid: 0.5594405594405595\n"
]
}
],
"source": [
"from sklearn.svm import SVC\n",
"\n",
"model = SVC(kernel=\"sigmoid\", probability=True, random_state=42)\n",
"model.fit(X_train, y_train)\n",
"print(f\"Sigmoid: {model.score(X_test, y_test)}\")"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:41:05.759314Z",
"start_time": "2024-04-14T13:41:05.654662Z"
}
}
},
{
"cell_type": "markdown",
"source": [
"Wyniki dla domyslnych parametrow dla roznych funkcji jadra nie sa powalajace (56-78%)"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "markdown",
"source": [
"Spróbuję sprawdzić różne wariancje funkcji jądra oraz parametrów, które mówią o tym jak bardzo chcemy unikać misklasyfikacji oraz jak bardzo odległe przypadki mają wpływać na decyzję. Daje to sprawdzenie 300 roznych wariancji modelu."
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "markdown",
"source": [],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 25,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Fitting 5 folds for each of 60 candidates, totalling 300 fits\n",
"[CV] END ......................C=0.5, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=1, kernel=linear; total time= 0.0s\n",
"[CV] END ......................C=0.5, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END .........................C=0.5, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=0.5, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=0.5, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=0.5, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=0.5, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=0.5, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=0.5, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=0.5, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=0.5, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=0.5, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=0.5, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.1, kernel=linear; total time= 0.0s\n",
"[CV] END ....................C=0.5, gamma=0.1, kernel=linear; total time= 0.0s\n",
"[CV] END .......................C=0.5, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=0.5, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=0.5, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=0.5, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=0.5, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=0.5, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=0.5, gamma=0.1, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.1, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.1, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=0.5, gamma=0.01, kernel=linear; total time= 0.0s\n",
"[CV] END ...................C=0.5, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=0.5, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ..................C=0.5, gamma=0.001, kernel=linear; total time= 0.0s\n",
"[CV] END ..................C=0.5, gamma=0.001, kernel=linear; total time= 0.0s\n",
"[CV] END .....................C=0.5, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=0.5, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=0.5, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=0.5, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=0.5, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .................C=0.5, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END .................C=0.5, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END .................C=0.5, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END .................C=0.5, gamma=0.0001, kernel=linear; total time= 0.0s\n",
"[CV] END .................C=0.5, gamma=0.0001, kernel=linear; total time= 0.0s\n",
"[CV] END ....................C=0.5, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=0.5, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ................C=0.5, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=0.5, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=0.5, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=0.5, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=0.5, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=1, kernel=linear; total time= 0.0s\n",
"[CV] END ........................C=1, gamma=1, kernel=linear; total time= 0.1s\n",
"[CV] END ...........................C=1, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ...........................C=1, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ...........................C=1, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ...........................C=1, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ...........................C=1, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .......................C=1, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .......................C=1, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .......................C=1, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .......................C=1, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ......................C=1, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.1, kernel=linear; total time= 0.0s\n",
"[CV] END ......................C=1, gamma=0.1, kernel=linear; total time= 0.1s\n",
"[CV] END .........................C=1, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=1, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=1, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=1, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=1, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=1, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END .....................C=1, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END .....................C=1, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END .....................C=1, gamma=0.01, kernel=linear; total time= 0.0s\n",
"[CV] END .....................C=1, gamma=0.01, kernel=linear; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=1, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END ....................C=1, gamma=0.001, kernel=linear; total time= 0.0s\n",
"[CV] END ....................C=1, gamma=0.001, kernel=linear; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=1, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ...................C=1, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=1, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=1, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END ...................C=1, gamma=0.0001, kernel=linear; total time= 0.0s\n",
"[CV] END ...................C=1, gamma=0.0001, kernel=linear; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=1, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ..................C=1, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=1, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=1, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=1, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=1, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END .......................C=10, gamma=1, kernel=linear; total time= 0.6s\n",
"[CV] END .......................C=10, gamma=1, kernel=linear; total time= 0.4s\n",
"[CV] END .......................C=10, gamma=1, kernel=linear; total time= 0.3s\n",
"[CV] END .......................C=10, gamma=1, kernel=linear; total time= 0.2s\n",
"[CV] END .......................C=10, gamma=1, kernel=linear; total time= 0.3s\n",
"[CV] END ..........................C=10, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ..........................C=10, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ..........................C=10, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ..........................C=10, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ..........................C=10, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=10, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ......................C=10, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ......................C=10, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ......................C=10, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ......................C=10, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=10, gamma=0.1, kernel=linear; total time= 0.5s\n",
"[CV] END .....................C=10, gamma=0.1, kernel=linear; total time= 0.4s\n",
"[CV] END .....................C=10, gamma=0.1, kernel=linear; total time= 0.3s\n",
"[CV] END .....................C=10, gamma=0.1, kernel=linear; total time= 0.2s\n",
"[CV] END .....................C=10, gamma=0.1, kernel=linear; total time= 0.3s\n",
"[CV] END ........................C=10, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=10, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=10, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=10, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ........................C=10, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=10, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=10, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=10, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=10, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=10, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=10, gamma=0.01, kernel=linear; total time= 0.6s\n",
"[CV] END ....................C=10, gamma=0.01, kernel=linear; total time= 0.4s\n",
"[CV] END ....................C=10, gamma=0.01, kernel=linear; total time= 0.3s\n",
"[CV] END ....................C=10, gamma=0.01, kernel=linear; total time= 0.2s\n",
"[CV] END ....................C=10, gamma=0.01, kernel=linear; total time= 0.3s\n",
"[CV] END .......................C=10, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=10, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=10, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=10, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=10, gamma=0.01, kernel=rbf; total time= 0.1s\n",
"[CV] END ...................C=10, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=10, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=10, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=10, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=10, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=10, gamma=0.001, kernel=linear; total time= 0.5s\n",
"[CV] END ...................C=10, gamma=0.001, kernel=linear; total time= 0.4s\n",
"[CV] END ...................C=10, gamma=0.001, kernel=linear; total time= 0.3s\n",
"[CV] END ...................C=10, gamma=0.001, kernel=linear; total time= 0.2s\n",
"[CV] END ...................C=10, gamma=0.001, kernel=linear; total time= 0.3s\n",
"[CV] END ......................C=10, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=10, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=10, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=10, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ......................C=10, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END ..................C=10, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=10, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=10, gamma=0.001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=10, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=10, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=10, gamma=0.0001, kernel=linear; total time= 0.7s\n",
"[CV] END ..................C=10, gamma=0.0001, kernel=linear; total time= 0.4s\n",
"[CV] END ..................C=10, gamma=0.0001, kernel=linear; total time= 0.3s\n",
"[CV] END ..................C=10, gamma=0.0001, kernel=linear; total time= 0.2s\n",
"[CV] END ..................C=10, gamma=0.0001, kernel=linear; total time= 0.3s\n",
"[CV] END .....................C=10, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=10, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=10, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=10, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=10, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END .................C=10, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END .................C=10, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END .................C=10, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END .................C=10, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END .................C=10, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ......................C=100, gamma=1, kernel=linear; total time= 9.8s\n",
"[CV] END ......................C=100, gamma=1, kernel=linear; total time= 5.1s\n",
"[CV] END ......................C=100, gamma=1, kernel=linear; total time= 10.0s\n",
"[CV] END ......................C=100, gamma=1, kernel=linear; total time= 3.6s\n",
"[CV] END ......................C=100, gamma=1, kernel=linear; total time= 5.2s\n",
"[CV] END .........................C=100, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=100, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=100, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=100, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .........................C=100, gamma=1, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=100, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=100, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=100, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=100, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .....................C=100, gamma=1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ....................C=100, gamma=0.1, kernel=linear; total time= 11.1s\n",
"[CV] END ....................C=100, gamma=0.1, kernel=linear; total time= 4.7s\n",
"[CV] END ....................C=100, gamma=0.1, kernel=linear; total time= 8.4s\n",
"[CV] END ....................C=100, gamma=0.1, kernel=linear; total time= 2.0s\n",
"[CV] END ....................C=100, gamma=0.1, kernel=linear; total time= 3.8s\n",
"[CV] END .......................C=100, gamma=0.1, kernel=rbf; total time= 0.2s\n",
"[CV] END .......................C=100, gamma=0.1, kernel=rbf; total time= 0.1s\n",
"[CV] END .......................C=100, gamma=0.1, kernel=rbf; total time= 0.2s\n",
"[CV] END .......................C=100, gamma=0.1, kernel=rbf; total time= 0.3s\n",
"[CV] END .......................C=100, gamma=0.1, kernel=rbf; total time= 0.2s\n",
"[CV] END ...................C=100, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=100, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=100, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=100, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=100, gamma=0.1, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ...................C=100, gamma=0.01, kernel=linear; total time= 10.5s\n",
"[CV] END ...................C=100, gamma=0.01, kernel=linear; total time= 4.6s\n",
"[CV] END ...................C=100, gamma=0.01, kernel=linear; total time= 7.4s\n",
"[CV] END ...................C=100, gamma=0.01, kernel=linear; total time= 1.8s\n",
"[CV] END ...................C=100, gamma=0.01, kernel=linear; total time= 3.4s\n",
"[CV] END ......................C=100, gamma=0.01, kernel=rbf; total time= 0.2s\n",
"[CV] END ......................C=100, gamma=0.01, kernel=rbf; total time= 0.2s\n",
"[CV] END ......................C=100, gamma=0.01, kernel=rbf; total time= 0.2s\n",
"[CV] END ......................C=100, gamma=0.01, kernel=rbf; total time= 0.3s\n",
"[CV] END ......................C=100, gamma=0.01, kernel=rbf; total time= 0.2s\n",
"[CV] END ..................C=100, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=100, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=100, gamma=0.01, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ..................C=100, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=100, gamma=0.01, kernel=sigmoid; total time= 0.0s\n",
"[CV] END ..................C=100, gamma=0.001, kernel=linear; total time= 10.6s\n",
"[CV] END ..................C=100, gamma=0.001, kernel=linear; total time= 5.0s\n",
"[CV] END ..................C=100, gamma=0.001, kernel=linear; total time= 9.5s\n",
"[CV] END ..................C=100, gamma=0.001, kernel=linear; total time= 2.2s\n",
"[CV] END ..................C=100, gamma=0.001, kernel=linear; total time= 4.1s\n",
"[CV] END .....................C=100, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=100, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=100, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .....................C=100, gamma=0.001, kernel=rbf; total time= 0.2s\n",
"[CV] END .....................C=100, gamma=0.001, kernel=rbf; total time= 0.1s\n",
"[CV] END .................C=100, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=100, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=100, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=100, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=100, gamma=0.001, kernel=sigmoid; total time= 0.0s\n",
"[CV] END .................C=100, gamma=0.0001, kernel=linear; total time= 10.9s\n",
"[CV] END .................C=100, gamma=0.0001, kernel=linear; total time= 4.5s\n",
"[CV] END .................C=100, gamma=0.0001, kernel=linear; total time= 7.5s\n",
"[CV] END .................C=100, gamma=0.0001, kernel=linear; total time= 1.8s\n",
"[CV] END .................C=100, gamma=0.0001, kernel=linear; total time= 3.4s\n",
"[CV] END ....................C=100, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=100, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=100, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=100, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ....................C=100, gamma=0.0001, kernel=rbf; total time= 0.1s\n",
"[CV] END ................C=100, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=100, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=100, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=100, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n",
"[CV] END ................C=100, gamma=0.0001, kernel=sigmoid; total time= 0.1s\n"
]
}
],
"source": [
"from sklearn.model_selection import GridSearchCV\n",
"\n",
"model = SVC(probability=True, random_state=42)\n",
"\n",
"param_grid = {\n",
" 'C': [0.5, 1, 10, 100],\n",
" 'gamma': [1, 0.1, 0.01, 0.001, 0.0001],\n",
" 'kernel': ['linear', 'rbf', 'sigmoid']\n",
"}\n",
"\n",
"grid_search = GridSearchCV(estimator=model, param_grid=param_grid, cv=5, verbose=2)\n",
"grid_search.fit(X, y)\n",
"\n",
"best_model = grid_search.best_estimator_"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:01:09.444905Z",
"start_time": "2024-04-14T12:58:11.642838Z"
}
}
},
{
"cell_type": "code",
"execution_count": 26,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"{'C': 1, 'gamma': 1, 'kernel': 'rbf'}\n"
]
}
],
"source": [
"print(grid_search.best_params_)"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:01:14.725788Z",
"start_time": "2024-04-14T13:01:14.717342Z"
}
}
},
{
"cell_type": "markdown",
"source": [],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 36,
"outputs": [
{
"data": {
"text/plain": "0.8951048951048951"
},
"execution_count": 36,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"best_model.score(X_test, y_test)\n"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:06:57.012689Z",
"start_time": "2024-04-14T13:06:57.001809Z"
}
}
},
{
"cell_type": "markdown",
"source": [
"Najlepszy okazal sie model z funkcja jadra RBF oraz z C = 1 oraz gamma = 1. Skutecznosc wzrosla az do 89%. Mozna by pewnie zawężać jeszcze C oraz gamma aby uzyskac jeszcze wieksza dokladnosc."
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 41,
"outputs": [
{
"name": "stderr",
"output_type": "stream",
"text": [
"/usr/local/lib/python3.9/site-packages/sklearn/utils/deprecation.py:87: FutureWarning: Function plot_confusion_matrix is deprecated; Function `plot_confusion_matrix` is deprecated in 1.0 and will be removed in 1.2. Use one of the class methods: ConfusionMatrixDisplay.from_predictions or ConfusionMatrixDisplay.from_estimator.\n",
" warnings.warn(msg, category=FutureWarning)\n"
]
},
{
"data": {
"text/plain": "<sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x1244ac610>"
},
"execution_count": 41,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"text/plain": "<Figure size 640x480 with 2 Axes>",
"image/png": "iVBORw0KGgoAAAANSUhEUgAAAiYAAAHfCAYAAABkuxMKAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjUuMiwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy8qNh9FAAAACXBIWXMAAA9hAAAPYQGoP6dpAAA+0ElEQVR4nO3df3zP9f7/8fv7bfaD7b2Z2CybiIzjV6NYP4jG6JQxJTU1pc73lCSSH+eEopp+iJQ4FVvOsZTCQeEUGQonK6XS8tvKtvrENtPZ7/f3D8f79M6w997v7f1+vXe7dnld8vr9eEfvPTwez+frZbJarVYBAAB4ALO7AwAAADiLxAQAAHgMEhMAAOAxSEwAAIDHIDEBAAAeg8QEAAB4DBITAADgMXzcHQD+p7KyUsePH1dQUJBMJpO7wwEAOMhqterUqVOKiIiQ2Vx7f/cvLi5WaWmp09fx9fWVv7+/CyJyHRITD3L8+HFFRka6OwwAgJOys7PVsmXLWrl2cXGxAoKaSuW/On2t8PBwHT582KOSExITDxIUFCRJ8u2YLFMDXzdHA9SOY1tecHcIQK05VViotq0jbd/ntaG0tFQq/1V+HZMlZ35WVJQq99s3VVpaSmKCqp1t35ga+JKYwGtZLBZ3hwDUujppx/v4O/WzwmryzGGmJCYAABiRSZIzCZCHDmUkMQEAwIhM5jOLM+d7IM+MCgAA1EtUTAAAMCKTyclWjmf2ckhMAAAwIlo5AAAAtYuKCQAARkQrBwAAeA4nWzke2jTxzKgAAEC9RMUEAAAjopUDAAA8BrNyAAAAahcVEwAAjIhWDgAA8Bhe2sohMQEAwIi8tGLimekSAACol6iYAABgRLRyAACAxzCZnExMaOUAAABcEBUTAACMyGw6szhzvgciMQEAwIi8dIyJZ0YFAADqJSomAAAYkZc+x4TEBAAAI6KVAwAAULuomAAAYES0cgAAgMfw0lYOiQkAAEbkpRUTz0yXAABAvUTFBAAAI6KVAwAAPAatHAAAgNpFxQQAAENyspXjobUJEhMAAIyIVg4AAEDtomICAIARmUxOzsrxzIoJiQkAAEbkpdOFPTMqAABQL1ExAQDAiBj8CgAAPMbZVo4ziwMuu+wymUymc5YxY8ZIkoqLizVmzBg1bdpUgYGBGjZsmPLy8hz+WCQmAAAY0dmKiTOLAz777DPl5OTYlg8//FCSdNttt0mSxo8fr7Vr12rFihXKyMjQ8ePHlZiY6PDHopUDAAAuqlmzZnbrs2fP1uWXX64+ffqooKBAixcvVnp6uvr16ydJSk1NVYcOHbRz50716tWr2vehYgIAgBG5qJVTWFhot5SUlFz01qWlpfrHP/6he++9VyaTSZmZmSorK1NcXJztmOjoaEVFRWnHjh0OfSwSEwAAjMhFrZzIyEgFBwfblpSUlIveevXq1crPz9eoUaMkSbm5ufL19VVISIjdcWFhYcrNzXXoY9HKAQCgHsvOzpbFYrGt+/n5XfScxYsXa9CgQYqIiHB5PCQmAAAY0NlZMU5cQJJksVjsEpOLOXr0qD766COtXLnSti08PFylpaXKz8+3q5rk5eUpPDzcobBo5QAAYEBVTd11dKmJ1NRUNW/eXH/84x9t27p3766GDRtq06ZNtm1ZWVk6duyYYmNjHbo+FRMAAFAtlZWVSk1NVXJysnx8/pdCBAcHa/To0ZowYYJCQ0NlsVg0duxYxcbGOjQjRyIxAQDAmEz/XZw530EfffSRjh07pnvvvfecfXPnzpXZbNawYcNUUlKi+Ph4vfrqqw7fg8QEAAADctUYE0cMGDBAVqu1yn3+/v5asGCBFixYUPOYxBgTAADgQaiYAABgQO6omNQFEhMAAAyIxAQAAHgMb01MGGMCAAA8BhUTAACMyA3ThesCiQkAAAZEKwcAAKCWUTEBAMCATCY5WTFxXSyuRGICAIABmeRkK8dDMxNaOQAAwGNQMQEAwIC8dfAriQkAAEbkpdOFaeUAAACPQcUEAAAjcrKVY6WVAwAAXMXZMSbOzeipPSQmAAAYkLcmJowxAQAAHoOKCQAARuSls3JITAAAMCBaOQAAALWMigkAAAbkrRUTEhMAAAzIWxMTWjkAAMBjUDEBAMCAvLViQmICAIAReel0YVo5AADAY1AxAQDAgGjlAAAAj0FiAgAAPIa3JiaMMQEAAB6DigkAAEbkpbNySEwAADAgWjkAAAC1jIoJvN6X/3xSURFNz9n+xoqteuy5d3TZpZdo1rih6tWtjXwb+mjTjn2a/MIK/XzilBuiBVzj+E/5euLlf+qjHd/oP8Vlat3yEi2YPlJXdmzl7tDgIt5aMSExOQ+TyaRVq1ZpyJAhNb7GqFGjlJ+fr9WrV7ssLjiuX/LzatDgf/8Ddrg8QqsXjNXqj75QI39frXxljL7e/6MSHnhZkvSXP/9Rb734/9T/njmyWq3uChuosfzCXzXwvhd1ffd2WvHSg7okJFAHs39WiKWRu0ODC5nkZGLioYNM6l1iMmrUKL355puSJB8fH4WGhqpLly664447NGrUKJnNZ7pbOTk5atKkiTtDhYv8kl9kt/5Icicdyv5Zn3y+X317RiuqRVP1GfmsTp0uliQ9+MTfdXjzc+p91RXK+HeWO0IGnDLvzQ91aVgTLZhxl21bq0svcWNEQPXVyzEmAwcOVE5Ojo4cOaL169erb9++GjdunG6++WaVl5dLksLDw+Xn5+fmSOFqDX0aaPigq7RszQ5Jkp+vj6xWq0pKy23HFJeWq7LSql5dL3dXmIBTNmzbqys7RGnUlMVqN2CKeifN1purPnF3WHCxs60cZxZPVC8TEz8/P4WHh+vSSy9VTEyM/vKXv+if//yn1q9fr7S0NElnfsN/24LJzs7W8OHDFRISotDQUCUkJOjIkSO2/RUVFZowYYJCQkLUtGlTTZo0iTaAB/rjDV0UHBig9HW7JEmf7T2iX4tL9cTYBAX4NVQjf1/NGjdUPj4NFH6Jxc3RAjVz5Mf/05L3tqlNZDO99/IY3TvsOk2Z867eWrfT3aHBlUwuWDxQvUxMqtKvXz917dpVK1euPGdfWVmZ4uPjFRQUpG3btumTTz5RYGCgBg4cqNLSUknSnDlzlJaWpiVLlmj79u06ceKEVq1adcF7lpSUqLCw0G5B7Ro5+Bp9tONb5f5fgaQzbZ5RUxZr4PWd9MPWOTr68fMKDgrQnn3HVFlJYgljqqy0qkv7SE0fM1hd2kdqVOJ1unvINUpdud3doQEXRWLyG9HR0XZVkLPefvttVVZW6o033lDnzp3VoUMHpaam6tixY9qyZYskad68eZo6daoSExPVoUMHLVq0SMHBwRe8X0pKioKDg21LZGRkLXwqnBUZ3kQ3XN1eS1d/arf9413fKWbok2o3YKou7z9Ff56xVC2ah+jIj//npkgB54RdYlF0m3C7bVdcFq4fck+6KSLUBne0cn788UeNHDlSTZs2VUBAgDp37qzdu3fb9lutVk2fPl0tWrRQQECA4uLitH//fofuQWLyG1artcrfqC+//FIHDhxQUFCQAgMDFRgYqNDQUBUXF+vgwYMqKChQTk6OevbsaTvHx8dHPXr0uOD9pk6dqoKCAtuSnZ3t8s+E/7nzllj9fPKU/vXJN1XuP1FwWoVF/9H1Pa5QsyaBWr9tbx1HCLhGz65ttP/oT3bbDh77SS3DQ90UEWpDXScmJ0+e1LXXXquGDRtq/fr1+vbbbzVnzhy7iSLPPfec5s+fr0WLFmnXrl1q3Lix4uPjVVxcXO371LtZOReyb98+tW7d+pztRUVF6t69u5YtW3bOvmbNmtX4fn5+fgywrSMmk0lJt/TS8vd3qaKi0m7fnbf00veHc/V/J4t0dZfWSplwq15962Md+N0XO2AUD97RT/Gj52hO6kYNjYtR5jdH9OaqTzT3L3e4OzS4kMl0ZnHmfEnnDCM438+mZ599VpGRkUpNTbVt++3PTKvVqnnz5unxxx9XQkKCJGnp0qUKCwvT6tWrNWLEiGrFRcXkvzZv3qy9e/dq2LBh5+yLiYnR/v371bx5c7Vt29ZuOduGadGihXb
},
"metadata": {},
"output_type": "display_data"
}
],
"source": [
"from sklearn.metrics import plot_confusion_matrix\n",
"\n",
"plot_confusion_matrix(best_model, X_test, y_test, display_labels=['Died', 'Survived'], cmap='Blues', xticks_rotation='vertical')"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-04-14T13:35:14.404281Z",
"start_time": "2024-04-14T13:35:14.211756Z"
}
}
},
{
"cell_type": "markdown",
"source": [
"Confusion matrix pokazujący rozklad TP, TN, FP oraz FN. W zależności od tego czy bardziej chcemy unikac FN czy tez FP mozna doostosowac model, nawet kosztem ogolnej skutecznosci."
],
"metadata": {
"collapsed": false
}
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 2
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython2",
"version": "2.7.6"
}
},
"nbformat": 4,
"nbformat_minor": 0
}