@fenago
Last active October 21, 2024 07:38
mod01_examples.ipynb
{
"cells": [
{
"cell_type": "markdown",
"metadata": {
"id": "view-in-github",
"colab_type": "text"
},
"source": [
"<a href=\"https://colab.research.google.com/gist/fenago/08a0cfe98d212d90ed8f7e38c8006d66/mod01_examples.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "ZkLxlu9hq3wF"
},
"source": [
"# Module 1: Introduction"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "q_t58MSxq3wJ"
},
"source": [
"## Get the data"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 1000
},
"id": "-3Q3FsPIq3wK",
"outputId": "a1ad2b21-97bf-4074-8418-4ec644c9ca73"
},
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
" cycle branch type matchup \\\n",
"0 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"1 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"2 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"3 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"4 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"... ... ... ... ... \n",
"12619 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12620 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12621 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12622 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12623 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"\n",
" forecastdate state startdate enddate \\\n",
"0 11/8/16 U.S. 11/3/2016 11/6/2016 \n",
"1 11/8/16 U.S. 11/1/2016 11/7/2016 \n",
"2 11/8/16 U.S. 11/2/2016 11/6/2016 \n",
"3 11/8/16 U.S. 11/4/2016 11/7/2016 \n",
"4 11/8/16 U.S. 11/3/2016 11/6/2016 \n",
"... ... ... ... ... \n",
"12619 11/8/16 New Hampshire 7/9/2016 7/18/2016 \n",
"12620 11/8/16 Wisconsin 10/21/2016 11/2/2016 \n",
"12621 11/8/16 New York 8/7/2016 8/10/2016 \n",
"12622 11/8/16 Virginia 9/30/2016 10/6/2016 \n",
"12623 11/8/16 Wisconsin 6/9/2016 6/12/2016 \n",
"\n",
" pollster grade ... adjpoll_clinton adjpoll_trump \\\n",
"0 ABC News/Washington Post A+ ... 45.20163 41.72430 \n",
"1 Google Consumer Surveys B ... 43.34557 41.21439 \n",
"2 Ipsos A- ... 42.02638 38.81620 \n",
"3 YouGov B ... 45.65676 40.92004 \n",
"4 Gravis Marketing B- ... 46.84089 42.33184 \n",
"... ... ... ... ... ... \n",
"12619 University of New Hampshire B+ ... 40.24983 43.04717 \n",
"12620 Ipsos A- ... 46.54218 38.96884 \n",
"12621 Siena College A ... 53.83622 32.47939 \n",
"12622 Ipsos A- ... 49.57558 39.96954 \n",
"12623 Marquette University A ... 46.40999 39.19839 \n",
"\n",
" adjpoll_johnson adjpoll_mcmullin multiversions \\\n",
"0 4.626221 NaN NaN \n",
"1 5.175792 NaN NaN \n",
"2 6.844734 NaN NaN \n",
"3 6.069454 NaN NaN \n",
"4 3.726098 NaN NaN \n",
"... ... ... ... \n",
"12619 6.924110 NaN NaN \n",
"12620 NaN NaN NaN \n",
"12621 3.881193 NaN NaN \n",
"12622 NaN NaN NaN \n",
"12623 NaN NaN NaN \n",
"\n",
" url poll_id \\\n",
"0 https://www.washingtonpost.com/news/the-fix/wp... 48630 \n",
"1 https://datastudio.google.com/u/0/#/org//repor... 48847 \n",
"2 http://projects.fivethirtyeight.com/polls/2016... 48922 \n",
"3 https://d25d2506sfb94s.cloudfront.net/cumulus_... 48687 \n",
"4 http://www.gravispolls.com/2016/11/final-natio... 48848 \n",
"... ... ... \n",
"12619 https://cola.unh.edu/sites/cola.unh.edu/files/... 44650 \n",
"12620 http://www.reuters.com/statesofthenation/ 48259 \n",
"12621 https://www.siena.edu/assets/files/news/SNY081... 44852 \n",
"12622 http://www.reuters.com/statesofthenation/ 46675 \n",
"12623 https://law.marquette.edu/poll/2016/06/15/new-... 44341 \n",
"\n",
" question_id createddate timestamp \n",
"0 76192 11/7/16 09:35:33 8 Nov 2016 \n",
"1 76443 11/7/16 09:35:33 8 Nov 2016 \n",
"2 76636 11/8/16 09:35:33 8 Nov 2016 \n",
"3 76262 11/7/16 09:35:33 8 Nov 2016 \n",
"4 76444 11/7/16 09:35:33 8 Nov 2016 \n",
"... ... ... ... \n",
"12619 68189 7/21/16 09:14:14 8 Nov 2016 \n",
"12620 75560 11/3/16 09:14:14 8 Nov 2016 \n",
"12621 68743 8/15/16 09:14:14 8 Nov 2016 \n",
"12622 72969 10/10/16 09:14:14 8 Nov 2016 \n",
"12623 66966 6/15/16 09:14:14 8 Nov 2016 \n",
"\n",
"[12624 rows x 27 columns]"
],
"text/html": [
"\n",
" <div id=\"df-4d8cd7fb-fa04-4e02-94b3-5e20c808bb45\" class=\"colab-df-container\">\n",
" <div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>cycle</th>\n",
" <th>branch</th>\n",
" <th>type</th>\n",
" <th>matchup</th>\n",
" <th>forecastdate</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>ABC News/Washington Post</td>\n",
" <td>A+</td>\n",
" <td>...</td>\n",
" <td>45.20163</td>\n",
" <td>41.72430</td>\n",
" <td>4.626221</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.washingtonpost.com/news/the-fix/wp...</td>\n",
" <td>48630</td>\n",
" <td>76192</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/1/2016</td>\n",
" <td>11/7/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>...</td>\n",
" <td>43.34557</td>\n",
" <td>41.21439</td>\n",
" <td>5.175792</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://datastudio.google.com/u/0/#/org//repor...</td>\n",
" <td>48847</td>\n",
" <td>76443</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/2/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>42.02638</td>\n",
" <td>38.81620</td>\n",
" <td>6.844734</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://projects.fivethirtyeight.com/polls/2016...</td>\n",
" <td>48922</td>\n",
" <td>76636</td>\n",
" <td>11/8/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/4/2016</td>\n",
" <td>11/7/2016</td>\n",
" <td>YouGov</td>\n",
" <td>B</td>\n",
" <td>...</td>\n",
" <td>45.65676</td>\n",
" <td>40.92004</td>\n",
" <td>6.069454</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://d25d2506sfb94s.cloudfront.net/cumulus_...</td>\n",
" <td>48687</td>\n",
" <td>76262</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" <td>...</td>\n",
" <td>46.84089</td>\n",
" <td>42.33184</td>\n",
" <td>3.726098</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.gravispolls.com/2016/11/final-natio...</td>\n",
" <td>48848</td>\n",
" <td>76444</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12619</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>New Hampshire</td>\n",
" <td>7/9/2016</td>\n",
" <td>7/18/2016</td>\n",
" <td>University of New Hampshire</td>\n",
" <td>B+</td>\n",
" <td>...</td>\n",
" <td>40.24983</td>\n",
" <td>43.04717</td>\n",
" <td>6.924110</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://cola.unh.edu/sites/cola.unh.edu/files/...</td>\n",
" <td>44650</td>\n",
" <td>68189</td>\n",
" <td>7/21/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12620</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Wisconsin</td>\n",
" <td>10/21/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>46.54218</td>\n",
" <td>38.96884</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>48259</td>\n",
" <td>75560</td>\n",
" <td>11/3/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12621</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>New York</td>\n",
" <td>8/7/2016</td>\n",
" <td>8/10/2016</td>\n",
" <td>Siena College</td>\n",
" <td>A</td>\n",
" <td>...</td>\n",
" <td>53.83622</td>\n",
" <td>32.47939</td>\n",
" <td>3.881193</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.siena.edu/assets/files/news/SNY081...</td>\n",
" <td>44852</td>\n",
" <td>68743</td>\n",
" <td>8/15/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12622</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Virginia</td>\n",
" <td>9/30/2016</td>\n",
" <td>10/6/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>49.57558</td>\n",
" <td>39.96954</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46675</td>\n",
" <td>72969</td>\n",
" <td>10/10/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12623</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Wisconsin</td>\n",
" <td>6/9/2016</td>\n",
" <td>6/12/2016</td>\n",
" <td>Marquette University</td>\n",
" <td>A</td>\n",
" <td>...</td>\n",
" <td>46.40999</td>\n",
" <td>39.19839</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://law.marquette.edu/poll/2016/06/15/new-...</td>\n",
" <td>44341</td>\n",
" <td>66966</td>\n",
" <td>6/15/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>12624 rows × 27 columns</p>\n",
"</div>\n",
" <div class=\"colab-df-buttons\">\n",
"\n",
" <div class=\"colab-df-container\">\n",
" <button class=\"colab-df-convert\" onclick=\"convertToInteractive('df-4d8cd7fb-fa04-4e02-94b3-5e20c808bb45')\"\n",
" title=\"Convert this dataframe to an interactive table.\"\n",
" style=\"display:none;\">\n",
"\n",
" <svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\" viewBox=\"0 -960 960 960\">\n",
" <path d=\"M120-120v-720h720v720H120Zm60-500h600v-160H180v160Zm220 220h160v-160H400v160Zm0 220h160v-160H400v160ZM180-400h160v-160H180v160Zm440 0h160v-160H620v160ZM180-180h160v-160H180v160Zm440 0h160v-160H620v160Z\"/>\n",
" </svg>\n",
" </button>\n",
"\n",
" <style>\n",
" .colab-df-container {\n",
" display:flex;\n",
" gap: 12px;\n",
" }\n",
"\n",
" .colab-df-convert {\n",
" background-color: #E8F0FE;\n",
" border: none;\n",
" border-radius: 50%;\n",
" cursor: pointer;\n",
" display: none;\n",
" fill: #1967D2;\n",
" height: 32px;\n",
" padding: 0 0 0 0;\n",
" width: 32px;\n",
" }\n",
"\n",
" .colab-df-convert:hover {\n",
" background-color: #E2EBFA;\n",
" box-shadow: 0px 1px 2px rgba(60, 64, 67, 0.3), 0px 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
" fill: #174EA6;\n",
" }\n",
"\n",
" .colab-df-buttons div {\n",
" margin-bottom: 4px;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-convert {\n",
" background-color: #3B4455;\n",
" fill: #D2E3FC;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-convert:hover {\n",
" background-color: #434B5C;\n",
" box-shadow: 0px 1px 3px 1px rgba(0, 0, 0, 0.15);\n",
" filter: drop-shadow(0px 1px 2px rgba(0, 0, 0, 0.3));\n",
" fill: #FFFFFF;\n",
" }\n",
" </style>\n",
"\n",
" <script>\n",
" const buttonEl =\n",
" document.querySelector('#df-4d8cd7fb-fa04-4e02-94b3-5e20c808bb45 button.colab-df-convert');\n",
" buttonEl.style.display =\n",
" google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
"\n",
" async function convertToInteractive(key) {\n",
" const element = document.querySelector('#df-4d8cd7fb-fa04-4e02-94b3-5e20c808bb45');\n",
" const dataTable =\n",
" await google.colab.kernel.invokeFunction('convertToInteractive',\n",
" [key], {});\n",
" if (!dataTable) return;\n",
"\n",
" const docLinkHtml = 'Like what you see? Visit the ' +\n",
" '<a target=\"_blank\" href=https://colab.research.google.com/notebooks/data_table.ipynb>data table notebook</a>'\n",
" + ' to learn more about interactive tables.';\n",
" element.innerHTML = '';\n",
" dataTable['output_type'] = 'display_data';\n",
" await google.colab.output.renderOutput(dataTable, element);\n",
" const docLink = document.createElement('div');\n",
" docLink.innerHTML = docLinkHtml;\n",
" element.appendChild(docLink);\n",
" }\n",
" </script>\n",
" </div>\n",
"\n",
"\n",
"<div id=\"df-ebc122d6-a25e-416f-b89a-f207027d9a00\">\n",
" <button class=\"colab-df-quickchart\" onclick=\"quickchart('df-ebc122d6-a25e-416f-b89a-f207027d9a00')\"\n",
" title=\"Suggest charts\"\n",
" style=\"display:none;\">\n",
"\n",
"<svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\"viewBox=\"0 0 24 24\"\n",
" width=\"24px\">\n",
" <g>\n",
" <path d=\"M19 3H5c-1.1 0-2 .9-2 2v14c0 1.1.9 2 2 2h14c1.1 0 2-.9 2-2V5c0-1.1-.9-2-2-2zM9 17H7v-7h2v7zm4 0h-2V7h2v10zm4 0h-2v-4h2v4z\"/>\n",
" </g>\n",
"</svg>\n",
" </button>\n",
"\n",
"<style>\n",
" .colab-df-quickchart {\n",
" --bg-color: #E8F0FE;\n",
" --fill-color: #1967D2;\n",
" --hover-bg-color: #E2EBFA;\n",
" --hover-fill-color: #174EA6;\n",
" --disabled-fill-color: #AAA;\n",
" --disabled-bg-color: #DDD;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-quickchart {\n",
" --bg-color: #3B4455;\n",
" --fill-color: #D2E3FC;\n",
" --hover-bg-color: #434B5C;\n",
" --hover-fill-color: #FFFFFF;\n",
" --disabled-bg-color: #3B4455;\n",
" --disabled-fill-color: #666;\n",
" }\n",
"\n",
" .colab-df-quickchart {\n",
" background-color: var(--bg-color);\n",
" border: none;\n",
" border-radius: 50%;\n",
" cursor: pointer;\n",
" display: none;\n",
" fill: var(--fill-color);\n",
" height: 32px;\n",
" padding: 0;\n",
" width: 32px;\n",
" }\n",
"\n",
" .colab-df-quickchart:hover {\n",
" background-color: var(--hover-bg-color);\n",
" box-shadow: 0 1px 2px rgba(60, 64, 67, 0.3), 0 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
" fill: var(--button-hover-fill-color);\n",
" }\n",
"\n",
" .colab-df-quickchart-complete:disabled,\n",
" .colab-df-quickchart-complete:disabled:hover {\n",
" background-color: var(--disabled-bg-color);\n",
" fill: var(--disabled-fill-color);\n",
" box-shadow: none;\n",
" }\n",
"\n",
" .colab-df-spinner {\n",
" border: 2px solid var(--fill-color);\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" animation:\n",
" spin 1s steps(1) infinite;\n",
" }\n",
"\n",
" @keyframes spin {\n",
" 0% {\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" border-left-color: var(--fill-color);\n",
" }\n",
" 20% {\n",
" border-color: transparent;\n",
" border-left-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" }\n",
" 30% {\n",
" border-color: transparent;\n",
" border-left-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" border-right-color: var(--fill-color);\n",
" }\n",
" 40% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" }\n",
" 60% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" }\n",
" 80% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" border-bottom-color: var(--fill-color);\n",
" }\n",
" 90% {\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" }\n",
" }\n",
"</style>\n",
"\n",
" <script>\n",
" async function quickchart(key) {\n",
" const quickchartButtonEl =\n",
" document.querySelector('#' + key + ' button');\n",
" quickchartButtonEl.disabled = true; // To prevent multiple clicks.\n",
" quickchartButtonEl.classList.add('colab-df-spinner');\n",
" try {\n",
" const charts = await google.colab.kernel.invokeFunction(\n",
" 'suggestCharts', [key], {});\n",
" } catch (error) {\n",
" console.error('Error during call to suggestCharts:', error);\n",
" }\n",
" quickchartButtonEl.classList.remove('colab-df-spinner');\n",
" quickchartButtonEl.classList.add('colab-df-quickchart-complete');\n",
" }\n",
" (() => {\n",
" let quickchartButtonEl =\n",
" document.querySelector('#df-ebc122d6-a25e-416f-b89a-f207027d9a00 button');\n",
" quickchartButtonEl.style.display =\n",
" google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
" })();\n",
" </script>\n",
"</div>\n",
"\n",
" <div id=\"id_58ea2c50-89f1-4bb2-8a2b-2aa253138dc5\">\n",
" <style>\n",
" .colab-df-generate {\n",
" background-color: #E8F0FE;\n",
" border: none;\n",
" border-radius: 50%;\n",
" cursor: pointer;\n",
" display: none;\n",
" fill: #1967D2;\n",
" height: 32px;\n",
" padding: 0 0 0 0;\n",
" width: 32px;\n",
" }\n",
"\n",
" .colab-df-generate:hover {\n",
" background-color: #E2EBFA;\n",
" box-shadow: 0px 1px 2px rgba(60, 64, 67, 0.3), 0px 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
" fill: #174EA6;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-generate {\n",
" background-color: #3B4455;\n",
" fill: #D2E3FC;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-generate:hover {\n",
" background-color: #434B5C;\n",
" box-shadow: 0px 1px 3px 1px rgba(0, 0, 0, 0.15);\n",
" filter: drop-shadow(0px 1px 2px rgba(0, 0, 0, 0.3));\n",
" fill: #FFFFFF;\n",
" }\n",
" </style>\n",
" <button class=\"colab-df-generate\" onclick=\"generateWithVariable('polls')\"\n",
" title=\"Generate code using this dataframe.\"\n",
" style=\"display:none;\">\n",
"\n",
" <svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\"viewBox=\"0 0 24 24\"\n",
" width=\"24px\">\n",
" <path d=\"M7,19H8.4L18.45,9,17,7.55,7,17.6ZM5,21V16.75L18.45,3.32a2,2,0,0,1,2.83,0l1.4,1.43a1.91,1.91,0,0,1,.58,1.4,1.91,1.91,0,0,1-.58,1.4L9.25,21ZM18.45,9,17,7.55Zm-12,3A5.31,5.31,0,0,0,4.9,8.1,5.31,5.31,0,0,0,1,6.5,5.31,5.31,0,0,0,4.9,4.9,5.31,5.31,0,0,0,6.5,1,5.31,5.31,0,0,0,8.1,4.9,5.31,5.31,0,0,0,12,6.5,5.46,5.46,0,0,0,6.5,12Z\"/>\n",
" </svg>\n",
" </button>\n",
" <script>\n",
" (() => {\n",
" const buttonEl =\n",
" document.querySelector('#id_58ea2c50-89f1-4bb2-8a2b-2aa253138dc5 button.colab-df-generate');\n",
" buttonEl.style.display =\n",
" google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
"\n",
" buttonEl.onclick = () => {\n",
" google.colab.notebook.generateWithVariable('polls');\n",
" }\n",
" })();\n",
" </script>\n",
" </div>\n",
"\n",
" </div>\n",
" </div>\n"
],
"application/vnd.google.colaboratory.intrinsic+json": {
"type": "dataframe",
"variable_name": "polls"
}
},
"metadata": {},
"execution_count": 1
}
],
"source": [
"import pandas as pd\n",
"\n",
"# Load the FiveThirtyEight 2016 presidential general-election polls from the CSV URL\n",
"poll_url = 'http://projects.fivethirtyeight.com/general-model/president_general_polls_2016.csv'\n",
"polls = pd.read_csv(poll_url)\n",
"polls"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "lkMNF4DZq3wN"
},
"source": [
"## Sort the data"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 603
},
"id": "x_E9hzLOq3wN",
"outputId": "d698eb2a-6fe0-49ca-93e3-e428576ff9bc"
},
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
" cycle branch type matchup \\\n",
"10862 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"6718 2016 President now-cast Clinton vs. Trump vs. Johnson \n",
"11203 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"2105 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"2801 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"\n",
" forecastdate state startdate enddate pollster grade ... \\\n",
"10862 11/8/16 Kentucky 9/9/2016 9/22/2016 Ipsos A- ... \n",
"6718 11/8/16 Indiana 9/9/2016 9/22/2016 Ipsos A- ... \n",
"11203 11/8/16 Texas 9/9/2016 9/15/2016 Ipsos A- ... \n",
"2105 11/8/16 Montana 9/9/2016 9/29/2016 Ipsos A- ... \n",
"2801 11/8/16 Arizona 9/9/2016 9/22/2016 Ipsos A- ... \n",
"\n",
" adjpoll_clinton adjpoll_trump adjpoll_johnson adjpoll_mcmullin \\\n",
"10862 37.39425 54.17959 NaN NaN \n",
"6718 33.73702 52.89504 NaN NaN \n",
"11203 28.37295 51.58496 NaN NaN \n",
"2105 38.08105 52.35901 NaN NaN \n",
"2801 41.36205 48.14312 NaN NaN \n",
"\n",
" multiversions url poll_id \\\n",
"10862 NaN http://www.reuters.com/statesofthenation/ 46070 \n",
"6718 NaN http://www.reuters.com/statesofthenation/ 46067 \n",
"11203 NaN http://www.reuters.com/statesofthenation/ 45869 \n",
"2105 NaN http://www.reuters.com/statesofthenation/ 46360 \n",
"2801 NaN http://www.reuters.com/statesofthenation/ 46057 \n",
"\n",
" question_id createddate timestamp \n",
"10862 72062 9/26/16 09:14:14 8 Nov 2016 \n",
"6718 72059 9/26/16 09:24:53 8 Nov 2016 \n",
"11203 71633 9/16/16 09:14:14 8 Nov 2016 \n",
"2105 72488 10/3/16 09:35:33 8 Nov 2016 \n",
"2801 72049 9/26/16 09:35:33 8 Nov 2016 \n",
"\n",
"[5 rows x 27 columns]"
],
"text/html": [
"\n",
" <div id=\"df-9864ff64-971b-484e-9974-cd578bac4fa5\" class=\"colab-df-container\">\n",
" <div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>cycle</th>\n",
" <th>branch</th>\n",
" <th>type</th>\n",
" <th>matchup</th>\n",
" <th>forecastdate</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>10862</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Kentucky</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>37.39425</td>\n",
" <td>54.17959</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46070</td>\n",
" <td>72062</td>\n",
" <td>9/26/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6718</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>now-cast</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Indiana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>33.73702</td>\n",
" <td>52.89504</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46067</td>\n",
" <td>72059</td>\n",
" <td>9/26/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>11203</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Texas</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/15/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>28.37295</td>\n",
" <td>51.58496</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>45869</td>\n",
" <td>71633</td>\n",
" <td>9/16/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2105</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Montana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/29/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>38.08105</td>\n",
" <td>52.35901</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46360</td>\n",
" <td>72488</td>\n",
" <td>10/3/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2801</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Arizona</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>41.36205</td>\n",
" <td>48.14312</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46057</td>\n",
" <td>72049</td>\n",
" <td>9/26/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>5 rows × 27 columns</p>\n",
"</div>\n",
" <div class=\"colab-df-buttons\">\n",
"\n",
" <div class=\"colab-df-container\">\n",
" <button class=\"colab-df-convert\" onclick=\"convertToInteractive('df-9864ff64-971b-484e-9974-cd578bac4fa5')\"\n",
" title=\"Convert this dataframe to an interactive table.\"\n",
" style=\"display:none;\">\n",
"\n",
" <svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\" viewBox=\"0 -960 960 960\">\n",
" <path d=\"M120-120v-720h720v720H120Zm60-500h600v-160H180v160Zm220 220h160v-160H400v160Zm0 220h160v-160H400v160ZM180-400h160v-160H180v160Zm440 0h160v-160H620v160ZM180-180h160v-160H180v160Zm440 0h160v-160H620v160Z\"/>\n",
" </svg>\n",
" </button>\n",
"\n",
" <style>\n",
" .colab-df-container {\n",
" display:flex;\n",
" gap: 12px;\n",
" }\n",
"\n",
" .colab-df-convert {\n",
" background-color: #E8F0FE;\n",
" border: none;\n",
" border-radius: 50%;\n",
" cursor: pointer;\n",
" display: none;\n",
" fill: #1967D2;\n",
" height: 32px;\n",
" padding: 0 0 0 0;\n",
" width: 32px;\n",
" }\n",
"\n",
" .colab-df-convert:hover {\n",
" background-color: #E2EBFA;\n",
" box-shadow: 0px 1px 2px rgba(60, 64, 67, 0.3), 0px 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
" fill: #174EA6;\n",
" }\n",
"\n",
" .colab-df-buttons div {\n",
" margin-bottom: 4px;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-convert {\n",
" background-color: #3B4455;\n",
" fill: #D2E3FC;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-convert:hover {\n",
" background-color: #434B5C;\n",
" box-shadow: 0px 1px 3px 1px rgba(0, 0, 0, 0.15);\n",
" filter: drop-shadow(0px 1px 2px rgba(0, 0, 0, 0.3));\n",
" fill: #FFFFFF;\n",
" }\n",
" </style>\n",
"\n",
" <script>\n",
" const buttonEl =\n",
" document.querySelector('#df-9864ff64-971b-484e-9974-cd578bac4fa5 button.colab-df-convert');\n",
" buttonEl.style.display =\n",
" google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
"\n",
" async function convertToInteractive(key) {\n",
" const element = document.querySelector('#df-9864ff64-971b-484e-9974-cd578bac4fa5');\n",
" const dataTable =\n",
" await google.colab.kernel.invokeFunction('convertToInteractive',\n",
" [key], {});\n",
" if (!dataTable) return;\n",
"\n",
" const docLinkHtml = 'Like what you see? Visit the ' +\n",
" '<a target=\"_blank\" href=https://colab.research.google.com/notebooks/data_table.ipynb>data table notebook</a>'\n",
" + ' to learn more about interactive tables.';\n",
" element.innerHTML = '';\n",
" dataTable['output_type'] = 'display_data';\n",
" await google.colab.output.renderOutput(dataTable, element);\n",
" const docLink = document.createElement('div');\n",
" docLink.innerHTML = docLinkHtml;\n",
" element.appendChild(docLink);\n",
" }\n",
" </script>\n",
" </div>\n",
"\n",
"\n",
"<div id=\"df-b4c203e7-f0ff-4c5a-80ce-051a2023ad06\">\n",
" <button class=\"colab-df-quickchart\" onclick=\"quickchart('df-b4c203e7-f0ff-4c5a-80ce-051a2023ad06')\"\n",
" title=\"Suggest charts\"\n",
" style=\"display:none;\">\n",
"\n",
"<svg xmlns=\"http://www.w3.org/2000/svg\" height=\"24px\"viewBox=\"0 0 24 24\"\n",
" width=\"24px\">\n",
" <g>\n",
" <path d=\"M19 3H5c-1.1 0-2 .9-2 2v14c0 1.1.9 2 2 2h14c1.1 0 2-.9 2-2V5c0-1.1-.9-2-2-2zM9 17H7v-7h2v7zm4 0h-2V7h2v10zm4 0h-2v-4h2v4z\"/>\n",
" </g>\n",
"</svg>\n",
" </button>\n",
"\n",
"<style>\n",
" .colab-df-quickchart {\n",
" --bg-color: #E8F0FE;\n",
" --fill-color: #1967D2;\n",
" --hover-bg-color: #E2EBFA;\n",
" --hover-fill-color: #174EA6;\n",
" --disabled-fill-color: #AAA;\n",
" --disabled-bg-color: #DDD;\n",
" }\n",
"\n",
" [theme=dark] .colab-df-quickchart {\n",
" --bg-color: #3B4455;\n",
" --fill-color: #D2E3FC;\n",
" --hover-bg-color: #434B5C;\n",
" --hover-fill-color: #FFFFFF;\n",
" --disabled-bg-color: #3B4455;\n",
" --disabled-fill-color: #666;\n",
" }\n",
"\n",
" .colab-df-quickchart {\n",
" background-color: var(--bg-color);\n",
" border: none;\n",
" border-radius: 50%;\n",
" cursor: pointer;\n",
" display: none;\n",
" fill: var(--fill-color);\n",
" height: 32px;\n",
" padding: 0;\n",
" width: 32px;\n",
" }\n",
"\n",
" .colab-df-quickchart:hover {\n",
" background-color: var(--hover-bg-color);\n",
" box-shadow: 0 1px 2px rgba(60, 64, 67, 0.3), 0 1px 3px 1px rgba(60, 64, 67, 0.15);\n",
" fill: var(--button-hover-fill-color);\n",
" }\n",
"\n",
" .colab-df-quickchart-complete:disabled,\n",
" .colab-df-quickchart-complete:disabled:hover {\n",
" background-color: var(--disabled-bg-color);\n",
" fill: var(--disabled-fill-color);\n",
" box-shadow: none;\n",
" }\n",
"\n",
" .colab-df-spinner {\n",
" border: 2px solid var(--fill-color);\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" animation:\n",
" spin 1s steps(1) infinite;\n",
" }\n",
"\n",
" @keyframes spin {\n",
" 0% {\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" border-left-color: var(--fill-color);\n",
" }\n",
" 20% {\n",
" border-color: transparent;\n",
" border-left-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" }\n",
" 30% {\n",
" border-color: transparent;\n",
" border-left-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" border-right-color: var(--fill-color);\n",
" }\n",
" 40% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" border-top-color: var(--fill-color);\n",
" }\n",
" 60% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" }\n",
" 80% {\n",
" border-color: transparent;\n",
" border-right-color: var(--fill-color);\n",
" border-bottom-color: var(--fill-color);\n",
" }\n",
" 90% {\n",
" border-color: transparent;\n",
" border-bottom-color: var(--fill-color);\n",
" }\n",
" }\n",
"</style>\n",
"\n",
" <script>\n",
" async function quickchart(key) {\n",
" const quickchartButtonEl =\n",
" document.querySelector('#' + key + ' button');\n",
" quickchartButtonEl.disabled = true; // To prevent multiple clicks.\n",
" quickchartButtonEl.classList.add('colab-df-spinner');\n",
" try {\n",
" const charts = await google.colab.kernel.invokeFunction(\n",
" 'suggestCharts', [key], {});\n",
" } catch (error) {\n",
" console.error('Error during call to suggestCharts:', error);\n",
" }\n",
" quickchartButtonEl.classList.remove('colab-df-spinner');\n",
" quickchartButtonEl.classList.add('colab-df-quickchart-complete');\n",
" }\n",
" (() => {\n",
" let quickchartButtonEl =\n",
" document.querySelector('#df-b4c203e7-f0ff-4c5a-80ce-051a2023ad06 button');\n",
" quickchartButtonEl.style.display =\n",
" google.colab.kernel.accessAllowed ? 'block' : 'none';\n",
" })();\n",
" </script>\n",
"</div>\n",
"\n",
" </div>\n",
" </div>\n"
],
"application/vnd.google.colaboratory.intrinsic+json": {
"type": "dataframe",
"variable_name": "polls"
}
},
"metadata": {},
"execution_count": 2
}
],
"source": [
"polls.sort_values('startdate', ascending=False, inplace=True)\n",
"polls.head()"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "yD2bmdawq3wO"
},
"source": [
"## Use lists, slices, tuples, and dictionary objects"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "lki5vJRRq3wO"
},
"source": [
"### A list"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 196
},
"id": "OYEVdwwaq3wP",
"outputId": "049d774e-10cd-4c3e-e54d-0eee50d364fe"
},
"outputs": [
{
"output_type": "execute_result",
"data": {
"text/plain": [
" type state startdate enddate pollster grade samplesize \\\n",
"10862 polls-only Kentucky 9/9/2016 9/22/2016 Ipsos A- 322.0 \n",
"6718 now-cast Indiana 9/9/2016 9/22/2016 Ipsos A- 367.0 \n",
"\n",
" population poll_wt rawpoll_clinton ... adjpoll_clinton \\\n",
"10862 lv 0.003555 37.31 ... 37.39425 \n",
"6718 lv 0.002889 34.92 ... 33.73702 \n",
"\n",
" adjpoll_trump adjpoll_johnson adjpoll_mcmullin multiversions \\\n",
"10862 54.17959 NaN NaN NaN \n",
"6718 52.89504 NaN NaN NaN \n",
"\n",
" url poll_id question_id \\\n",
"10862 http://www.reuters.com/statesofthenation/ 46070 72062 \n",
"6718 http://www.reuters.com/statesofthenation/ 46067 72059 \n",
"\n",
" createddate timestamp \n",
"10862 9/26/16 09:14:14 8 Nov 2016 \n",
"6718 9/26/16 09:24:53 8 Nov 2016 \n",
"\n",
"[2 rows x 23 columns]"
],
"text/html": [
"\n",
" <div id=\"df-5ff761c9-ff14-4c04-ae71-610aa143448b\" class=\"colab-df-container\">\n",
" <div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>type</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>samplesize</th>\n",
" <th>population</th>\n",
" <th>poll_wt</th>\n",
" <th>rawpoll_clinton</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>10862</th>\n",
" <td>polls-only</td>\n",
" <td>Kentucky</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>322.0</td>\n",
" <td>lv</td>\n",
" <td>0.003555</td>\n",
" <td>37.31</td>\n",
" <td>...</td>\n",
" <td>37.39425</td>\n",
" <td>54.17959</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46070</td>\n",
" <td>72062</td>\n",
" <td>9/26/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6718</th>\n",
" <td>now-cast</td>\n",
" <td>Indiana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>367.0</td>\n",
" <td>lv</td>\n",
" <td>0.002889</td>\n",
" <td>34.92</td>\n",
" <td>...</td>\n",
" <td>33.73702</td>\n",
" <td>52.89504</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46067</td>\n",
" <td>72059</td>\n",
" <td>9/26/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>2 rows × 23 columns</p>\n",
"</div>\n",
" </div>\n"
],
"application/vnd.google.colaboratory.intrinsic+json": {
"type": "dataframe",
"variable_name": "polls"
}
},
"metadata": {},
"execution_count": 3
}
],
"source": [
"polls.drop(columns=['cycle','branch','matchup','forecastdate'], inplace=True)\n",
"polls.head(2)"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "-0U5PbEJq3wP"
},
"source": [
"### A tuple"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "7Mgo1EVlq3wP",
"outputId": "8d441a64-b479-426d-c2e3-8ddec91955a1"
},
"outputs": [
{
"data": {
"text/plain": [
"<Axes: >"
]
},
"execution_count": 6,
"metadata": {},
"output_type": "execute_result"
},
{
"data": {
"image/png": "",
"text/plain": [
"<Figure size 640x480 with 1 Axes>"
]
},
"metadata": {},
"output_type": "display_data"
}
],
"source": [
"polls.plot.line(xlim=('2016-06','2016-11'))"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "UsCxzhQgq3wQ"
},
"source": [
"### A dictionary"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "xkptfhweq3wQ",
"outputId": "3139e612-c25e-40be-caa7-af4b8fae1c68"
},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>type</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>samplesize</th>\n",
" <th>population</th>\n",
" <th>poll_wt</th>\n",
" <th>rawpoll_clinton</th>\n",
" <th>...</th>\n",
" <th>Clinton</th>\n",
" <th>Trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>10862</th>\n",
" <td>polls-only</td>\n",
" <td>Kentucky</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>322.0</td>\n",
" <td>lv</td>\n",
" <td>3.554900e-03</td>\n",
" <td>37.31</td>\n",
" <td>...</td>\n",
" <td>37.39425</td>\n",
" <td>54.17959</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46070</td>\n",
" <td>72062</td>\n",
" <td>9/26/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6718</th>\n",
" <td>now-cast</td>\n",
" <td>Indiana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>367.0</td>\n",
" <td>lv</td>\n",
" <td>2.889400e-03</td>\n",
" <td>34.92</td>\n",
" <td>...</td>\n",
" <td>33.73702</td>\n",
" <td>52.89504</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46067</td>\n",
" <td>72059</td>\n",
" <td>9/26/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>11203</th>\n",
" <td>polls-only</td>\n",
" <td>Texas</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/15/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>462.0</td>\n",
" <td>lv</td>\n",
" <td>1.283900e-03</td>\n",
" <td>28.51</td>\n",
" <td>...</td>\n",
" <td>28.37295</td>\n",
" <td>51.58496</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>45869</td>\n",
" <td>71633</td>\n",
" <td>9/16/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2105</th>\n",
" <td>polls-plus</td>\n",
" <td>Montana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/29/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>144.0</td>\n",
" <td>lv</td>\n",
" <td>8.000500e-03</td>\n",
" <td>39.76</td>\n",
" <td>...</td>\n",
" <td>38.08105</td>\n",
" <td>52.35901</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46360</td>\n",
" <td>72488</td>\n",
" <td>10/3/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2801</th>\n",
" <td>polls-plus</td>\n",
" <td>Arizona</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>428.0</td>\n",
" <td>lv</td>\n",
" <td>1.228500e-03</td>\n",
" <td>41.65</td>\n",
" <td>...</td>\n",
" <td>41.36205</td>\n",
" <td>48.14312</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46057</td>\n",
" <td>72049</td>\n",
" <td>9/26/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3981</th>\n",
" <td>polls-plus</td>\n",
" <td>U.S.</td>\n",
" <td>1/10/2016</td>\n",
" <td>1/10/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" <td>2416.0</td>\n",
" <td>rv</td>\n",
" <td>9.470000e-09</td>\n",
" <td>49.00</td>\n",
" <td>...</td>\n",
" <td>47.18561</td>\n",
" <td>50.33852</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.oann.com/pollresults/</td>\n",
" <td>35856</td>\n",
" <td>48165</td>\n",
" <td>1/12/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4013</th>\n",
" <td>polls-plus</td>\n",
" <td>U.S.</td>\n",
" <td>1/10/2016</td>\n",
" <td>1/14/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>1339.0</td>\n",
" <td>lv</td>\n",
" <td>2.720000e-09</td>\n",
" <td>43.50</td>\n",
" <td>...</td>\n",
" <td>41.77744</td>\n",
" <td>36.23294</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35871</td>\n",
" <td>65430</td>\n",
" <td>5/5/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12439</th>\n",
" <td>polls-only</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>1465.0</td>\n",
" <td>lv</td>\n",
" <td>1.890000e-09</td>\n",
" <td>41.80</td>\n",
" <td>...</td>\n",
" <td>40.13148</td>\n",
" <td>35.99752</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8231</th>\n",
" <td>now-cast</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>1465.0</td>\n",
" <td>lv</td>\n",
" <td>1.890000e-09</td>\n",
" <td>41.80</td>\n",
" <td>...</td>\n",
" <td>40.16681</td>\n",
" <td>36.06849</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4023</th>\n",
" <td>polls-plus</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>1465.0</td>\n",
" <td>lv</td>\n",
" <td>1.890000e-09</td>\n",
" <td>41.80</td>\n",
" <td>...</td>\n",
" <td>40.11739</td>\n",
" <td>36.00981</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>12624 rows × 23 columns</p>\n",
"</div>"
],
"text/plain": [
" type state startdate enddate pollster grade \\\n",
"10862 polls-only Kentucky 9/9/2016 9/22/2016 Ipsos A- \n",
"6718 now-cast Indiana 9/9/2016 9/22/2016 Ipsos A- \n",
"11203 polls-only Texas 9/9/2016 9/15/2016 Ipsos A- \n",
"2105 polls-plus Montana 9/9/2016 9/29/2016 Ipsos A- \n",
"2801 polls-plus Arizona 9/9/2016 9/22/2016 Ipsos A- \n",
"... ... ... ... ... ... ... \n",
"3981 polls-plus U.S. 1/10/2016 1/10/2016 Gravis Marketing B- \n",
"4013 polls-plus U.S. 1/10/2016 1/14/2016 Ipsos A- \n",
"12439 polls-only U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"8231 now-cast U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"4023 polls-plus U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"\n",
" samplesize population poll_wt rawpoll_clinton ... Clinton \\\n",
"10862 322.0 lv 3.554900e-03 37.31 ... 37.39425 \n",
"6718 367.0 lv 2.889400e-03 34.92 ... 33.73702 \n",
"11203 462.0 lv 1.283900e-03 28.51 ... 28.37295 \n",
"2105 144.0 lv 8.000500e-03 39.76 ... 38.08105 \n",
"2801 428.0 lv 1.228500e-03 41.65 ... 41.36205 \n",
"... ... ... ... ... ... ... \n",
"3981 2416.0 rv 9.470000e-09 49.00 ... 47.18561 \n",
"4013 1339.0 lv 2.720000e-09 43.50 ... 41.77744 \n",
"12439 1465.0 lv 1.890000e-09 41.80 ... 40.13148 \n",
"8231 1465.0 lv 1.890000e-09 41.80 ... 40.16681 \n",
"4023 1465.0 lv 1.890000e-09 41.80 ... 40.11739 \n",
"\n",
" Trump adjpoll_johnson adjpoll_mcmullin multiversions \\\n",
"10862 54.17959 NaN NaN NaN \n",
"6718 52.89504 NaN NaN NaN \n",
"11203 51.58496 NaN NaN NaN \n",
"2105 52.35901 NaN NaN NaN \n",
"2801 48.14312 NaN NaN NaN \n",
"... ... ... ... ... \n",
"3981 50.33852 NaN NaN NaN \n",
"4013 36.23294 NaN NaN NaN \n",
"12439 35.99752 NaN NaN NaN \n",
"8231 36.06849 NaN NaN NaN \n",
"4023 36.00981 NaN NaN NaN \n",
"\n",
" url poll_id question_id \\\n",
"10862 http://www.reuters.com/statesofthenation/ 46070 72062 \n",
"6718 http://www.reuters.com/statesofthenation/ 46067 72059 \n",
"11203 http://www.reuters.com/statesofthenation/ 45869 71633 \n",
"2105 http://www.reuters.com/statesofthenation/ 46360 72488 \n",
"2801 http://www.reuters.com/statesofthenation/ 46057 72049 \n",
"... ... ... ... \n",
"3981 http://www.oann.com/pollresults/ 35856 48165 \n",
"4013 http://polling.reuters.com/#poll/TM651Y15_13/f... 35871 65430 \n",
"12439 http://polling.reuters.com/#poll/TM651Y15_13/f... 35484 65439 \n",
"8231 http://polling.reuters.com/#poll/TM651Y15_13/f... 35484 65439 \n",
"4023 http://polling.reuters.com/#poll/TM651Y15_13/f... 35484 65439 \n",
"\n",
" createddate timestamp \n",
"10862 9/26/16 09:14:14 8 Nov 2016 \n",
"6718 9/26/16 09:24:53 8 Nov 2016 \n",
"11203 9/16/16 09:14:14 8 Nov 2016 \n",
"2105 10/3/16 09:35:33 8 Nov 2016 \n",
"2801 9/26/16 09:35:33 8 Nov 2016 \n",
"... ... ... \n",
"3981 1/12/16 09:35:33 8 Nov 2016 \n",
"4013 5/5/16 09:35:33 8 Nov 2016 \n",
"12439 5/5/16 09:14:14 8 Nov 2016 \n",
"8231 5/5/16 09:24:53 8 Nov 2016 \n",
"4023 5/5/16 09:35:33 8 Nov 2016 \n",
"\n",
"[12624 rows x 23 columns]"
]
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"polls.rename(columns={'adjpoll_clinton':'Clinton',\n",
" 'adjpoll_trump':'Trump'})"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "VRfioNa8q3wR"
},
"source": [
"### Two slices"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "g8t3KfEEq3wR",
"outputId": "23243e1f-3444-4d62-e6c1-3b772f5aa35e"
},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>ABC News/Washington Post</td>\n",
" <td>A+</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8448</th>\n",
" <td>Michigan</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/4/2016</td>\n",
" <td>Public Policy Polling</td>\n",
" <td>B+</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8420</th>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8623</th>\n",
" <td>Michigan</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/3/2016</td>\n",
" <td>Strategic National</td>\n",
" <td>NaN</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8479</th>\n",
" <td>Pennsylvania</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>10134</th>\n",
" <td>Utah</td>\n",
" <td>10/27/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>SurveyMonkey</td>\n",
" <td>C-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4308</th>\n",
" <td>Florida</td>\n",
" <td>10/27/2016</td>\n",
" <td>11/1/2016</td>\n",
" <td>CNN/Opinion Research Corp.</td>\n",
" <td>A-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6043</th>\n",
" <td>Missouri</td>\n",
" <td>10/27/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>SurveyMonkey</td>\n",
" <td>C-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6118</th>\n",
" <td>Kentucky</td>\n",
" <td>10/27/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>SurveyMonkey</td>\n",
" <td>C-</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6129</th>\n",
" <td>New York</td>\n",
" <td>10/27/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>SurveyMonkey</td>\n",
" <td>C-</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>234 rows × 5 columns</p>\n",
"</div>"
],
"text/plain": [
" state startdate enddate pollster grade\n",
"0 U.S. 11/3/2016 11/6/2016 ABC News/Washington Post A+\n",
"8448 Michigan 11/3/2016 11/4/2016 Public Policy Polling B+\n",
"8420 U.S. 11/3/2016 11/6/2016 Gravis Marketing B-\n",
"8623 Michigan 11/3/2016 11/3/2016 Strategic National NaN\n",
"8479 Pennsylvania 11/3/2016 11/6/2016 Gravis Marketing B-\n",
"... ... ... ... ... ...\n",
"10134 Utah 10/27/2016 11/2/2016 SurveyMonkey C-\n",
"4308 Florida 10/27/2016 11/1/2016 CNN/Opinion Research Corp. A-\n",
"6043 Missouri 10/27/2016 11/2/2016 SurveyMonkey C-\n",
"6118 Kentucky 10/27/2016 11/2/2016 SurveyMonkey C-\n",
"6129 New York 10/27/2016 11/2/2016 SurveyMonkey C-\n",
"\n",
"[234 rows x 5 columns]"
]
},
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"polls.loc[0:100:10,'state':'grade']"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "mIybvQK-q3wR"
},
"source": [
"## How to code a list comprehension"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "U6rpyL45q3wS",
"outputId": "3a914d33-4a53-411e-ed55-eaadae12e5e5"
},
"outputs": [
{
"data": {
"text/plain": [
"[1900, 1902, 1904, 1906, 1908, 1910, 1912, 1914, 1916, 1918]"
]
},
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"xticks = [x for x in range(1900,1920,2)]\n",
"xticks"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "qoqUHuVwq3wS"
},
"source": [
"## How to continue statements"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "HdJ-3ZJhq3wS"
},
"source": [
"### With implicit continuation"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "AaPrpe9Bq3wS",
"outputId": "7713f465-dbb4-49cd-ed6e-038a512db3ed"
},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>type</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>samplesize</th>\n",
" <th>population</th>\n",
" <th>poll_wt</th>\n",
" <th>rawpoll_clinton</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>2551</th>\n",
" <td>polls-plus</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.41425</td>\n",
" <td>67.16723</td>\n",
" <td>10.678870</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>10967</th>\n",
" <td>polls-only</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.43031</td>\n",
" <td>67.15052</td>\n",
" <td>10.670120</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6763</th>\n",
" <td>now-cast</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.39564</td>\n",
" <td>67.21143</td>\n",
" <td>10.681150</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>898</th>\n",
" <td>polls-plus</td>\n",
" <td>Wyoming</td>\n",
" <td>9/6/2016</td>\n",
" <td>9/11/2016</td>\n",
" <td>DFM Research</td>\n",
" <td>B-</td>\n",
" <td>402.0</td>\n",
" <td>lv</td>\n",
" <td>0.162356</td>\n",
" <td>19.00</td>\n",
" <td>...</td>\n",
" <td>21.23883</td>\n",
" <td>58.54750</td>\n",
" <td>6.891332</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.scribd.com/document/324811878/WY-A...</td>\n",
" <td>45919</td>\n",
" <td>71766</td>\n",
" <td>9/21/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5106</th>\n",
" <td>now-cast</td>\n",
" <td>Wyoming</td>\n",
" <td>9/6/2016</td>\n",
" <td>9/11/2016</td>\n",
" <td>DFM Research</td>\n",
" <td>B-</td>\n",
" <td>402.0</td>\n",
" <td>lv</td>\n",
" <td>0.162356</td>\n",
" <td>19.00</td>\n",
" <td>...</td>\n",
" <td>21.14712</td>\n",
" <td>58.52647</td>\n",
" <td>6.927159</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.scribd.com/document/324811878/WY-A...</td>\n",
" <td>45919</td>\n",
" <td>71766</td>\n",
" <td>9/21/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>5 rows × 23 columns</p>\n",
"</div>"
],
"text/plain": [
" type state startdate enddate pollster \\\n",
"2551 polls-plus Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"10967 polls-only Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"6763 now-cast Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"898 polls-plus Wyoming 9/6/2016 9/11/2016 DFM Research \n",
"5106 now-cast Wyoming 9/6/2016 9/11/2016 DFM Research \n",
"\n",
" grade samplesize population poll_wt rawpoll_clinton ... \\\n",
"2551 B 63.0 lv 0.002562 16.57 ... \n",
"10967 B 63.0 lv 0.002562 16.57 ... \n",
"6763 B 63.0 lv 0.002562 16.57 ... \n",
"898 B- 402.0 lv 0.162356 19.00 ... \n",
"5106 B- 402.0 lv 0.162356 19.00 ... \n",
"\n",
" adjpoll_clinton adjpoll_trump adjpoll_johnson adjpoll_mcmullin \\\n",
"2551 24.41425 67.16723 10.678870 NaN \n",
"10967 24.43031 67.15052 10.670120 NaN \n",
"6763 24.39564 67.21143 10.681150 NaN \n",
"898 21.23883 58.54750 6.891332 NaN \n",
"5106 21.14712 58.52647 6.927159 NaN \n",
"\n",
" multiversions url \\\n",
"2551 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"10967 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"6763 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"898 NaN https://www.scribd.com/document/324811878/WY-A... \n",
"5106 NaN https://www.scribd.com/document/324811878/WY-A... \n",
"\n",
" poll_id question_id createddate timestamp \n",
"2551 45792 71476 9/14/16 09:35:33 8 Nov 2016 \n",
"10967 45792 71476 9/14/16 09:14:14 8 Nov 2016 \n",
"6763 45792 71476 9/14/16 09:24:53 8 Nov 2016 \n",
"898 45919 71766 9/21/16 09:35:33 8 Nov 2016 \n",
"5106 45919 71766 9/21/16 09:24:53 8 Nov 2016 \n",
"\n",
"[5 rows x 23 columns]"
]
},
"execution_count": 10,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"polls.sort_values(['state','startdate'],\n",
" ascending=False,\n",
" inplace=True)\n",
"polls.head()"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "DkzSVbXZq3wT"
},
"source": [
"### With explicit continuation"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "fEYDJ55aq3wT",
"outputId": "e7b5ff44-effa-45f1-e19c-c601bf2bfb39"
},
"outputs": [
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>type</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>samplesize</th>\n",
" <th>population</th>\n",
" <th>poll_wt</th>\n",
" <th>rawpoll_clinton</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>2551</th>\n",
" <td>polls-plus</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.41425</td>\n",
" <td>67.16723</td>\n",
" <td>10.678870</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>10967</th>\n",
" <td>polls-only</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.43031</td>\n",
" <td>67.15052</td>\n",
" <td>10.670120</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6763</th>\n",
" <td>now-cast</td>\n",
" <td>Wyoming</td>\n",
" <td>9/7/2016</td>\n",
" <td>9/13/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>63.0</td>\n",
" <td>lv</td>\n",
" <td>0.002562</td>\n",
" <td>16.57</td>\n",
" <td>...</td>\n",
" <td>24.39564</td>\n",
" <td>67.21143</td>\n",
" <td>10.681150</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://drive.google.com/drive/u/0/folders/0B2...</td>\n",
" <td>45792</td>\n",
" <td>71476</td>\n",
" <td>9/14/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>898</th>\n",
" <td>polls-plus</td>\n",
" <td>Wyoming</td>\n",
" <td>9/6/2016</td>\n",
" <td>9/11/2016</td>\n",
" <td>DFM Research</td>\n",
" <td>B-</td>\n",
" <td>402.0</td>\n",
" <td>lv</td>\n",
" <td>0.162356</td>\n",
" <td>19.00</td>\n",
" <td>...</td>\n",
" <td>21.23883</td>\n",
" <td>58.54750</td>\n",
" <td>6.891332</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.scribd.com/document/324811878/WY-A...</td>\n",
" <td>45919</td>\n",
" <td>71766</td>\n",
" <td>9/21/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>5106</th>\n",
" <td>now-cast</td>\n",
" <td>Wyoming</td>\n",
" <td>9/6/2016</td>\n",
" <td>9/11/2016</td>\n",
" <td>DFM Research</td>\n",
" <td>B-</td>\n",
" <td>402.0</td>\n",
" <td>lv</td>\n",
" <td>0.162356</td>\n",
" <td>19.00</td>\n",
" <td>...</td>\n",
" <td>21.14712</td>\n",
" <td>58.52647</td>\n",
" <td>6.927159</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.scribd.com/document/324811878/WY-A...</td>\n",
" <td>45919</td>\n",
" <td>71766</td>\n",
" <td>9/21/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>5 rows × 23 columns</p>\n",
"</div>"
],
"text/plain": [
" type state startdate enddate pollster \\\n",
"2551 polls-plus Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"10967 polls-only Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"6763 now-cast Wyoming 9/7/2016 9/13/2016 Google Consumer Surveys \n",
"898 polls-plus Wyoming 9/6/2016 9/11/2016 DFM Research \n",
"5106 now-cast Wyoming 9/6/2016 9/11/2016 DFM Research \n",
"\n",
" grade samplesize population poll_wt rawpoll_clinton ... \\\n",
"2551 B 63.0 lv 0.002562 16.57 ... \n",
"10967 B 63.0 lv 0.002562 16.57 ... \n",
"6763 B 63.0 lv 0.002562 16.57 ... \n",
"898 B- 402.0 lv 0.162356 19.00 ... \n",
"5106 B- 402.0 lv 0.162356 19.00 ... \n",
"\n",
" adjpoll_clinton adjpoll_trump adjpoll_johnson adjpoll_mcmullin \\\n",
"2551 24.41425 67.16723 10.678870 NaN \n",
"10967 24.43031 67.15052 10.670120 NaN \n",
"6763 24.39564 67.21143 10.681150 NaN \n",
"898 21.23883 58.54750 6.891332 NaN \n",
"5106 21.14712 58.52647 6.927159 NaN \n",
"\n",
" multiversions url \\\n",
"2551 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"10967 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"6763 NaN https://drive.google.com/drive/u/0/folders/0B2... \n",
"898 NaN https://www.scribd.com/document/324811878/WY-A... \n",
"5106 NaN https://www.scribd.com/document/324811878/WY-A... \n",
"\n",
" poll_id question_id createddate timestamp \n",
"2551 45792 71476 9/14/16 09:35:33 8 Nov 2016 \n",
"10967 45792 71476 9/14/16 09:14:14 8 Nov 2016 \n",
"6763 45792 71476 9/14/16 09:24:53 8 Nov 2016 \n",
"898 45919 71766 9/21/16 09:35:33 8 Nov 2016 \n",
"5106 45919 71766 9/21/16 09:24:53 8 Nov 2016 \n",
"\n",
"[5 rows x 23 columns]"
]
},
"execution_count": 11,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# The backslashes are optional here: the open parenthesis already continues the line\n",
"polls.sort_values(['state','startdate'], \\\n",
" ascending=False, \\\n",
" inplace=True)\n",
"polls.head()"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "-Desx3JUq3wU"
},
"source": [
"## How to use magic commands"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "pNCu6h1Tq3wU",
"outputId": "a98a563a-2ae9-4390-8e8c-d024e50f5cae"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"CPU times: total: 141 ms\n",
"Wall time: 42.6 s\n"
]
},
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>cycle</th>\n",
" <th>branch</th>\n",
" <th>type</th>\n",
" <th>matchup</th>\n",
" <th>forecastdate</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>0</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>ABC News/Washington Post</td>\n",
" <td>A+</td>\n",
" <td>...</td>\n",
" <td>45.20163</td>\n",
" <td>41.72430</td>\n",
" <td>4.626221</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.washingtonpost.com/news/the-fix/wp...</td>\n",
" <td>48630</td>\n",
" <td>76192</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>1</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/1/2016</td>\n",
" <td>11/7/2016</td>\n",
" <td>Google Consumer Surveys</td>\n",
" <td>B</td>\n",
" <td>...</td>\n",
" <td>43.34557</td>\n",
" <td>41.21439</td>\n",
" <td>5.175792</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://datastudio.google.com/u/0/#/org//repor...</td>\n",
" <td>48847</td>\n",
" <td>76443</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/2/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>42.02638</td>\n",
" <td>38.81620</td>\n",
" <td>6.844734</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://projects.fivethirtyeight.com/polls/2016...</td>\n",
" <td>48922</td>\n",
" <td>76636</td>\n",
" <td>11/8/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/4/2016</td>\n",
" <td>11/7/2016</td>\n",
" <td>YouGov</td>\n",
" <td>B</td>\n",
" <td>...</td>\n",
" <td>45.65676</td>\n",
" <td>40.92004</td>\n",
" <td>6.069454</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://d25d2506sfb94s.cloudfront.net/cumulus_...</td>\n",
" <td>48687</td>\n",
" <td>76262</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>11/3/2016</td>\n",
" <td>11/6/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" <td>...</td>\n",
" <td>46.84089</td>\n",
" <td>42.33184</td>\n",
" <td>3.726098</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.gravispolls.com/2016/11/final-natio...</td>\n",
" <td>48848</td>\n",
" <td>76444</td>\n",
" <td>11/7/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12619</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>New Hampshire</td>\n",
" <td>7/9/2016</td>\n",
" <td>7/18/2016</td>\n",
" <td>University of New Hampshire</td>\n",
" <td>B+</td>\n",
" <td>...</td>\n",
" <td>40.24983</td>\n",
" <td>43.04717</td>\n",
" <td>6.924110</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://cola.unh.edu/sites/cola.unh.edu/files/...</td>\n",
" <td>44650</td>\n",
" <td>68189</td>\n",
" <td>7/21/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12620</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Wisconsin</td>\n",
" <td>10/21/2016</td>\n",
" <td>11/2/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>46.54218</td>\n",
" <td>38.96884</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>48259</td>\n",
" <td>75560</td>\n",
" <td>11/3/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12621</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>New York</td>\n",
" <td>8/7/2016</td>\n",
" <td>8/10/2016</td>\n",
" <td>Siena College</td>\n",
" <td>A</td>\n",
" <td>...</td>\n",
" <td>53.83622</td>\n",
" <td>32.47939</td>\n",
" <td>3.881193</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://www.siena.edu/assets/files/news/SNY081...</td>\n",
" <td>44852</td>\n",
" <td>68743</td>\n",
" <td>8/15/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12622</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Virginia</td>\n",
" <td>9/30/2016</td>\n",
" <td>10/6/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>49.57558</td>\n",
" <td>39.96954</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46675</td>\n",
" <td>72969</td>\n",
" <td>10/10/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12623</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Wisconsin</td>\n",
" <td>6/9/2016</td>\n",
" <td>6/12/2016</td>\n",
" <td>Marquette University</td>\n",
" <td>A</td>\n",
" <td>...</td>\n",
" <td>46.40999</td>\n",
" <td>39.19839</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>https://law.marquette.edu/poll/2016/06/15/new-...</td>\n",
" <td>44341</td>\n",
" <td>66966</td>\n",
" <td>6/15/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>12624 rows × 27 columns</p>\n",
"</div>"
],
"text/plain": [
" cycle branch type matchup \\\n",
"0 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"1 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"2 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"3 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"4 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"... ... ... ... ... \n",
"12619 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12620 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12621 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12622 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"12623 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"\n",
" forecastdate state startdate enddate \\\n",
"0 11/8/16 U.S. 11/3/2016 11/6/2016 \n",
"1 11/8/16 U.S. 11/1/2016 11/7/2016 \n",
"2 11/8/16 U.S. 11/2/2016 11/6/2016 \n",
"3 11/8/16 U.S. 11/4/2016 11/7/2016 \n",
"4 11/8/16 U.S. 11/3/2016 11/6/2016 \n",
"... ... ... ... ... \n",
"12619 11/8/16 New Hampshire 7/9/2016 7/18/2016 \n",
"12620 11/8/16 Wisconsin 10/21/2016 11/2/2016 \n",
"12621 11/8/16 New York 8/7/2016 8/10/2016 \n",
"12622 11/8/16 Virginia 9/30/2016 10/6/2016 \n",
"12623 11/8/16 Wisconsin 6/9/2016 6/12/2016 \n",
"\n",
" pollster grade ... adjpoll_clinton adjpoll_trump \\\n",
"0 ABC News/Washington Post A+ ... 45.20163 41.72430 \n",
"1 Google Consumer Surveys B ... 43.34557 41.21439 \n",
"2 Ipsos A- ... 42.02638 38.81620 \n",
"3 YouGov B ... 45.65676 40.92004 \n",
"4 Gravis Marketing B- ... 46.84089 42.33184 \n",
"... ... ... ... ... ... \n",
"12619 University of New Hampshire B+ ... 40.24983 43.04717 \n",
"12620 Ipsos A- ... 46.54218 38.96884 \n",
"12621 Siena College A ... 53.83622 32.47939 \n",
"12622 Ipsos A- ... 49.57558 39.96954 \n",
"12623 Marquette University A ... 46.40999 39.19839 \n",
"\n",
" adjpoll_johnson adjpoll_mcmullin multiversions \\\n",
"0 4.626221 NaN NaN \n",
"1 5.175792 NaN NaN \n",
"2 6.844734 NaN NaN \n",
"3 6.069454 NaN NaN \n",
"4 3.726098 NaN NaN \n",
"... ... ... ... \n",
"12619 6.924110 NaN NaN \n",
"12620 NaN NaN NaN \n",
"12621 3.881193 NaN NaN \n",
"12622 NaN NaN NaN \n",
"12623 NaN NaN NaN \n",
"\n",
" url poll_id \\\n",
"0 https://www.washingtonpost.com/news/the-fix/wp... 48630 \n",
"1 https://datastudio.google.com/u/0/#/org//repor... 48847 \n",
"2 http://projects.fivethirtyeight.com/polls/2016... 48922 \n",
"3 https://d25d2506sfb94s.cloudfront.net/cumulus_... 48687 \n",
"4 http://www.gravispolls.com/2016/11/final-natio... 48848 \n",
"... ... ... \n",
"12619 https://cola.unh.edu/sites/cola.unh.edu/files/... 44650 \n",
"12620 http://www.reuters.com/statesofthenation/ 48259 \n",
"12621 https://www.siena.edu/assets/files/news/SNY081... 44852 \n",
"12622 http://www.reuters.com/statesofthenation/ 46675 \n",
"12623 https://law.marquette.edu/poll/2016/06/15/new-... 44341 \n",
"\n",
" question_id createddate timestamp \n",
"0 76192 11/7/16 09:35:33 8 Nov 2016 \n",
"1 76443 11/7/16 09:35:33 8 Nov 2016 \n",
"2 76636 11/8/16 09:35:33 8 Nov 2016 \n",
"3 76262 11/7/16 09:35:33 8 Nov 2016 \n",
"4 76444 11/7/16 09:35:33 8 Nov 2016 \n",
"... ... ... ... \n",
"12619 68189 7/21/16 09:14:14 8 Nov 2016 \n",
"12620 75560 11/3/16 09:14:14 8 Nov 2016 \n",
"12621 68743 8/15/16 09:14:14 8 Nov 2016 \n",
"12622 72969 10/10/16 09:14:14 8 Nov 2016 \n",
"12623 66966 6/15/16 09:14:14 8 Nov 2016 \n",
"\n",
"[12624 rows x 27 columns]"
]
},
"execution_count": 12,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# %time is a line magic: it times only the statement that follows it on the same line\n",
"%time polls = pd.read_csv(poll_url)\n",
"polls"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "cpEGedPCq3wW",
"outputId": "0eb921e9-8d8c-480c-d636-61801ae91698"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"CPU times: total: 31.2 ms\n",
"Wall time: 15.6 ms\n"
]
},
{
"data": {
"text/html": [
"<div>\n",
"<style scoped>\n",
" .dataframe tbody tr th:only-of-type {\n",
" vertical-align: middle;\n",
" }\n",
"\n",
" .dataframe tbody tr th {\n",
" vertical-align: top;\n",
" }\n",
"\n",
" .dataframe thead th {\n",
" text-align: right;\n",
" }\n",
"</style>\n",
"<table border=\"1\" class=\"dataframe\">\n",
" <thead>\n",
" <tr style=\"text-align: right;\">\n",
" <th></th>\n",
" <th>cycle</th>\n",
" <th>branch</th>\n",
" <th>type</th>\n",
" <th>matchup</th>\n",
" <th>forecastdate</th>\n",
" <th>state</th>\n",
" <th>startdate</th>\n",
" <th>enddate</th>\n",
" <th>pollster</th>\n",
" <th>grade</th>\n",
" <th>...</th>\n",
" <th>adjpoll_clinton</th>\n",
" <th>adjpoll_trump</th>\n",
" <th>adjpoll_johnson</th>\n",
" <th>adjpoll_mcmullin</th>\n",
" <th>multiversions</th>\n",
" <th>url</th>\n",
" <th>poll_id</th>\n",
" <th>question_id</th>\n",
" <th>createddate</th>\n",
" <th>timestamp</th>\n",
" </tr>\n",
" </thead>\n",
" <tbody>\n",
" <tr>\n",
" <th>10862</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Kentucky</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>37.39425</td>\n",
" <td>54.17959</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46070</td>\n",
" <td>72062</td>\n",
" <td>9/26/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>6718</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>now-cast</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Indiana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>33.73702</td>\n",
" <td>52.89504</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46067</td>\n",
" <td>72059</td>\n",
" <td>9/26/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>11203</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Texas</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/15/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>28.37295</td>\n",
" <td>51.58496</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>45869</td>\n",
" <td>71633</td>\n",
" <td>9/16/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2105</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Montana</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/29/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>38.08105</td>\n",
" <td>52.35901</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46360</td>\n",
" <td>72488</td>\n",
" <td>10/3/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>2801</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>Arizona</td>\n",
" <td>9/9/2016</td>\n",
" <td>9/22/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>41.36205</td>\n",
" <td>48.14312</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.reuters.com/statesofthenation/</td>\n",
" <td>46057</td>\n",
" <td>72049</td>\n",
" <td>9/26/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>...</th>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" <td>...</td>\n",
" </tr>\n",
" <tr>\n",
" <th>3981</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>1/10/2016</td>\n",
" <td>1/10/2016</td>\n",
" <td>Gravis Marketing</td>\n",
" <td>B-</td>\n",
" <td>...</td>\n",
" <td>47.18561</td>\n",
" <td>50.33852</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://www.oann.com/pollresults/</td>\n",
" <td>35856</td>\n",
" <td>48165</td>\n",
" <td>1/12/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4013</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>1/10/2016</td>\n",
" <td>1/14/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>41.77744</td>\n",
" <td>36.23294</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35871</td>\n",
" <td>65430</td>\n",
" <td>5/5/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>12439</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-only</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>40.13148</td>\n",
" <td>35.99752</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:14:14 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>8231</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>now-cast</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>40.16681</td>\n",
" <td>36.06849</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:24:53 8 Nov 2016</td>\n",
" </tr>\n",
" <tr>\n",
" <th>4023</th>\n",
" <td>2016</td>\n",
" <td>President</td>\n",
" <td>polls-plus</td>\n",
" <td>Clinton vs. Trump vs. Johnson</td>\n",
" <td>11/8/16</td>\n",
" <td>U.S.</td>\n",
" <td>1/1/2016</td>\n",
" <td>1/5/2016</td>\n",
" <td>Ipsos</td>\n",
" <td>A-</td>\n",
" <td>...</td>\n",
" <td>40.11739</td>\n",
" <td>36.00981</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>NaN</td>\n",
" <td>http://polling.reuters.com/#poll/TM651Y15_13/f...</td>\n",
" <td>35484</td>\n",
" <td>65439</td>\n",
" <td>5/5/16</td>\n",
" <td>09:35:33 8 Nov 2016</td>\n",
" </tr>\n",
" </tbody>\n",
"</table>\n",
"<p>12624 rows × 27 columns</p>\n",
"</div>"
],
"text/plain": [
" cycle branch type matchup \\\n",
"10862 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"6718 2016 President now-cast Clinton vs. Trump vs. Johnson \n",
"11203 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"2105 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"2801 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"... ... ... ... ... \n",
"3981 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"4013 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"12439 2016 President polls-only Clinton vs. Trump vs. Johnson \n",
"8231 2016 President now-cast Clinton vs. Trump vs. Johnson \n",
"4023 2016 President polls-plus Clinton vs. Trump vs. Johnson \n",
"\n",
" forecastdate state startdate enddate pollster grade \\\n",
"10862 11/8/16 Kentucky 9/9/2016 9/22/2016 Ipsos A- \n",
"6718 11/8/16 Indiana 9/9/2016 9/22/2016 Ipsos A- \n",
"11203 11/8/16 Texas 9/9/2016 9/15/2016 Ipsos A- \n",
"2105 11/8/16 Montana 9/9/2016 9/29/2016 Ipsos A- \n",
"2801 11/8/16 Arizona 9/9/2016 9/22/2016 Ipsos A- \n",
"... ... ... ... ... ... ... \n",
"3981 11/8/16 U.S. 1/10/2016 1/10/2016 Gravis Marketing B- \n",
"4013 11/8/16 U.S. 1/10/2016 1/14/2016 Ipsos A- \n",
"12439 11/8/16 U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"8231 11/8/16 U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"4023 11/8/16 U.S. 1/1/2016 1/5/2016 Ipsos A- \n",
"\n",
" ... adjpoll_clinton adjpoll_trump adjpoll_johnson adjpoll_mcmullin \\\n",
"10862 ... 37.39425 54.17959 NaN NaN \n",
"6718 ... 33.73702 52.89504 NaN NaN \n",
"11203 ... 28.37295 51.58496 NaN NaN \n",
"2105 ... 38.08105 52.35901 NaN NaN \n",
"2801 ... 41.36205 48.14312 NaN NaN \n",
"... ... ... ... ... ... \n",
"3981 ... 47.18561 50.33852 NaN NaN \n",
"4013 ... 41.77744 36.23294 NaN NaN \n",
"12439 ... 40.13148 35.99752 NaN NaN \n",
"8231 ... 40.16681 36.06849 NaN NaN \n",
"4023 ... 40.11739 36.00981 NaN NaN \n",
"\n",
" multiversions url \\\n",
"10862 NaN http://www.reuters.com/statesofthenation/ \n",
"6718 NaN http://www.reuters.com/statesofthenation/ \n",
"11203 NaN http://www.reuters.com/statesofthenation/ \n",
"2105 NaN http://www.reuters.com/statesofthenation/ \n",
"2801 NaN http://www.reuters.com/statesofthenation/ \n",
"... ... ... \n",
"3981 NaN http://www.oann.com/pollresults/ \n",
"4013 NaN http://polling.reuters.com/#poll/TM651Y15_13/f... \n",
"12439 NaN http://polling.reuters.com/#poll/TM651Y15_13/f... \n",
"8231 NaN http://polling.reuters.com/#poll/TM651Y15_13/f... \n",
"4023 NaN http://polling.reuters.com/#poll/TM651Y15_13/f... \n",
"\n",
" poll_id question_id createddate timestamp \n",
"10862 46070 72062 9/26/16 09:14:14 8 Nov 2016 \n",
"6718 46067 72059 9/26/16 09:24:53 8 Nov 2016 \n",
"11203 45869 71633 9/16/16 09:14:14 8 Nov 2016 \n",
"2105 46360 72488 10/3/16 09:35:33 8 Nov 2016 \n",
"2801 46057 72049 9/26/16 09:35:33 8 Nov 2016 \n",
"... ... ... ... ... \n",
"3981 35856 48165 1/12/16 09:35:33 8 Nov 2016 \n",
"4013 35871 65430 5/5/16 09:35:33 8 Nov 2016 \n",
"12439 35484 65439 5/5/16 09:14:14 8 Nov 2016 \n",
"8231 35484 65439 5/5/16 09:24:53 8 Nov 2016 \n",
"4023 35484 65439 5/5/16 09:35:33 8 Nov 2016 \n",
"\n",
"[12624 rows x 27 columns]"
]
},
"execution_count": 13,
"metadata": {},
"output_type": "execute_result"
}
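],
"source": [
"%%time\n",
"# %%time is a cell magic: it must be the first line of the cell and times the whole cell.\n",
"# startdate holds strings here, so this sort is lexicographic, not chronological.\n",
"polls = polls.sort_values('startdate', ascending=False)\n",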
"polls"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "lXjenANaq3wW",
"outputId": "6be6b84c-8973-447f-fc7a-2c911a5b3ddf"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Variable Type Data/Info\n",
"---------------------------------\n",
"pd module <module 'pandas' from 'C:<...>es\\\\pandas\\\\__init__.py'>\n",
"poll_url str http://projects.fivethirt<...>nt_general_polls_2016.csv\n",
"polls DataFrame cycle branch <...>[12624 rows x 27 columns]\n",
"xticks list n=10\n"
]
}
],
"source": [
"# %whos lists the variables defined in the session with their types and a short summary\n",
"%whos"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "U5lXuyGnq3wX"
},
"source": [
"## How to use the Python type() function"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "g1QTBgBlq3wX",
"outputId": "ee0e07d7-ef15-4882-a8ee-d83c0b3f6e9c"
},
"outputs": [
{
"data": {
"text/plain": [
"str"
]
},
"execution_count": 15,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"type(poll_url)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "yRvnK41xq3wY",
"outputId": "44e17c31-9597-4355-e151-6520ad8d7d2e"
},
"outputs": [
{
"data": {
"text/plain": [
"pandas.core.frame.DataFrame"
]
},
"execution_count": 16,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"type(polls)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "psM7QNfKq3wY"
},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.7"
},
"colab": {
"provenance": [],
"include_colab_link": true
}
},
"nbformat": 4,
"nbformat_minor": 0
}