{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Preprocessing Examples" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Examine the example dataset format and preprocess example data. Explore properties of the `PreprocessedData` object and masking operations available." ] }, { "cell_type": "code", "execution_count": 93, "metadata": { "scrolled": false }, "outputs": [], "source": [ "from epimodel.preprocessing.data_preprocessor import preprocess_data\n", "import numpy as np\n", "import pandas as pd" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Example Dataset" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Data format:" ] }, { "cell_type": "code", "execution_count": 79, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/html": [ "
\n", " | Country Code | \n", "Date | \n", "Region Name | \n", "Confirmed | \n", "Active | \n", "Deaths | \n", "Mask Wearing | \n", "Symptomatic Testing | \n", "Gatherings <1000 | \n", "Gatherings <100 | \n", "... | \n", "Some Businesses Suspended | \n", "Most Businesses Suspended | \n", "School Closure | \n", "University Closure | \n", "Stay Home Order | \n", "Travel Screen/Quarantine | \n", "Travel Bans | \n", "Public Transport Limited | \n", "Internal Movement Limited | \n", "Public Information Campaigns | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "AL | \n", "2020-01-22 00:00:00+00:00 | \n", "Albania | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "
1 | \n", "AL | \n", "2020-01-23 00:00:00+00:00 | \n", "Albania | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "
2 | \n", "AL | \n", "2020-01-24 00:00:00+00:00 | \n", "Albania | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "
3 | \n", "AL | \n", "2020-01-25 00:00:00+00:00 | \n", "Albania | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "
4 | \n", "AL | \n", "2020-01-26 00:00:00+00:00 | \n", "Albania | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "... | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "
5 rows × 21 columns
\n", "