Title: | Diverse Datasets for 'tsibble' |
---|---|
Description: | Provides diverse datasets in the 'tsibble' data structure. These datasets are useful for learning and demonstrating how tidy temporal data can be tidied, visualised, and forecasted. |
Authors: | Mitchell O'Hara-Wild [aut, cre], Rob Hyndman [aut], Earo Wang [aut], Rakshitha Godahewa [aut], Christoph Bergmeir [ctb] |
Maintainer: | Mitchell O'Hara-Wild <[email protected]> |
License: | GPL-3 |
Version: | 0.4.1.9000 |
Built: | 2024-11-11 04:18:49 UTC |
Source: | https://github.com/tidyverts/tsibbledata |
The data features a major pilots' industrial dispute which results in some weeks having zero passengers. There were also at least two changes in the definitions of passenger classes.
Time series of class tsibble.
ansett is a weekly tsibble with one value:
Passengers: | Total air passengers travelling with Ansett |
Each series is uniquely identified using two keys:
Airports: | The airports that passengers are travelling between (both directions) |
Class: | The class of the ticket. |
Ansett Airlines (which no longer exists).
library(tsibble)
ansett
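Because each ansett series is identified by the Airports and Class keys, a single route and ticket class can be pulled out by filtering on both. The following is a minimal sketch, assuming the tsibbledata package is installed; the values "MEL-SYD" and "Economy" are illustrative key values that are not listed in this page.

```r
# Sketch only: runs when tsibbledata (and therefore tsibble and dplyr) is installed.
if (requireNamespace("tsibbledata", quietly = TRUE)) {
  library(tsibble)
  ansett <- tsibbledata::ansett

  # Inspect the structure described above: key columns and number of series.
  print(key_vars(ansett))
  print(n_keys(ansett))

  # Keep one series by fixing both keys.
  # "MEL-SYD" and "Economy" are assumed example values for the two keys.
  mel_syd_economy <- dplyr::filter(
    ansett,
    Airports == "MEL-SYD",
    Class == "Economy"
  )
  print(mel_syd_economy)
}
```

Filtering on every key leaves a tsibble with a single series, which is usually the easiest starting point for plotting or modelling.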
Meat production in Australia for human consumption
Time series of class tsibble.
aus_livestock is a monthly tsibble with one value:
Count: | Number of animals slaughtered. |
Each series is uniquely identified using two keys:
Animal: | The animal slaughtered. |
State: | The Australian state (or territory). |
Australian Bureau of Statistics, catalogue number 7218.0.55.001 tables 1 to 7.
library(tsibble)
aus_livestock
Quarterly estimates of selected indicators of manufacturing production in Australia.
Time series of class tsibble.
aus_production is a quarterly tsibble with six values:
Beer: | Beer production in megalitres. |
Tobacco: | Tobacco and cigarette production in tonnes. |
Bricks: | Clay brick production in millions of bricks. |
Cement: | Portland cement production in thousands of tonnes. |
Electricity: | Electricity production in gigawatt hours. |
Gas: | Gas production in petajoules. |
Australian Bureau of Statistics, catalogue number 8301.0.55.001 table 1.
library(tsibble)
aus_production
aus_retail is a monthly tsibble with one value:
Turnover: | Retail turnover in $Million AUD |
Time series of class tsibble
Each series is uniquely identified using two keys:
State: | The Australian state (or territory) |
Industry: | The industry of retail trade |
Australian Bureau of Statistics, catalogue number 8501.0, table 11.
library(tsibble)
aus_retail
Historical stock prices from 2014-2018 for Google, Amazon, Facebook and Apple. All prices are in $USD.
Time series of class tsibble.
gafa_stock is a tsibble containing data on irregular trading days:
Open: | The opening price for the stock. |
High: | The stock's highest trading price. |
Low: | The stock's lowest trading price. |
Close: | The closing price for the stock. |
Adj_Close: | The adjusted closing price for the stock. |
Volume: | The amount of stock traded. |
Each stock is uniquely identified by one key:
Symbol: | The ticker symbol for the stock. |
Yahoo Finance historical data
library(tsibble)
gafa_stock
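Since gafa_stock is indexed by irregular trading days, methods that expect a regular interval may not apply directly. One common workaround, sketched here under assumptions, is to re-index a single stock by trading-day number. The ticker value "GOOG" and the new column name trading_day are illustrative and do not come from this page.

```r
# Sketch only: runs when tsibbledata (and therefore tsibble and dplyr) is installed.
if (requireNamespace("tsibbledata", quietly = TRUE)) {
  library(tsibble)
  gafa_stock <- tsibbledata::gafa_stock

  # Reports whether the tsibble has a regular interval.
  print(is_regular(gafa_stock))

  # Keep one stock; "GOOG" is an assumed example value of the Symbol key.
  goog <- dplyr::filter(gafa_stock, Symbol == "GOOG")

  # Number the trading days 1, 2, 3, ... and use that counter as the index.
  goog <- dplyr::mutate(goog, trading_day = dplyr::row_number())
  goog <- update_tsibble(goog, index = trading_day, regular = TRUE)

  print(interval(goog))
}
```

The original date column stays in the data, so results can still be mapped back to calendar dates.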
Economic indicators featured by the World Bank from 1960 to 2017.
Time series of class tsibble.
global_economy is an annual tsibble with six values:
GDP: | Gross domestic product (in $USD February 2019). |
Growth: | Annual percentage growth in GDP. |
CPI: | Consumer price index (base year 2010). |
Imports: | Imports of goods and services (% of GDP). |
Exports: | Exports of goods and services (% of GDP). |
Population: | Total population. |
Each series is uniquely identified by one key:
Country: | The country or region of the series. |
The World Bank, http://datatopics.worldbank.org/world-development-indicators/
library(tsibble)
global_economy
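The GDP and Population columns can be combined to compare economies of different sizes. The sketch below derives GDP per capita for one country, assuming tsibbledata is installed; the column name GDP_per_capita and the Country value "Australia" are illustrative choices, not part of the dataset documentation.

```r
# Sketch only: runs when tsibbledata (and therefore tsibble and dplyr) is installed.
if (requireNamespace("tsibbledata", quietly = TRUE)) {
  library(tsibble)
  global_economy <- tsibbledata::global_economy

  # GDP_per_capita is a new, hypothetical column name: GDP divided by population.
  gdp_pc <- dplyr::mutate(global_economy, GDP_per_capita = GDP / Population)

  # "Australia" is an assumed example value of the Country key.
  australia <- dplyr::filter(gdp_pc, Country == "Australia")

  # Selecting a column from a tsibble keeps its index and key alongside it.
  print(dplyr::select(australia, GDP_per_capita))
}
```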
Annual indicators of household budgets for Australia, Japan, Canada and USA from 1995-2016.
Time series of class tsibble.
hh_budget is an annual tsibble with six values:
Debt: | Debt as a percentage of net disposable income. |
DI: | Annual growth rate of disposable income. |
Expenditure: | Annual growth rate of expenditure. |
Savings: | Savings as a percentage of household disposable income. |
Wealth: | Wealth as a percentage of net disposable income. |
Unemployment: | Percentage of unemployed in the labour force. |
Each country is uniquely identified by one key:
Country: | The country of the series. |
The Organisation for Economic Co-operation and Development (https://data.oecd.org/)
library(tsibble)
hh_budget
This function downloads datasets from the Monash Time Series Forecasting
Repository (https://forecastingdata.org/) and reads them in as a tsibble.
Downloaded data is stored locally, so later calls can reuse it without
downloading again. By default, the data is stored in the appropriate
application data directory, which varies by operating system. The storage
path for these datasets can be changed by setting the rpkg_tsibbledata
option with options(rpkg_tsibbledata = "/path/to/datadir").
monash_forecasting_repository(record_id)
record_id |
Record ID of the dataset to be downloaded from the Monash Time Series Forecasting Repository. A complete table of datasets which can be obtained with this function can be found here: https://forecastingdata.org/#datasets. From this link, the data's |
Datasets from this repository are stored in a tsf file format, which stores time series metadata at the top of the file. This function uses this metadata to produce a tsibble dataset, ready for analysis in R. For more details on the repository and the tsf file format, please refer to: Godahewa, R., Bergmeir, C., Webb, G. I., Hyndman, R. J. & Montero-Manso, P. (2021), Monash Time Series Forecasting Archive.
A tsibble.
Godahewa, R., Bergmeir, C., Webb, G. I., Hyndman, R. J. & Montero-Manso, P. (2021), Monash Time Series Forecasting Repository. https://forecastingdata.org/
Godahewa, R., Bergmeir, C., Webb, G. I., Hyndman, R. J. & Montero-Manso, P. (2021), Monash Time Series Forecasting Archive.
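The storage option and the download call described above could be combined as in the following sketch. The cache directory is a hypothetical location chosen for illustration, and record_id is deliberately left as a placeholder: a real ID has to be looked up at https://forecastingdata.org/#datasets, so the download step is skipped until one is supplied.

```r
# Hypothetical cache location for this sketch; any writable directory works.
data_dir <- file.path(tempdir(), "tsibbledata-cache")
dir.create(data_dir, showWarnings = FALSE, recursive = TRUE)

# Option name as given in the description; the path must be a character string.
options(rpkg_tsibbledata = data_dir)
print(getOption("rpkg_tsibbledata"))

# Placeholder only: replace NA with a record ID taken from the repository's
# dataset table before running the download.
record_id <- NA

if (!is.na(record_id) && requireNamespace("tsibbledata", quietly = TRUE)) {
  # Downloads on first use (needs network access), then reads the data as a tsibble.
  dataset <- tsibbledata::monash_forecasting_repository(record_id)
  print(dataset)
}
```

Setting the option before the first call means the downloaded file lands in the chosen directory from the start.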
A sample from NYC Citi Bike usage of 10 bikes throughout 2018. The data includes event data on each trip, including the trip's start and end times and locations. The customer's gender, birth year and bike usage type is also available.
Time series of class tsibble.
nyc_bikes is a tsibble containing event data. Each event includes these details:
start_time: | The time and date when the trip was started. |
stop_time: | The time and date when the trip was ended. |
start_station: | A unique identifier for the starting bike station. |
start_lat: | The latitude of the starting bike station. |
start_long: | The longitude of the starting bike station. |
end_station: | A unique identifier for the destination bike station. |
end_lat: | The latitude of the destination bike station. |
end_long: | The longitude of the destination bike station. |
type: | The type of trip. A "Customer" has purchased either a 24-hour or 3-day pass, and a "Subscriber" has purchased an annual subscription. |
birth_year: | The bike rider's year of birth. |
gender: | The gender of the bike rider. |
Each series is uniquely identified by one key:
bike_id: | A unique identifier for the bike. |
Citi Bike NYC, https://www.citibikenyc.com/system-data
library(tsibble)
nyc_bikes
olympic_running is a quadrennial tsibble with one value:
Time: | Fastest running time for the event (seconds) |
Time series of class tsibble
The event is identified using two keys:
Length: | The length of the race (meters) |
Sex: | The sex of the event |
The data contains missing values in 1916, 1940 and 1944 due to the World Wars.
https://olympics.com/en/sports/athletics/
library(tsibble)
olympic_running
if (requireNamespace("ggplot2")) {
  library(ggplot2)
  olympic_running %>%
    as_tibble() %>%
    ggplot(aes(x = Year, y = Time, colour = Sex)) +
    geom_line() +
    facet_wrap(~ Length, scales = "free_y")
}
PBS is a monthly tsibble with two values:
Scripts: | Total number of scripts |
Cost: | Cost of the scripts in $AUD |
Time series of class tsibble
The data is disaggregated using four keys:
Concession: | Concessional scripts are given to pensioners, unemployed, dependents, and other card holders |
Type: | Co-payments are made until an individual's script expenditure hits a threshold ($290.00 for concession, $1141.80 otherwise). Safety net subsidies are provided to individuals exceeding this amount. |
ATC1: | Anatomical Therapeutic Chemical index (level 1) |
ATC2: | Anatomical Therapeutic Chemical index (level 2) |
Medicare Australia
library(tsibble)
PBS
Hudson Bay Company trading records for Snowshoe Hare and Canadian Lynx furs from 1845 to 1935. This data contains trade records for all areas of the company.
Time series of class tsibble.
pelt is an annual tsibble with two values:
Hare: | The number of Snowshoe Hare pelts traded. |
Lynx: | The number of Canadian Lynx pelts traded. |
Hudson Bay Company
library(tsibble)
pelt
vic_elec is a half-hourly tsibble with three values:
Demand: | Total electricity demand in MWh. |
Temperature: | Temperature of Melbourne (BOM site 086071). |
Holiday: | Indicator for if that day is a public holiday. |
Time series of class tsibble.
This data is for operational demand, which is the demand met by local scheduled generating units, semi-scheduled generating units, and non-scheduled intermittent generating units of aggregate capacity larger than 30 MWh, and by generation imports to the region. The operational demand excludes the demand met by non-scheduled non-intermittent generating units, non-scheduled intermittent generating units of aggregate capacity smaller than 30 MWh, exempt generation (e.g. rooftop solar, gas tri-generation, very small wind farms, etc), and demand of local scheduled loads. It also excludes some very large industrial users (such as mines or smelters).
Australian Energy Market Operator.
library(tsibble)
vic_elec