Datasets and Pipelines
What is a dataset?
A dataset is where the information is stored. Each dataset contains one or more time series for one or more unique entities, such as countries. Think of a dataset as an Excel spreadsheet, a Python pandas DataFrame, or simply a table with rows and columns. For now you will find datasets created by Alphacast and some featured publishers, but soon there will be many more.
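To make the table analogy concrete, here is a minimal sketch of how a dataset with two entities and one variable could look as a pandas DataFrame. The column names (`Date`, `country`, `gdp_index`) and all values are made up for illustration; they are not taken from any real Alphacast dataset.

```python
import pandas as pd

# Illustrative rows only: two entities (countries), one variable, two dates.
rows = [
    {"Date": "2023-01-01", "country": "Argentina", "gdp_index": 100.0},
    {"Date": "2023-02-01", "country": "Argentina", "gdp_index": 101.5},
    {"Date": "2023-01-01", "country": "Brazil", "gdp_index": 100.0},
    {"Date": "2023-02-01", "country": "Brazil", "gdp_index": 100.8},
]
dataset = pd.DataFrame(rows)
dataset["Date"] = pd.to_datetime(dataset["Date"])

# Each (Date, entity) pair identifies one row; every other column is a variable.
dataset = dataset.set_index(["Date", "country"]).sort_index()

# One time series is one variable for one entity.
argentina_series = dataset.xs("Argentina", level="country")["gdp_index"]
print(argentina_series)
```

Here the dataset holds two time series, one per country, stored in a single table.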
Searching and downloading data
There are two ways to access a dataset: from the Dataset List View or from the Dataset Editor View.
At the Dataset List View
To explore and search for datasets, open the "Datasets" tab. The tab has multiple features (and many more are coming soon). For instance, you can toggle the "Show details" button to hide or show dataset metadata, sort your results by name or last update time, and favorite your frequently used datasets.
When searching for a dataset, use the search bar on the left. Enter keywords to find the data you are looking for. Then open the dataset to see its details and find the repository that stores it.
In the dataset view you can explore and download data. There's a brief description of the dataset, including the source. Next, you will see:
- A list of the variables that make up the dataset. Usually, as columns.
- The transformations that have been applied to the data to make it useful and practical, along with a preview of the dataset in an Excel-like sheet.
- A filter, so that you do not have to download unnecessary information.
- A favorites option: if you regularly check this dataset, add it to your favorites!
- Last but not least, a download option: once you decide what data you need, download your dataset. You can choose between CSV and XLSX format, and you can decide whether you want the variables as columns or as rows.
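The difference between "variables as columns" and "variables as rows" is easiest to see on a small table. The sketch below assumes that "variables as rows" means a transposed layout (one row per variable, one column per date); the exact layout of the downloaded file is not specified here, and the column names and values are invented for illustration.

```python
import pandas as pd

# Hypothetical download with variables as columns (made-up values).
by_columns = pd.DataFrame(
    {
        "Date": ["2023-01-01", "2023-02-01"],
        "exports": [10.0, 12.0],
        "imports": [8.0, 9.0],
    }
)

# Variables as rows: transpose so each row is a variable and each column a date.
by_rows = by_columns.set_index("Date").T
by_rows.index.name = "variable"

# Transposing again restores the original layout.
restored = by_rows.T.rename_axis(columns=None).reset_index()

print(by_rows)
print(restored)
```

Whichever layout you download, pandas can switch between the two, so the choice mostly depends on which one is more convenient to read in a spreadsheet.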
Another way of finding the datasets you want is through a repository, which groups all the datasets related to a specific topic. Select any repository of interest and click on the tab called "Datasets". Choose the dataset you want from the list. Clicking on its name takes you to the dataset view.
At the Dataset Editor View
To access the Dataset Editor View, first select a repository, then click on "Upload CSV". From the Dataset Editor View, you can access a dataset by clicking on the "Datasets" tab. Select the repository that holds the dataset (you can search for it by typing keywords), then click on the dataset you want in the list.
When you select a dataset, the dataset view is organized into three parts: documentation, variables, and charts. You can also download data from the admin view. The only step is to click on the blue "Download CSV" button above the documentation.
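Once you have a CSV file, you can load it with pandas and keep only the rows you need. This is a minimal sketch: the in-memory text stands in for a downloaded file, and the column names (`Date`, `country`, `exports`) and values are assumptions for illustration, not the layout of any particular dataset.

```python
import io

import pandas as pd

# Stand-in for a downloaded file; replace io.StringIO(...) with the path to your CSV.
csv_text = """Date,country,exports
2023-01-01,Argentina,10.0
2023-02-01,Argentina,12.0
2023-01-01,Brazil,20.0
"""

# Parse the date column so it can be filtered and sorted as dates.
df = pd.read_csv(io.StringIO(csv_text), parse_dates=["Date"])

# Keep a single entity, and only rows from a given date onward.
subset = df[(df["country"] == "Argentina") & (df["Date"] >= "2023-02-01")]
print(subset)
```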