Dataset Queries

You can create Scoop dataset queries when blending two datasets and when using the Google Sheets Plugin. A query retrieves data from your dataset and can optionally filter and aggregate it.

Standard Options

When querying, you can specify the following parameters:

  • Date range
    If your dataset is a snapshot dataset, you can select "Use latest snapshot". The query then pulls data only from the most recent snapshot. If your dataset is transactional, or if this switch is not selected, you can specify a start date and Scoop will query all data since that date.
  • Dataset Source
    Specify which dataset to use and, if that dataset contains multiple tables, which table to query.
  • Columns to use
    Select the check boxes next to the columns you are interested in. Only those fields are pulled into your query.
  • Column filters
    To restrict your query to a subset of the data, click the filter icon next to the name of the column you wish to filter, then specify how to filter on that column.
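
To make the interaction of these options concrete, here is a minimal conceptual sketch in Python. It is not Scoop's implementation or API: the function `run_query`, its parameters, the `load_date` field, and the sample rows are all hypothetical, and it assumes the start date is inclusive.

```python
from datetime import date

# Hypothetical rows; each one records the date of the load (snapshot) it came from.
rows = [
    {"load_date": date(2024, 1, 1), "deal": "A", "stage": "Open", "amount": 100},
    {"load_date": date(2024, 1, 1), "deal": "B", "stage": "Won", "amount": 250},
    {"load_date": date(2024, 1, 2), "deal": "A", "stage": "Won", "amount": 120},
    {"load_date": date(2024, 1, 2), "deal": "B", "stage": "Won", "amount": 250},
]


def run_query(rows, columns, use_latest_snapshot=False, start_date=None, filters=None):
    """Illustrative model of the standard options (not Scoop's actual code)."""
    if not rows:
        return []
    # Date range: either the latest snapshot only, or everything since start_date.
    if use_latest_snapshot:
        latest = max(r["load_date"] for r in rows)
        selected = [r for r in rows if r["load_date"] == latest]
    elif start_date is not None:
        # Assumption: "since that date" includes the start date itself.
        selected = [r for r in rows if r["load_date"] >= start_date]
    else:
        selected = list(rows)
    # Column filters: keep only rows for which every predicate passes.
    for column, predicate in (filters or {}).items():
        selected = [r for r in selected if predicate(r[column])]
    # Columns to use: project each remaining row onto the chosen columns.
    return [{c: r[c] for c in columns} for r in selected]


latest_won = run_query(
    rows,
    columns=["deal", "amount"],
    use_latest_snapshot=True,
    filters={"stage": lambda s: s == "Won"},
)
```

With the sample rows, `latest_won` contains deals A and B as of the second load, because both are in stage "Won" in the latest snapshot. Filters are applied before column selection here so that you can filter on a column without including it in the output; whether Scoop behaves the same way is an assumption.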

Advanced Options

Scoop offers additional options, which become available when you click the Advanced slider in the upper right.

  • Date to group by
    By default, Scoop uses the load date for grouping by date and for setting the date range. If your dataset contains other dates, you can group by any of those instead. See Intelligent Handling Dates in Data for more on how Scoop works with the dates in your data.
  • Aggregate by selected attributes
    When this switch is not selected, Scoop does not aggregate your data and simply queries the raw data in your dataset. When it is selected, Scoop aggregates the numbers in your data by the attributes (non-number columns) that you have selected. This lets you roll your data up to a higher level, which is often useful when linking it to other datasets that are at a higher level of aggregation.
  • Aggregate by time period
    By default, when aggregating, Scoop aggregates daily. You can instead have Scoop aggregate at a weekly, monthly, quarterly or annual level, which further summarizes the data in your dataset.
  • Data retrieval
    If this is a snapshot dataset, you can select this option to retrieve the changes in your dataset instead of the snapshots themselves. This lets you analyze all or a subset of the changes that have happened across your snapshots. For example, you can pull this data into a spreadsheet and analyze change behavior in your dataset in detail.
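
The aggregation options above can be illustrated with a small Python sketch. This is a conceptual model only, not Scoop's implementation: the helpers `period_start` and `aggregate`, the sample `orders` rows, the use of a sum as the aggregate, and the choice of Monday as the start of the week are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date


def period_start(d, period):
    """Truncate a date to the first day of its period (hypothetical helper)."""
    if period == "daily":
        return d
    if period == "weekly":
        # Assumption: weeks start on Monday.
        return date.fromordinal(d.toordinal() - d.weekday())
    if period == "monthly":
        return d.replace(day=1)
    if period == "quarterly":
        return date(d.year, 3 * ((d.month - 1) // 3) + 1, 1)
    if period == "annual":
        return date(d.year, 1, 1)
    raise ValueError(f"unknown period: {period}")


def aggregate(rows, date_column, attributes, measures, period="daily"):
    """Sum each measure per (period start, attribute values) group."""
    totals = defaultdict(lambda: dict.fromkeys(measures, 0))
    for r in rows:
        key = (period_start(r[date_column], period),) + tuple(r[a] for a in attributes)
        for m in measures:
            totals[key][m] += r[m]
    return dict(totals)


# Hypothetical transactional rows with a date column other than the load date.
orders = [
    {"order_date": date(2024, 1, 15), "region": "East", "rep": "Ann", "amount": 100},
    {"order_date": date(2024, 1, 20), "region": "East", "rep": "Bob", "amount": 50},
    {"order_date": date(2024, 2, 3), "region": "East", "rep": "Ann", "amount": 70},
    {"order_date": date(2024, 2, 9), "region": "West", "rep": "Cy", "amount": 30},
]

# Group by order date (monthly) and by the single selected attribute "region".
monthly = aggregate(orders, "order_date", ["region"], ["amount"], period="monthly")
```

Because only `region` is selected as an attribute, the per-rep rows for East in January collapse into a single group with a total of 150. This is the kind of roll-up that makes the data easier to link to a dataset kept at the region and month level.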