Group data to groups based on date

This process groups the data into groups based on the date. If requested (by setting "year_starts_at_Dec1"), it adds December to the next year (i.e. all winter months together). In that case, in the result, every year starts at Dec-01 and ends on Nov-30. The process generates the numeric variable "Year_adj_generated" to show this change. It generates the character variable "group_labels" to allow grouping the data based on the provided periods, and the column "period_label" which contains the period definition, e.g. "Jun-01:Aug-30" (R function 'peri_conv').

daugava use case AquaINFRA peri_conv R

Inputs
Id Title Data Type Description
input_data Table to be grouped by date, with date colum string URL to the table with a column containing a date. It can have other columns which will not be changed during this process.
colname_date Date column name string Column name of the date column in the format defined above. Example: "visit_date".
date_format Date format string The date format used to parse the date, i.e. to extract day, month and year from the date column, e.g. or "y-m-d" for dates like "1998-08-22" (this is the default) or "y/m/d" for dates like "1998/08/22".
group_to_periods Periods to group the data into string Define the periods that you want the data to be grouped into. Please follow the example (which is also the default): "Dec-01:Mar-01,Mar-02:May-30,Jun-01:Aug-30,Sep-01:Nov-30" (first three letters of each month, then a minus/hyphen, then the day (two digits), then comma, then the next period).
period_labels Period labels string Define names for the periods defined above, separated by a comma. Example (which is also the default): "winter,spring,summer,autumn".
year_starts_at_dec1 Include December into next year boolean Tell whether the years in your grouping starts at first of December ("true") or not ("false"). This has to be reflected in the period definitions, these should not contradict each other. Defaults to "true".
Outputs
Id Title Description
data_grouped_by_date The input table grouped by date The input table grouped by period, i.e. with added columns "group_labels" (e.g. "winter", allows to group the data in later analyses), "period_label" (the period definition, e.g. "Jun-01:Aug-30"), "Year_adj_generated" (in case the year is set to start...

Educational resources and documentation

Jobs

Browse jobs

Links

Execution modes

  • Synchronous
  • Asynchronous