Survey Scheduling

Overview

A well-structured sampling schedule ensures that creel survey estimates of effort, catch, and CPUE are unbiased and representative of the full survey season. generate_schedule() builds a stratified random sampling calendar from season boundaries and a target sampling intensity — producing the count-period framework that drives the tidycreel estimation pipeline.

The Count Period Framework

Every tidycreel design rests on a calendar of count periods — the discrete time slots during which field crews collect data. A count period is defined by:

Date: the survey date
Period: the time-of-day window (e.g., “Morning”, “Afternoon”)
Stratum: the day type (weekday vs. weekend)

Interviews and angler counts are both collected during count periods. The schedule you generate defines exactly which days and periods will be sampled.

Building a Basic Schedule

The minimal inputs are season boundaries, the number of periods per day, and sampling intensity. The example below builds a schedule for a May–September creel season with morning and afternoon count periods, sampling 30% of weekdays and 50% of weekends.

library(tidycreel)

sched <- generate_schedule(
  start_date    = "2024-05-01",
  end_date      = "2024-09-30",
  n_periods     = 2,
  period_labels = c("Morning", "Afternoon"),
  sampling_rate = c(weekday = 0.30, weekend = 0.50),
  seed          = 42
)

head(sched, 8)

May 2024

Sun	Mon	Tue	Wed	Thu	Fri	Sat
			01	02	WEEKD	04
WEEKE	06	WEEKD	08	09	WEEKD	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

The result is a creel_schedule object with date, day_type, and period_id columns. Period labels replace integer IDs with human-readable names.

# Counts of scheduled days by stratum
table(sched$day_type)
#> 
#> weekday weekend 
#>      66      44

Visualising Sampled vs. Unsampled Days

Pass include_all = TRUE to return every day of the season with a sampled indicator. This is useful for checking seasonal coverage and communicating the design to stakeholders.

sched_full <- generate_schedule(
  start_date    = "2024-05-01",
  end_date      = "2024-09-30",
  n_periods     = 1,
  sampling_rate = c(weekday = 0.30, weekend = 0.50),
  include_all   = TRUE,
  seed          = 42
)

# Sampled proportion by stratum
table(sched_full$day_type, sched_full$sampled)
#>          
#>           FALSE TRUE
#>   weekday    76   33
#>   weekend    22   22

The sampled days are drawn by stratified random sampling within each stratum, so weekday and weekend sampling fractions match the requested rates.

Scheduling High-Use or Other Special Periods

Some surveys need more than the default weekday/weekend split. Opener weekends, holidays, or derby periods are often sampled more heavily because they are expected in advance to have different effort and catch characteristics. In that case, the scheduling layer should assign those dates to their own prospective stratum rather than leaving them inside the ordinary weekend pool.

generate_schedule() accepts a special_periods data frame with start_date, end_date, label, and optional reason columns. The periods are expanded to day-level assignments before sampling, so a boundary-crossing opener is split deterministically by civil date.

opener_periods <- data.frame(
  start_date = as.Date("2027-07-31"),
  end_date = as.Date("2027-08-01"),
  label = "high_use",
  reason = "opener"
)

sched_high_use <- generate_schedule(
  start_date = "2027-07-30",
  end_date = "2027-08-03",
  n_periods = 1,
  sampling_rate = c(weekday = 1, weekend = 1, high_use = 1),
  include_all = TRUE,
  expand_periods = FALSE,
  seed = 42,
  special_periods = opener_periods
)
#> Warning: Special-period declarations produced a fragile schedule design.
#> ℹ Affected strata: "weekend (baseline=2, final=0)"
#> ℹ Inspect the "special_period_diagnostics" attribute or print the schedule for
#>   details.

sched_high_use[, c("date", "day_type", "final_stratum", "special_period_reason", "sampled")]

July 2027

Sun	Mon	Tue	Wed	Thu	Fri	Sat
				01	02	03
04	05	06	07	08	09	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	WEEKD	WEEKE

August 2027

Sun	Mon	Tue	Wed	Thu	Fri	Sat
WEEKE	WEEKD	WEEKD	04	05	06	07
08	09	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30	31

The output keeps the baseline day_type classification and adds final_stratum, which is what the scheduler actually uses for sample allocation when special periods are present. Here the July 31 and August 1 dates are both assigned to high_use, even though one is in July and the other is in August.

You can inspect the audit surfaces directly:

attr(sched_high_use, "special_period_audit")
#>         date    label reason source_start_date source_end_date crosses_boundary
#> 1 2027-07-31 high_use opener        2027-07-31      2027-08-01             TRUE
#> 2 2027-08-01 high_use opener        2027-07-31      2027-08-01             TRUE
#>   sampled
#> 1    TRUE
#> 2    TRUE
attr(sched_high_use, "special_period_allocation")
#>   final_stratum available_days
#> 1      high_use              2
#> 2       weekday              3

These attributes show which dates were reassigned to the special stratum and how many available days ended up in each final stratum after the calendar rules were applied.

When a declaration is technically valid but leaves a very small special stratum or consumes most of a baseline stratum, generate_schedule() now emits a warning rather than failing silently. The returned schedule also carries a special_period_diagnostics attribute, and print() / format() surface a compact summary so you can see the risky stratum rewrite without digging through raw attributes.

sched_fragile <- suppressWarnings(generate_schedule(
  start_date = "2027-08-01",
  end_date = "2027-08-08",
  n_periods = 1,
  sampling_rate = c(weekday = 1, weekend = 1, high_use = 1),
  include_all = TRUE,
  expand_periods = FALSE,
  seed = 42,
  special_periods = data.frame(
    start_date = as.Date("2027-08-07"),
    end_date = as.Date("2027-08-07"),
    label = "high_use",
    reason = "opener"
  )
))

attr(sched_fragile, "special_period_diagnostics")
#>   severity
#> 1  warning
#>                                                                         issue
#> 1 fragile stratum with only one available day after special-period assignment
#>    stratum baseline_days final_days
#> 1 high_use             0          1
head(format(sched_fragile), 6)
#> [1] "# A creel_schedule: 8 rows x 5 cols (8 days, 1 periods)"                                                                
#> [2] "Special-period diagnostics"                                                                                             
#> [3] "- warning: high_use — fragile stratum with only one available day after special-period assignment (baseline=0, final=1)"
#> [4] ""                                                                                                                       
#> [5] "August 2027"                                                                                                            
#> [6] "| Sun      | Mon      | Tue      | Wed      | Thu      | Fri      | Sat      |"

If a declaration rewrites the schedule so aggressively that a requested baseline stratum no longer exists in the final schedule, tidycreel aborts and asks you to update n_days or sampling_rate to match the final strata.

This scheduling feature is the operational companion to the high-use design contract from M020: prospective high-use periods are treated as true design strata, and boundary-crossing periods are split by day rather than assigned wholesale to the month where they started.

Special Strata vs. Post-Stratification

Use special_periods when the unusual period is known before sampling and should be planned explicitly in the field calendar.

Do not use it for retrospective corrections after the season. If a survey did not schedule a high-use or special period prospectively and you later need to correct weights or population totals, that is a post-stratification problem, not a scheduling problem. Those retrospective corrections belong in a separate post-stratification workflow.

Three-Period Schedules

Some surveys divide the day into three periods — morning, midday, and evening — to capture temporal patterns in angler activity (e.g., a morning and evening peak with low midday use).

sched3 <- generate_schedule(
  start_date    = "2024-06-01",
  end_date      = "2024-08-31",
  n_periods     = 3,
  period_labels = c("Morning", "Midday", "Evening"),
  sampling_rate = c(weekday = 0.40, weekend = 0.60),
  seed          = 101
)

head(sched3, 9)

June 2024

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						01
WE	03	04	05	06	07	WE
WE	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

Reproducibility with `seed`

The seed argument ensures that the same calendar is produced every time. This is important when the schedule is pre-printed for field use and must match what the estimators will use.

sched_a <- generate_schedule("2024-06-01", "2024-06-30",
  n_periods = 1,
  sampling_rate = 0.4, seed = 99
)
sched_b <- generate_schedule("2024-06-01", "2024-06-30",
  n_periods = 1,
  sampling_rate = 0.4, seed = 99
)

identical(sched_a, sched_b)
#> [1] TRUE

write_schedule() exports to CSV (for any spreadsheet application) or xlsx (for direct field printing). read_schedule() reloads the file with correct column types, so the schedule round-trips without type coercion issues.

# Save for field crews
write_schedule(sched, "count_schedule_2024.csv")
write_schedule(sched, "count_schedule_2024.xlsx")

# Reload — column types are preserved automatically
sched_reload <- read_schedule("count_schedule_2024.csv")
identical(sched, sched_reload)

Linking the Schedule to a Design

Pass the schedule directly to creel_design(). The schedule’s stratum structure informs the survey weights used by all downstream estimators.

design <- creel_design(sched, date = date, strata = day_type)
design
#> 
#> ── Creel Survey Design ─────────────────────────────────────────────────────────
#> Type: "instantaneous"
#> Date column: date
#> Strata: day_type
#> Calendar: 110 days (2024-05-03 to 2024-09-30)
#> day_type: 2 levels
#> Counts: "none"
#> Interviews: "none"
#> Sections: "none"

When special_periods are present, the scheduling layer adds final_stratum, which is the resolved day-level stratum actually used for allocation. That is the column the analysis layer should follow.

sched_for_analysis <- sched_high_use[, c("date", "final_stratum")]
names(sched_for_analysis)[2] <- "analysis_stratum"

design_special <- creel_design(
  sched_for_analysis,
  date = date,
  strata = analysis_stratum
)
design_special
#> 
#> ── Creel Survey Design ─────────────────────────────────────────────────────────
#> Type: "instantaneous"
#> Date column: date
#> Strata: analysis_stratum
#> Calendar: 5 days (2027-07-30 to 2027-08-03)
#> analysis_stratum: 2 levels
#> Counts: "none"
#> Interviews: "none"
#> Sections: "none"

If you want to collapse multiple baseline labels into one analysis bucket (for example, treat all non-special days as "regular"), do that in the calendar you pass to creel_design(). The key rule is that the analysis strata must be derived from the same resolved day-level assignments the scheduler used.

Sparse special strata remain visible in analysis. If a special stratum has too few sampled PSUs for variance estimation, estimate_effort() or the product estimators abort with an explicit single-PSU diagnostic naming the stratum. If the stratum is merely small rather than impossible, tidycreel emits a warning about unstable variance instead of failing silently.

Count Period Scheduling vs. Interview Scheduling

In standard access-point creel surveys, interviews happen during count periods — the same field crew that counts anglers also approaches them for interviews. The count schedule therefore governs both collection activities.

There is no separate “interview schedule.” The set of available interviews is the set of interviews collected on sampled count-period days.

Bus-Route Scheduling

For bus-route designs, generate_bus_schedule() converts a count schedule and a site-level sampling frame into a complete bus-route sampling frame with inclusion probabilities. Each site’s probability of being visited in a given period is:

$\pi_i = p_{site} \times p_{period} \quad \text{where} \quad p_{period} = \frac{\text{crew}}{n_{circuits}}$

# Site-level selection probabilities (must sum to 1.0)
site_frame <- data.frame(
  site_id = c("North Bay", "South Cove", "Dock Area", "Main Channel"),
  p_site  = c(0.35, 0.25, 0.25, 0.15)
)

# Build bus-route sampling frame using the count schedule
bus_frame <- generate_bus_schedule(
  schedule       = sched,
  sampling_frame = site_frame,
  site           = site_id,
  p_site         = p_site,
  crew           = 1
)

bus_frame
#> # A tibble: 4 × 4
#>   site_id      p_site p_period inclusion_prob
#>   <chr>         <dbl>    <dbl>          <dbl>
#> 1 North Bay      0.35        1           0.35
#> 2 South Cove     0.25        1           0.25
#> 3 Dock Area      0.25        1           0.25
#> 4 Main Channel   0.15        1           0.15

The inclusion_prob column is the per-site overall inclusion probability across the season. This frame passes directly to creel_design(survey_type = "bus_route", ...).

Sampling Rate Guidance

The right sampling intensity depends on the target precision for the survey. As a starting point:

Survey length	Typical weekday rate	Typical weekend rate
Short (< 8 weeks)	0.40–0.60	0.60–0.80
Standard (8–20 weeks)	0.25–0.40	0.40–0.60
Long (> 20 weeks)	0.15–0.25	0.25–0.40

Higher rates on weekends reflect greater angler activity and higher stratum variance. Use creel_n_effort() or creel_power() to select rates based on a CV target rather than rule of thumb.

# Effort sample size to achieve CV ≤ 0.15
# N_h: total days per stratum in a 184-day season
# ybar_h: pilot mean daily effort (angler-hours) per stratum
# s2_h: pilot variance of daily effort per stratum
creel_n_effort(
  cv_target = 0.15,
  N_h = c(weekday = 132, weekend = 52),
  ybar_h = c(weekday = 280, weekend = 550),
  s2_h = c(weekday = 14400, weekend = 32400)
)
#> weekday weekend   total 
#>       6       2       7

Within-Day Count Time Scheduling

Counts are the primary effort estimator in creel surveys: the number of anglers observed during each count window is used to estimate total angler-hours. When those count windows fall in the day matters — clustered or poorly spaced windows can introduce bias into effort estimates. Pollock et al. (1994) describe three strategies for placing count time windows within the survey day: random, systematic, and fixed. generate_count_times() implements all three.

Random Strategy

With the random strategy, n_windows non-overlapping windows are drawn uniformly at random from the available survey hours. window_size sets the duration (minutes) of each window and min_gap enforces a minimum separation (minutes) between consecutive windows to prevent back-to-back counts.

library(tidycreel)
ct_random <- generate_count_times(
  start_time  = "06:00",
  end_time    = "14:00",
  strategy    = "random",
  n_windows   = 4,
  window_size = 30,
  min_gap     = 10,
  seed        = 42
)
ct_random

(no date column to render calendar)

n_windows is the total number of count windows to place in the day, window_size is how long each window lasts in minutes, and min_gap is the minimum idle time between the end of one window and the start of the next.

Systematic Strategy

The systematic strategy uses the Pollock et al. (1994) model: the day is divided into n_windows equal-length strata and one window is placed at a random start within the first stratum; subsequent windows begin at t₁+k, t₁+2k, and so on, where k is the stratum length. This ensures even spacing across the day. Systematic sampling is preferred by many agencies (Colorado CPW 2012) because it avoids the clustered count times that can occur by chance under random placement.

ct_systematic <- generate_count_times(
  start_time  = "06:00",
  end_time    = "14:00",
  strategy    = "systematic",
  n_windows   = 4,
  window_size = 30,
  min_gap     = 10,
  seed        = 42
)
ct_systematic

(no date column to render calendar)

Fixed Strategy

Use the fixed strategy when count time windows are defined by regulation, permit conditions, or a prior-season protocol that must be replicated exactly. Supply a data.frame with start_time and end_time columns and generate_count_times() wraps them in a creel_schedule object without any random placement.

fw <- data.frame(
  start_time = c("07:00", "09:30", "12:00"),
  end_time = c("07:30", "10:00", "12:30"),
  stringsAsFactors = FALSE
)
ct_fixed <- generate_count_times(strategy = "fixed", fixed_windows = fw)
ct_fixed

(no date column to render calendar)

Exporting Count Time Schedules

generate_count_times() returns a creel_schedule object, the same class returned by generate_schedule(). It therefore passes directly to write_schedule() for field printing without any conversion step.

write_schedule(ct_systematic, "count_times_2024.csv")

Combining the Daily Schedule with Count Time Windows

generate_schedule() and generate_count_times() each return a separate creel_schedule. Before field dispatch, biologists need a single table — one row per (date x period x count window) — that tells each crew exactly when to count. attach_count_times() performs the cross-join in one step.

sched <- generate_schedule(
  start_date = "2024-06-01", end_date = "2024-06-07",
  n_periods = 2,
  sampling_rate = c(weekday = 0.5, weekend = 0.8),
  seed = 42
)
ct <- generate_count_times(
  start_time = "06:00", end_time = "14:00",
  strategy = "systematic", n_windows = 3,
  window_size = 30, min_gap = 10,
  seed = 42
)
field_schedule <- attach_count_times(sched, ct)
field_schedule

June 2024

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						WEEKE
WEEKE	WEEKD	04	05	06	WEEKD	08
09	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

The result has nrow(sched) * nrow(ct) rows — every sampled day-period combination appears once per count window. Pass field_schedule directly to write_schedule() to produce a printable field dispatch sheet.

Validating the Design Before the Season

After building a schedule, validate_design() checks whether the proposed sampling intensity is sufficient to meet a target coefficient of variation (CV) — catching under-sampling problems before the season starts rather than discovering them in the post-season analysis. It uses the same pilot mean and variance inputs as creel_n_effort() and compares the proposed day counts against the minimum required sample size per stratum.

report <- validate_design(
  N_h        = c(weekday = 132, weekend = 52),
  ybar_h     = c(weekday = 280, weekend = 550),
  s2_h       = c(weekday = 14400, weekend = 32400),
  n_proposed = c(weekday = 40L, weekend = 26L),
  cv_target  = 0.15
)
report
#> 
#> ── Design Validation Report ────────────────────────────────────────────────────
#> Type: effort
#> ✔ All strata PASSED
#> 
#> ✔ weekday: n=40 >= 6 required (CV 0.006 vs target 0.15)
#> ✔ weekend: n=26 >= 2 required (CV 0.009 vs target 0.15)

report$results shows the per-stratum status (pass / warn / fail), the proposed sample size, the required sample size, and the achieved CV at the proposed intensity. report$passed is TRUE only when all strata pass — use this as a go/no-go gate before printing field schedules.

report$results
#> # A tibble: 2 × 7
#>   stratum status n_proposed n_required cv_actual cv_target message              
#>   <chr>   <chr>       <int>      <int>     <dbl>     <dbl> <chr>                
#> 1 weekday pass           40          6    0.0059      0.15 Proposed n meets or …
#> 2 weekend pass           26          2    0.0089      0.15 Proposed n meets or …

Checking Data Completeness After the Season

check_completeness() runs post-season on a creel_design with survey data attached, flagging missing sampling days and strata with too few interviews to produce reliable estimates. It is the first diagnostic step before running estimators — resolving completeness issues early avoids propagating gaps into effort or CPUE estimates.

data(example_calendar)
data(example_counts)
data(example_interviews)

design <- creel_design(example_calendar, date = date, strata = day_type)
design <- add_counts(design, example_counts)
#> Warning in svydesign.default(ids = psu_formula, strata = strata_formula, : No
#> weights or probabilities supplied, assuming equal probability
design <- add_interviews(design, example_interviews,
  catch        = catch_total,
  effort       = hours_fished,
  trip_status  = trip_status
)
comp <- check_completeness(design)
comp
#> 
#> ── Completeness Report ─────────────────────────────────────────────────────────
#> Survey type: instantaneous | n_min threshold: 10
#> ✖ Completeness issues found
#> 
#> 
#> ── Missing Days ──
#> 
#> ✔ No missing sampling days
#> 
#> ── Low-n Strata (threshold: 10) ──
#> 
#> ! 1 stratum/strata below n_min=10
#> 
#> ── Refusal Rates ──
#> 
#> (not recorded or not applicable)

comp$passed returns TRUE if no issues were found. When FALSE, comp$missing_days identifies dates that appear in the schedule but have no count data, and comp$low_n_strata identifies strata where the interview count is below the minimum threshold (n_min, default 10).

Assembling the Season Summary

season_summary() accepts a named list of creel_estimates objects — the outputs of estimate_effort(), estimate_catch_rate(), and related functions — and joins them into a single wide tibble for reporting or export. It performs no re-estimation; it is purely an assembly step that makes it easy to view all key metrics side by side.

# Run estimators first (see the tidycreel workflow vignette)
effort <- estimate_effort(design)
cpue <- estimate_catch_rate(design)

# Assemble the season summary
summary_tbl <- season_summary(list(effort = effort, cpue = cpue))
summary_tbl$table

The full estimation workflow — building the design, attaching counts and interviews, and running the estimators — is covered in the main tidycreel vignette. season_summary() is the final step that wraps pre-computed results for export or reporting.

write_schedule(summary_tbl$table, "season_2024_summary.xlsx")

write_schedule() accepts any data frame or tibble, so the season summary table exports to CSV or xlsx with a single call.

References

Hoenig, J. M., Robson, D. S., Jones, C. M., and Pollock, K. H. (1993). Scheduling counts in the instantaneous and progressive count methods for estimating sportfishing effort. North American Journal of Fisheries Management, 13, 723–736.
Pollock, K. H., Jones, C. M., and Brown, T. L. (1994). Angler Survey Methods and Their Applications in Fisheries Management. American Fisheries Society, Bethesda, MD.

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						01
WE	03	04	05	06	07	WE
WE	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						01
WE	03	04	05	06	07	WE
WE	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30

Overview

The Count Period Framework

Building a Basic Schedule

May 2024

Visualising Sampled vs. Unsampled Days

Scheduling High-Use or Other Special Periods

July 2027

August 2027

Special Strata vs. Post-Stratification

Three-Period Schedules

June 2024

Reproducibility with seed

Saving and Sharing the Schedule

Linking the Schedule to a Design

Count Period Scheduling vs. Interview Scheduling

Bus-Route Scheduling

Sampling Rate Guidance

Within-Day Count Time Scheduling

Random Strategy

Systematic Strategy

Fixed Strategy

Exporting Count Time Schedules

Combining the Daily Schedule with Count Time Windows

June 2024

Validating the Design Before the Season

Checking Data Completeness After the Season

Assembling the Season Summary

References

Reproducibility with `seed`

Sun	Mon	Tue	Wed	Thu	Fri	Sat
						01
WE	03	04	05	06	07	WE
WE	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30