Download a DSN table in parallel by year
parallel_dsn_download.RdPartitions a table accessible via an ODBC Data Source Name (DSN) by a year
column and downloads chunks in parallel. Each worker opens its own DSN
connection, downloads its assigned years, and disconnects. This is the
DSN equivalent of parallel_download, which targets Oracle
connections.
Usage
parallel_dsn_download(
table,
year_col,
schema = "newobs",
dsn = "PIRO LOTUS",
years = NULL,
n_cores = NULL
)Arguments
- table
Character. Table name.
- year_col
Character. Name of the year column used for partitioning.
- schema
Character. Schema name. Default: "newobs".
- dsn
Character. ODBC Data Source Name. Default: "PIRO LOTUS".
- years
Integer vector. Years to download. If NULL (default), all available years are queried automatically.
- n_cores
Integer. Number of parallel workers. If NULL (default), determined by
optimal_cores().
Examples
if (FALSE) { # \dontrun{
# Download with automatic year detection and core selection
catch <- parallel_dsn_download(
table = "LDS_CATCH_V",
year_col = "HAULBEGIN_YR"
)
# Download specific years with custom core count
environ <- parallel_dsn_download(
table = "LDS_SET_ENVIRON_V",
year_col = "HAULBEGIN_YR",
years = 2015:2023,
n_cores = 4
)
} # }