gbifdb: High Performance Interface to 'GBIF'

A high-performance interface to the Global Biodiversity Information Facility, 'GBIF'. In contrast to 'rgbif', which accesses small subsets of 'GBIF' data through web-based queries to a central server, 'gbifdb' is designed for R users performing large-scale analyses on servers and cloud computing providers, with full support for arbitrary 'SQL' or 'dplyr' operations on the complete 'GBIF' data tables (now over 1 billion records and over a terabyte in size). 'gbifdb' works with a copy of the 'GBIF' data in 'parquet' format. This copy is readily available in commercial computing clouds such as the Amazon Open Data portal and the Microsoft Planetary Computer; it can be queried remotely without downloading, or downloaded to any server with suitable bandwidth and storage space. The high-performance techniques for local and remote access are described in <https://duckdb.org/why_duckdb> and <https://arrow.apache.org/docs/r/articles/fs.html>, respectively.
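As a rough illustration of the 'dplyr' workflow described above, the sketch below defines a small summary function and applies it to a remote 'GBIF' table. The helper name count_by_class_year is hypothetical and introduced only for this example; the column names (phylum, class, year) are assumed to match the 'GBIF' occurrence schema, and gbif_remote() is assumed to be called with its defaults. The remote query runs only in an interactive session with 'gbifdb' installed and network access available.

```r
library(dplyr)

# Hypothetical helper (not part of gbifdb): count occurrence records
# per class and year for chordates observed after 1990.
# `gbif` may be a lazy remote table or an ordinary data frame.
count_by_class_year <- function(gbif) {
  gbif %>%
    filter(phylum == "Chordata", year > 1990) %>%
    count(class, year)
}

# Remote usage: guarded so that nothing is fetched in batch mode.
if (interactive() && requireNamespace("gbifdb", quietly = TRUE)) {
  gbif <- gbifdb::gbif_remote()   # lazy connection, no download
  result <- count_by_class_year(gbif) %>% collect()
  print(result)
}
```

Because the table is lazy, the filter and count are expected to be pushed down to the query engine, and only the small summary is brought into R by collect(). The same helper also works on an in-memory data frame, which is convenient for trying the logic on a toy sample first.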

Version: 1.0.0
Depends: R (≥ 4.0)
Imports: arrow (≥ 8.0.0), dplyr, duckdbfs
Suggests: spelling, dbplyr, testthat (≥ 3.0.0), covr, knitr, rmarkdown, minioclient
Published: 2023-10-19
Author: Carl Boettiger [aut, cre]
Maintainer: Carl Boettiger <cboettig at gmail.com>
BugReports: https://github.com/ropensci/gbifdb
License: Apache License (≥ 2)
URL: https://docs.ropensci.org/gbifdb/, https://github.com/ropensci/gbifdb
NeedsCompilation: no
Language: en-US
Materials: README NEWS
CRAN checks: gbifdb results

Documentation:

Reference manual: gbifdb.pdf
Vignettes: Intro to gbifdb

Downloads:

Package source: gbifdb_1.0.0.tar.gz
Windows binaries: r-devel: gbifdb_1.0.0.zip, r-release: gbifdb_1.0.0.zip, r-oldrel: gbifdb_1.0.0.zip
macOS binaries: r-release (arm64): gbifdb_1.0.0.tgz, r-oldrel (arm64): gbifdb_1.0.0.tgz, r-release (x86_64): gbifdb_1.0.0.tgz
Old sources: gbifdb archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=gbifdb to link to this page.