Package: quak 0.1.0

quak: Query 'Azure Data Lake Storage Gen2' with 'DuckDB'

Provides convenience utilities for using 'DuckDB' directly over datasets stored in 'Azure Data Lake Storage Gen2' (ADLS Gen2, 'abfss://'). Opens connections configured for Azure-backed 'Delta Lake' and 'Parquet' data, registers Azure credentials as 'DuckDB' secrets, and supports optional repository mirrors for restricted networks. Integrates well with 'DBI' for SQL workflows and with 'dplyr' and 'dbplyr' for lazy table queries.
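As a rough illustration of the kind of DuckDB statements such a workflow rests on, the sketch below builds the SQL by hand with base R. It does not use quak's own functions, whose arguments are not shown on this page. The helper names (`sql_quote`, `chain_secret_sql`, `parquet_query_sql`), the secret name, the storage account, and the container are all hypothetical. The `CREATE SECRET` form follows DuckDB's documented Azure `credential_chain` provider.

```r
# Hypothetical helpers (not part of quak): build DuckDB SQL strings by hand.

# Quote a value as a SQL string literal, doubling embedded single quotes.
sql_quote <- function(x) {
  paste0("'", gsub("'", "''", x, fixed = TRUE), "'")
}

# SQL that registers an Azure credential-chain secret in DuckDB.
chain_secret_sql <- function(name, account) {
  sprintf(
    "CREATE OR REPLACE SECRET %s (TYPE azure, PROVIDER credential_chain, ACCOUNT_NAME %s)",
    name, sql_quote(account)
  )
}

# SQL that previews the first n rows of a Parquet dataset.
parquet_query_sql <- function(path, n = 5L) {
  sprintf("SELECT * FROM read_parquet(%s) LIMIT %d", sql_quote(path), as.integer(n))
}

# Placeholder account and container names (assumptions for illustration).
secret_sql <- chain_secret_sql("adls_secret", "mystorageaccount")
query_sql <- parquet_query_sql(
  "abfss://mycontainer@mystorageaccount.dfs.core.windows.net/data/*.parquet"
)

# Executing the statements needs the DBI and duckdb packages, network access,
# and valid Azure credentials, so that part is guarded here.
if (FALSE) {
  con <- DBI::dbConnect(duckdb::duckdb())
  DBI::dbExecute(con, "INSTALL azure")
  DBI::dbExecute(con, "LOAD azure")
  DBI::dbExecute(con, secret_sql)
  print(DBI::dbGetQuery(con, query_sql))
  DBI::dbDisconnect(con, shutdown = TRUE)
}
```

The package presumably wraps steps like these behind `az_conn()` and the `az_set_*_secret()` functions; consult the help pages for the actual arguments.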

Authors: Pedro Baltazar [aut, cre, cph]

# Install 'quak' in R:
install.packages('quak', repos = c('https://pedrobtz.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker: https://github.com/pedrobtz/quak/issues

Topics: azure-storage, duckdb

2.70 score, 4 scripts, 39 exports, 7 dependencies

Last updated from: 6a39eb893a. Checks: 9 OK. Indexed: yes.

Target                 Result  Time
linux-devel-x86_64     OK      156
source / vignettes     OK      199
linux-release-x86_64   OK      161
macos-release-arm64    OK      117
macos-oldrel-arm64     OK      95
windows-devel          OK      129
windows-release        OK      87
windows-oldrel         OK      96
wasm-release           OK      109

Exports: az_conn, az_conn_settings, az_copy_to, az_default_scope, az_delta_files, az_exists, az_glimpse, az_glob, az_list_secrets, az_schema, az_set_chain_secret, az_set_sp_secret, az_set_token_secret, az_tune, az_write_parquet, conn_setting, ext_cache, ext_cache_path, ext_dir, ext_install, ext_install_local, ext_is_installed, ext_list_available, ext_list_installed, ext_load, ext_set_dir, ext_uninstall, load_csv, load_dataset, load_delta, load_json, load_parquet, quak_options, repo_set_urls, repo_urls, tbl_csv, tbl_delta, tbl_json, tbl_parquet

Dependencies: cli, curl, DBI, duckdb, fs, glue, rlang

Readme and manuals

Help Manual

Help page | Topics
Open a DuckDB connection configured for Azure Data Lake Storage Gen2 | az_conn
Get Azure settings from a DuckDB connection | az_conn_settings
Copy data to Azure Data Lake Storage Gen2 | az_copy_to
Get the default Azure OAuth scope | az_default_scope
List files in a Delta table on Azure Data Lake Storage Gen2 | az_delta_files
Check whether data exists at an Azure path | az_exists
Preview an Azure dataset | az_glimpse
List Azure paths matching a glob pattern | az_glob
List Azure secrets registered in DuckDB | az_list_secrets
Inspect a dataset schema without collecting data | az_schema
Register an Azure credential-chain secret | az_set_chain_secret
Register an Azure service-principal secret | az_set_sp_secret
Register an Azure token secret | az_set_token_secret
Tune Azure read settings on a DuckDB connection | az_tune
Write Parquet data to Azure Data Lake Storage Gen2 | az_write_parquet
Collect an Azure-backed lazy tbl | collect.tbl_az
Get or set DuckDB settings | conn_setting
Extension cache | ext_cache
Default DuckDB extension cache directory | ext_cache_path
Find the DuckDB extension folder | ext_dir
Install a DuckDB extension | ext_install
Install a DuckDB extension from a local file | ext_install_local
Check whether a DuckDB extension is installed | ext_is_installed
List all DuckDB core extensions | ext_list_available
List installed DuckDB extensions | ext_list_installed
Load a DuckDB extension, installing it first if necessary | ext_load
Set the DuckDB extension folder | ext_set_dir
Uninstall a DuckDB extension | ext_uninstall
Register a CSV dataset as a view on a DuckDB connection | load_csv
Register a Delta, Parquet, CSV, or JSON dataset on a DuckDB connection | load_dataset
Register a Delta Lake table on a DuckDB connection | load_delta
Register a JSON dataset as a view on a DuckDB connection | load_json
Register a Parquet dataset as a view on a DuckDB connection | load_parquet
Print the quak option registry | print.quak_opts
List all quak options and their current values | quak_options
Set DuckDB extension repository URLs | repo_set_urls
Get DuckDB extension repository URLs | repo_urls
Open a CSV dataset as a lazy dplyr tbl | tbl_csv
Open a Delta Lake table as a lazy dplyr tbl | tbl_delta
Open a JSON dataset as a lazy dplyr tbl | tbl_json
Open a Parquet dataset as a lazy dplyr tbl | tbl_parquet
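
To show what a lazy dplyr tbl over a Parquet dataset looks like in general, here is a minimal sketch using plain 'DBI', 'duckdb', 'dplyr', and 'dbplyr'. It is not quak's implementation and does not call `tbl_parquet()`, whose arguments are not listed on this page. It writes a small local Parquet file from the built-in `mtcars` data so it runs without Azure access. The table name `cars_src` and the temporary file are assumptions made for illustration.

```r
library(DBI)
library(dplyr)

# In-memory DuckDB connection (requires the duckdb and dbplyr packages).
con <- dbConnect(duckdb::duckdb())

# Write a small local Parquet file so the example needs no Azure access.
path <- tempfile(fileext = ".parquet")
dbWriteTable(con, "cars_src", mtcars)
dbExecute(con, sprintf("COPY cars_src TO '%s' (FORMAT parquet)", path))

# Lazy table over the file: nothing is pulled into R until collect().
cars <- tbl(con, sql(sprintf("SELECT * FROM read_parquet('%s')", path)))

# The grouping and counting are translated to SQL and run inside DuckDB.
by_cyl <- cars |>
  group_by(cyl) |>
  summarise(n = n(), .groups = "drop") |>
  arrange(cyl) |>
  collect()

dbDisconnect(con, shutdown = TRUE)
```

With an Azure-configured connection, the same pattern would point `read_parquet()` at an 'abfss://' path instead of a local file.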