Used to execute a SQL query against table(s) in Redivis, using the Redivis SQL query syntax, and read out the results.
Constructors
Examples
# Execute any SQL query and read the results
query = redivis.query("SELECT 1 + 1 AS two, 'foo' AS bar")
query.to_pandas_dataframe()
# two bar
# 0 2 foo
# The query can reference any table on Redivis
query = redivis.query("""
SELECT *
FROM demo.iris_species.iris
WHERE SepalLengthCm > 5
""")
query.to_pandas_dataframe()
# Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species
# 0 33 5.2 4.1 1.5 0.1 Iris-setosa
# ...
# Other methods to read data:
# query.to_arrow_batch_iterator()
# query.to_arrow_dataset()
# query.to_arrow_dataset()
# query.to_geopandas_dataframe()
# query.to_dask_dataframe()
# query.to_polars_lazyframe()
# To simplify table references, execute a query scoped to a dataset or workflow
dataset = redivis.organization("Demo").dataset("CMS 2014 Medicare Data")
query = dataset.query("""
SELECT
hospice_providers.name,
inpatient_charges.drg_definition
-- The tables inpatient_chargers, hospice_providers are assumed to be
-- within the scoped dataset
FROM inpatient_charges
INNER JOIN hospice_providers
ON hospice_providers.provider_id = inpatient_charges.provider_id
""")
# In a notebook, all queries are scoped to the current workflow.
# Additionally, the notebooks source table can simply be referenced as _source_
query = redivis.query("SELECT * FROM _source_ LIMIT 10")
Attributes
properties
Methods
Download all files represented by a file_id variable in the query results to a local directory.
Fetch query metadata. Once called, the properties attribute on the query will be fully populated.
Deprecated. Please use to_[geo]pandas_dataframe instead.
Execute a SQL query within the (either a dataset or workflow).
In a Redivis notebook, the default scope will always be the notebook's workflow, and the notebook's source table can be referenced via the _source_ identifier.
If no default scope is specified, all tables in the query must be fully qualified. Consult the documentation to learn more.
(query_string)
Execute a SQL query scoped to a specific . Tables referenced by the query do not need to be fully qualified, since the table lookup is already scoped to the dataset.
Consult the documentation to learn more.
(query_string)
Execute a SQL query scoped to a specific workflow. Tables referenced by the query do not need to be fully qualified, since the table lookup is already scoped to the dataset.
Consult the documentation to learn more.
A dict containing the . This will always be populated after the query has been created, and can be refreshed by calling query.get()
([path, *, overwrite, ...])
()
([max_results, *, ...])
Return a list of instances for query results containing a file_id variable.
([max_results, *, variables, ...])
Deprecated. Return a list of named tuples referencing the rows of the query results.
Use .to_pydict() instead.
([...])
Return an iterator that yields , for processing the query results in a memory-efficient streaming manner.
([max_results, ...])
Return a for the query results. Data is backed by disk, allowing for larger-than-memory analysis.
([max_results, ...])
Return a with the query results.
([max_results, ...])
([...])
Return a . For working with query results that contain a geography variable.
([max_results, ...])
Return a . Data is backed by disk, allowing for larger-than-memory analysis.
([max_results, ...])
Return a with the query results.
([max_results])
Return a . Data is backed by disk, allowing for larger-than-memory analysis.