API Reference - Intake Documentation
API Reference
User Functions
intake.readers.datatypes.recommend ([url, ...]) Show which Intake data types can apply to the given details
intake.readers.convert.auto_pipeline (url[, ...]) Create pipeline from given URL to desired output
intake.readers.convert.path (start, end[, ...]) Find possible conversion paths from start to end
get(key, default=None)
Return the value for key if key is in the dictionary, else default.
load(fn=None)
load_env()
reset()
set(update_dict=None, **kw)
values: dict
Examples
>>> intake.conf.set(myval=5)
>>> intake.conf.set(intake.readers.utils.nested_keys_to_dict({"deep.2.key": True}))
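The helper in the example turns dotted keys into nested dictionaries. A minimal stdlib sketch of that idea (illustrative only; intake's actual `nested_keys_to_dict` may handle more cases, such as numeric list indices):

```python
def dotted_to_nested(flat: dict) -> dict:
    """Turn {"a.b.c": 1} into {"a": {"b": {"c": 1}}} (illustrative sketch)."""
    out: dict = {}
    for dotted, value in flat.items():
        node = out
        *parents, leaf = dotted.split(".")
        for part in parents:
            # walk/create intermediate dicts for each dotted segment
            node = node.setdefault(part, {})
        node[leaf] = value
    return out

dotted_to_nested({"deep.2.key": True})
```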
Show which Intake data types can apply to the given details

url:
    Location of data
mime: str
contents: bool | None
    A small number of bytes from the file head, for seeking magic bytes.
    If True, fetch these bytes from the given URL/storage_options and use
    them; if None, only fetch bytes when there is no match by mime type or
    path; if False, don't fetch at all.
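Magic-byte matching of the kind described above can be sketched as follows; the table and function here are illustrative assumptions, not intake's actual registry:

```python
# Hypothetical magic-byte table; real data types register their own patterns.
MAGIC = {
    b"\x89PNG": "PNG",
    b"II*\x00": "TIFF",   # little-endian TIFF
    b"MM\x00*": "TIFF",   # big-endian TIFF
}

def recommend_by_head(head: bytes) -> set:
    """Names of data types whose magic bytes match the start of the file."""
    return {name for magic, name in MAGIC.items() if head.startswith(magic)}
```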
Will search for the shortest conversion path from the inferred data-type to the output.

Parameters

url:
    input data, usually a location/URL, but may be a data instance
storage_options:
    if url is a remote str, these are kwargs that fsspec may need to access it
avoid:
    don't consider readers whose names match any of these strings
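The shortest-path search can be pictured as a breadth-first walk over a graph whose nodes are data types and whose edges are converters. A stdlib sketch under that assumption (the graph and type names below are hypothetical, not intake's implementation):

```python
from collections import deque

def shortest_conversion_path(graph: dict, start: str, end: str):
    """graph maps each type to the set of types it can be converted to."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == end:
            return path          # BFS guarantees this is a shortest path
        for nxt in graph.get(path[-1], ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None                  # no conversion path exists

graph = {"CSV": {"pandas"}, "pandas": {"polars", "parquet"}}
shortest_conversion_path(graph, "CSV", "polars")
```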
delete(name, recursive=False)
We do not check whether any other entity in the catalog refers to what is being
deleted, so you can break other entries this way.

Parameters

recursive: bool
classmethod from_dict(data)
storage_options:
kwargs to pass to fsspec for opening the file to read; can pass as storage_options=
or will pick up any unused kwargs for simplicity
get_aliases(entity: str)
Return those alias names that point to the given opaque key
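Assuming aliases are stored as a mapping from alias name to opaque key (an assumption made here purely for illustration), the reverse lookup amounts to:

```python
def get_aliases(aliases: dict, entity: str) -> list:
    # every alias name whose target is the given opaque key
    return [name for name, key in aliases.items() if key == entity]
```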
get_entity(item: str)
tok:
Find and promote the given named parameter, assuming all occurrences are identical

parameter_name:
    If the parameter is found in a reader, it can be promoted to the data it
    depends on. Parameters in a data description can only be promoted to a
    catalog global.
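A hedged sketch of the "identical everywhere" check, with readers modeled as plain parameter dicts (hypothetical; intake's real entities are richer objects):

```python
def promote_parameter(readers: list, parameter_name: str):
    """Remove the parameter from each reader and return its shared value."""
    found = [r[parameter_name] for r in readers if parameter_name in r]
    if not found:
        return None
    if any(v != found[0] for v in found[1:]):
        # promotion is only valid when every occurrence agrees
        raise ValueError(f"{parameter_name!r} differs between readers")
    for r in readers:
        r.pop(parameter_name, None)
    return found[0]  # the promoted, now-shared value
```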
search(expr) → Catalog
The new catalog will have those entries which pass the filter expr, which is an instance
of intake.readers.search.BaseSearch (i.e., has a method like filter(entry) -> bool).
In the special case that expr is just a string, the Text search expression will be used.
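The only contract is the `filter(entry) -> bool` method; a minimal hypothetical expression satisfying it might look like:

```python
class ContainsText:
    """Hypothetical search expression: matches entries mentioning a phrase."""

    def __init__(self, text: str):
        self.text = text.lower()

    def filter(self, entry) -> bool:
        # keep the entry if the phrase appears anywhere in its repr
        return self.text in str(entry).lower()
```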
storage_options:
Defines some data: a class and its arguments. This may be loaded in a number of ways
get_kwargs(user_parameters: Optional[dict[str | intake.readers.user_parameters.BaseUserParameter]] = None, **kwargs) → dict[str, Any]
Get set of kwargs for given reader, based on prescription, new args and user
parameters
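The layering of stored arguments, user parameters and call-time overrides can be sketched as a simple dict merge; the precedence order below is an assumption for illustration, and the real method additionally evaluates templated values:

```python
def merge_kwargs(prescribed: dict, user_parameters=None, **kwargs) -> dict:
    merged = dict(prescribed)             # arguments stored in the description
    merged.update(user_parameters or {})  # values supplied via user parameters
    merged.update(kwargs)                 # explicit call-time arguments win
    return merged
```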
This class is typically stored inside Catalogs, and can contain templated arguments which
get evaluated at the time that it is accessed from a Catalog.
check_imports()
Are the packages listed in the “imports” key of the metadata available?
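Such a check can be done without importing anything, via `importlib.util.find_spec`; a sketch assuming metadata is a plain dict (the function name here is illustrative, not intake's):

```python
import importlib.util

def imports_available(metadata: dict) -> bool:
    """True only if every package in metadata["imports"] is importable."""
    return all(
        importlib.util.find_spec(pkg) is not None
        for pkg in metadata.get("imports", ())
    )
```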
classmethod from_dict(data)
Get set of kwargs for given reader, based on prescription, new args and user
parameters
to_cat(name=None)
Show which readers claim to support the given data instance or a superclass
Attempt to construct a reader instance by finding one that matches the function call
Fails for readers that don't define a func, e.g. because the function depends on
the file type or must be a method of a dynamic instance.
Base Classes
contains: set[str] = {}
if using a directory URL, an ls() on that path will contain these things
binary patterns, usually at the file head; each item identifies this data type
property possible_outputs
property possible_readers
structure: set[str] = {}
to_entry()
If neither outtype nor reader is passed, the first importable reader will be picked.
property data
discover(**kwargs)
classmethod doc()
implements: set[intake.readers.datatypes.BaseData] = {}
imports: set[str] = {}
optional_imports: set[str] = {}
other_funcs: set[str] = {}
read(*args, **kwargs)
to_cat(name=None)
to_entry()
Most often, subclasses call a single function on the data, but arbitrarily complex
transforms are possible. This is designed to be one step in a Pipeline.
.run() will be called on the output object from the previous stage; subclasses will
either override that, or just provide a func=.
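The run/func contract described above can be sketched as follows (class and attribute names here are hypothetical, not intake's actual base class):

```python
class Stage:
    """One pipeline step: apply func to the previous stage's output."""

    def __init__(self, func, **kwargs):
        self.func = func
        self.kwargs = kwargs

    def run(self, previous_output):
        # subclasses would either override run() or just supply func=
        return self.func(previous_output, **self.kwargs)

Stage(sorted, reverse=True).run([3, 1, 2])
```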
class intake.readers.namespaces.Namespace(reader)
acts_on: tuple[str] = ()
imports: tuple[str] = ()
class intake.readers.search.SearchBase
coerce(value)
description
set_default(value)
to_dict()
validate(value) → bool
with_default(value)
intake.readers.datatypes.TIFF (url[, ...]) Image format commonly used for large data
Reader Classes
Includes readers, transformers, converters and output classes.
intake.readers.convert.DaskArrayToTileDB (*args)
intake.readers.convert.DaskDFToPandas (*args)
intake.readers.convert.DaskToRay (*args[, ...])
intake.readers.convert.DeltaQueryToDask (*args)
intake.readers.convert.DeltaQueryToDaskGeopandas (*args)
intake.readers.convert.HuggingfaceToRay (*args)
intake.readers.convert.NibabelToNumpy (*args)
intake.readers.convert.NumpyToTileDB (*args)
intake.readers.convert.PandasToGeopandas (*args)
intake.readers.convert.PandasToMetagraph (*args)
intake.readers.convert.PandasToPolars (*args)
intake.readers.convert.PolarsToPandas (*args)
intake.readers.convert.TileDBToNumpy (*args)
intake.readers.convert.TiledNodeToCatalog (*args)
intake.readers.output.DaskArrayToZarr (*args)
intake.readers.output.PandasToFeather (*args)
intake.readers.output.PandasToParquet (*args)
intake.readers.output.XarrayToNetCDF (*args)
intake.readers.readers.AwkwardParquet (*args)
intake.readers.readers.CupyNumpyReader (*args)
intake.readers.readers.CupyTextReader (*args)
intake.readers.readers.DaskAwkwardJSON (*args)
intake.readers.readers.DaskAwkwardParquet (*args)
intake.readers.readers.DaskDeltaLake (*args)
intake.readers.readers.FileExistsReader (*args)
intake.readers.readers.GeoPandasReader (*args)
intake.readers.readers.GeoPandasTabular (*args)
intake.readers.readers.HuggingfaceReader (*args)
intake.readers.readers.KerasImageReader (*args)
intake.readers.readers.KerasModelReader (*args)
intake.readers.readers.NibabelNiftiReader (*args)
intake.readers.readers.PandasFeather (*args)
intake.readers.readers.PandasParquet (*args)
intake.readers.readers.PandasSQLAlchemy (*args)
intake.readers.readers.PolarsDeltaLake (*args)
intake.readers.readers.PolarsFeather (*args)
intake.readers.readers.PolarsIceberg (*args)
intake.readers.readers.PolarsParquet (*args)
intake.readers.readers.PrometheusMetricReader (*args)
intake.readers.readers.RasterIOXarrayReader (*args)
intake.readers.readers.SKImageReader (*args)
intake.readers.readers.SKLearnExampleReader (*args)
intake.readers.readers.SKLearnModelReader (*args)
intake.readers.readers.ScipyMatlabReader (*args)
intake.readers.readers.ScipyMatrixMarketReader (*args)
intake.readers.readers.SparkDataFrame (*args)
intake.readers.readers.SparkDeltaLake (*args)
intake.readers.readers.TFPublicDataset (*args)
intake.readers.readers.TFRecordReader (*args)
intake.readers.readers.TFSQL (*args[, ...])
intake.readers.readers.TileDBDaskReader (*args)
intake.readers.readers.XArrayDatasetReader (*args)
intake.readers.readers.YAMLCatalogReader (*args)
intake.readers.transform.DataFrameColumns (*args)
intake.readers.transform.PysparkColumns (*args)
intake.readers.transform.THREDDSCatToMergedDataset (*args)