Stars
Data product portal created by Dataminded
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
This is a code repository for the corresponding video tutorial. Using React, Node.js, Express & MongoDB you'll learn how to build a Full Stack MERN Application - from start to finish. The App is ca…
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Superset is a Data Visualization and Data Exploration Platform
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Configuration library for JVM languages using HOCON files
re_data - fix data issues before your users & CEO would discover them 😊
Docker file for a minimal effort OpenStreetMap tile server
DebOps - Your Debian-based data center in a box
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…
mcci-catena / arduino-lmic
Forked from things-nyc/arduino-lmic
LoraWAN-MAC-in-C library, adapted to run under the Arduino environment
Hadoop-Unit is a project that allows testing projects that need Hadoop ecosystem components such as Kafka, Solr, HDFS, Hive, HBase, ...
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
This repository contains a make script and instructions on how to set up a local HDFS + Spark + Hive environment.
earthquakesan / docker-hadoop-spark-workbench
Forked from big-data-europe/docker-hadoop-spark-workbench
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
AWX provides a web-based user interface, REST API, and task engine built on top of Ansible. It is one of the upstream projects for Red Hat Ansible Automation Platform.