Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
-
beam
Apache Beam is a unified programming model for Batch and Streaming
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
logging-log4j-site
Apache log4j web site
-
incubator-tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
-
trafficcontrol
Apache Traffic Control is an Open Source implementation of a Content Delivery Network
-
trafficserver
Apache Traffic Serverβ’ is a fast, scalable and extensible HTTP/1.1 and HTTP/2 compliant caching proxy server.
-
-
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
-
incubator-datasketches-website
Website for DataSketches.
-
incubator-superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communicationβ¦
-
lucene-solr
Apache Lucene and Solr open-source search software
-
-
-
incubator-nuttx
Apache NuttX is a mature, real-time embedded operating system (RTOS)
-
incubator-annotator
Apache Annotator provides annotation enabling code for browsers, servers, and humans.