hue
Open source SQL Query Assistant service for Databases/Warehouseslivy
Livy is an open source REST interface for interacting with Apache Spark from anywhereflume
WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic applications.impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)cm_api
Cloudera Manager API Clientcdh-twitter-example
Example application for analyzing Twitter data using CDH - Flume, Oozie, Hivecloudera-playbook
Cloudera deployment automation with Ansiblecm_ext
Cloudera Manager Extensibility Tools and Documentation.flink-tutorials
impala-tpcds-kit
TPC-DS Kit for Impalakitten
The fast and fun way to write YARN applications.cloudera-scripts-for-log4j
Scripts for addressing log4j zero day security issuekudu-examples
Example code for Kudupython-ngrams
clusterdock
hs2client
C++ native client for Impala and Hive, with Python / pandas bindingsimpala-udf-samples
Sample UDF and UDAs for Impala.director-scripts
Cloudera Director sample codecm_csds
A collection of Custom Service Descriptorsbigtop
Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a community around the packaging and interoperability testing of Hadoop-related projects. This includes testing at various levels (packaging, platform, runtime, upgrade, etc...) developed by a community with a focus on the system as a whole, rather than individual projects.CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data
cdh-package
ades
An analysis of adverse drug event data using Hadoop, R, and Gephikafka-examples
Kafka Examples repository.mapreduce-tutorial
llama
Llama - Low Latency Application MAsterseismichadoop
System for performing seismic data processing on a Hadoop cluster.CML_AMP_Anomaly_Detection
Apply modern, deep learning techniques for anomaly detection to identify network intrusions.mahout
parquet-examples
Example programs and scripts for accessing parquet filesdist_test
Impala
Real-time Query for Hadoop; mirror of Apache Impalanative-toolchain
emailarchive
Hadoop for archiving emaildbt-impala
A dbt adapter for Apache Impala & Cloudera Data Platformcdsw-training
Example Python and R code for Cloudera Data Science Workbench trainingnavigator-sdk
Navigator SDKdbt-hive
The dbt-hive adapter allows you to use dbt with Apache Hive and Cloudera Data Platform.director-sdk
Cloudera Director API clientsthrift_sasl
Thrift SASL module that implements TSaslClientTransporttutorial-assets
Assets used in Cloudera Tutorialscommunity-ml-runtimes
squeasel
python-sasl
Python wrapper for Cyrus SASLcod-examples
cod-examplessqoop2
CML_AMP_Explainability_LIME_SHAP
Learn how to explain ML models using LIME and SHAP.CML_AMP_Few-Shot_Text_Classification
Perform topic classification on news articles in several limited-labeled data regimes.earthquake
cmlextensions
Added functionality to the cml python packageml-runtimes
CML_AMP_Image_Analysis
Build a semantic search application with deep learning models.cloudera-airflow-plugins
CML_AMP_Continuous_Model_Monitoring
Demonstration of how to perform continuous model monitoring on CML using Model Metrics and Evidently.ai dashboardsstrata-tutorial-2016-nyc
cdp-sdk-java
Cloudera CDP SDK for Javadirector-aws-plugin
Cloudera Director - Amazon Web Services integrationlogredactor
CML_AMP_Churn_Prediction
Build an scikit-learn model to predict churn using customer telco data.phoenix
phoenixdbt-impala-example
A demo project for dbt-impala adapter for dbtpoisson_sampling
cml-training
Example Python and R code for Cloudera Machine Learning (CML) trainingApplied-ML-Prototypes
director-google-plugin
Cloudera Director - Google Cloud Platform integrationcdpcli
CDP command line interface (CLI)cdp-dev-docs
cdp-dev-docsCML_AMP_Canceled_Flight_Prediction
Perform analytics on a large airline dataset with Spark and build an XGBoost model to predict flight cancellations.CML_AMP_Structural_Time_Series
Applying a structural time series approach to California hourly electricity demand data.director-spi
Cloudera Director Service Provider InterfaceCML_AMP_Question_Answering
Explore an emerging NLP capability with WikiQA, an automated question answering system built on top of Wikipedia.CML_AMP_Intelligent-QA-Chatbot-with-NiFi-Pinecone-and-Llama2
The prototype deploys an Application in CML using a Llama2 model from Hugging Face to answer questions augmented with knowledge extracted from the website. This prototype introduces Pinecone as a database for storing vectors for semantic search.dbt-hive-example
A sample project for dbt-hive adapter with Cloudera Data Platformterraform-provider-cdp
terraform-provider-cdpcmlutils
crcutil
datafu
flink-basic-auth-handler
flink-basic-auth-handlerpartner-engineering
Cloudera Partner Engineering Toolscybersec
cdpcurl
Curl like tool with CDP request signing.CML_AMP_MLFlow_Tracking
Experiment tracking with MLFlow.hcatalog-examples
Sample code for reading and writing tables with hcatalogCML_AMP_Dask_on_CML
CML_AMP_Dask_on_CMLCML_AMP_Streamlit_on_CML
Demonstration of how to use Streamlit as a CML Application.CML_AMP_Video_Classification
Demonstration of how to perform video classification using pre-trained TensorFlow models.opdb-docker
github-jira-gateway
A Grails app to serve as a gateway between an internal GitHub Enterprise server and an external JIRA serverblog-eclipse
CML_llm-hol
CML_AMP_SpaCy_Entity_Extraction
A Jupyter notebook demonstrating entity extraction on headlines with SpaCy.flink-kerberos-auth-handler
flink-kerberos-auth-handlerCML_AMP_Object_Detection_Inference
Interact with a blog-style Streamlit application to visually unpack the inference workflow of a modern, single-stage object detector.dbt-spark-cde-example
CML_AMP_Intelligent_Writing_Assistance
CML_AMP_Intelligent_Writing_Assistancedbt-spark-livy-example
dbt-spark-livy-exampleCML_AMP_LLM_Fine_Tuning_Studio
CML_AMP_APIv2
Demonstration of how to use the CML API to interact with CML.director-azure-plugin
Cloudera Director - Microsoft Azure Integrationobservability
Cloudera Observability related artifacts including Grafana charts and Alert definitionsLove Open Source and this site? Check out how you can help us