  • Stars: 672
  • Rank: 67,148 (Top 2 %)
  • Language: HTML
  • License: Apache License 2.0
  • Created about 5 years ago
  • Updated 4 months ago

Repository Details

OpenDS4All project, hosted by LF AI & Data

Success Story - One-of-a-Kind Workshop to University of Liverpool Students

Bootcamp-style OpenDS4All workshop where students could enhance their theoretical knowledge with hands-on, industry-focused experience.

Description

OpenDS4All is a project created to accelerate the creation of data science curricula at academic institutions. While a great deal of online material is available for data science, including online courses, we recognize that the best way for many students to learn (and for many institutions to deliver) content is through a combination of lectures, recitation or flipped classroom activities, and hands-on assignments.

OpenDS4All attempts to fill this important niche. Our goal is to provide recommendations, slide sets, sample Jupyter notebooks, and other materials for creating, customizing, and delivering data science and data engineering education.

The project hosts educational modules that may be used as building blocks for a data science curriculum.

Note: The opends4all-resources link takes you to the OpenDS4All curriculum building blocks, organized by category.

Note: If you adopt all or some of the content, please add your program's details to the ADOPTERS.csv file.

Audience (Instructor and Student)

The initial modules were designed to target a broad, cross-university audience at both the undergraduate and graduate levels. Modules contain instructor notes and comments intended to aid in the delivery of the material; the expectation is that instructors will be generally fluent in basic database and machine learning concepts.

The perspective of the materials largely comes from computer science, with an emphasis on data wrangling and engineering as well as machine learning and validation. However, prior versions of the content have been used to teach students ranging from freshmen to PhD students, across a wide range of fields. The emphasis is largely on core concepts and algorithms with grounding in today's technologies and best practices.

Students are expected to come in with two major prerequisites:

  • Comfort and familiarity with programming in Python (writing small functions, importing and calling library functions, using Python data structures).
  • Familiarity with probability theory and very basic statistical notions.

To some extent, students with a limited background can follow along with this material, but they will likely need to supplement extensively.

How to use

The following taxonomy shows how content is currently organized around categories. This is a living taxonomy that is updated as new content is added to the project.

[Figure: taxonomy]

Each category contains modules, and each module consists of one or more of the following components:

  • instructor notes (Instructor_Notes.md) and guide to files
  • a set of PowerPoint slides (with presenter notes) ending in .pptx
  • companion Jupyter notebooks, for students to see the lecture materials "in context" and to be able to experiment
  • sample quiz materials (where applicable)
  • sample homework assignments (where applicable)
  • additional documentation (where applicable)

Note: The PowerPoint slides are not directly viewable on GitHub. After clicking the link to a set of PowerPoint slides, select the Download button to download and view the slide deck. Two viewable extracts from the slide decks can be seen by clicking on the links below:

There are many ways to interact with this repository:

  • browse the repository in search of content (use the 'Find file' search functionality)
  • download content (PowerPoint slides, Jupyter notebooks, etc.)
  • contribute content (become a contributor to the project)
  • become involved in the day-to-day management of the project (become a committer)
  • provide overall direction and leadership to the project (become a Technical Steering Committee member)

The project's governance principles clarify the different roles and describe the processes for becoming a contributor, a committer, or a TSC member.

Contributing

Anyone can contribute to this repository; learn more at CONTRIBUTING.md. Follow the step-by-step instructions in COMMUNITY-GUIDE.md to submit a module for possible inclusion in the repository.

Governance

OpenDS4All is a project hosted by LF AI & Data. The project has established its own processes for managing day-to-day activities, described in GOVERNANCE.md.

Reporting Issues

To report a problem, you can open an issue. If the issue is sensitive in nature or security-related, please do not report it in the issue tracker; instead, email [email protected].

Contact Us

If you want to contact us, please open an issue and one of the members of the TSC will respond to your request. If you do not feel comfortable opening an issue, email [email protected].

Learn More

If you are interested in collaborating on the project, please open an issue and one of the members of the TSC will respond to your request. If you do not feel comfortable opening an issue, email [email protected].


License: CC BY 4.0, Copyright Contributors to the LF AI & DATA OpenDS4All project.
