baidu/Youtube-8M

Language: Python
License: Apache License 2.0
Created over 7 years ago
Updated almost 2 years ago

PaddlePaddle models for Youtube-8M Video Understanding Challenge

Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

By Fu Li, Chuang Gan, Xiao Liu, Yunlong Bian, Xiang Long, Yandong Li, Zhichao Li, Jie Zhou, Shilei Wen (Baidu IDL & Tsinghua University)

Table of Contents

Introduction
Usage
Results
Citation

Introduction

This repository contains the data providers and model configurations for three temporal modeling approaches (the fast-forward sequence model, the two-stream sequence model, and temporal residual neural networks) described in the paper "Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding" (xxx). These are the model configurations used in the Google Cloud & YouTube-8M Video Understanding Challenge (https://www.kaggle.com/c/youtube8m/leaderboard).

Usage

Requires PaddlePaddle 0.9.0 (https://github.com/PaddlePaddle/Paddle) and Python 2.7.

Model Training:

# Path to the model configuration file to train (placeholder).
cfg=your_config_file
paddle_trainer \
    --config=$cfg \
    --save_dir=./models \
    --trainer_count=4 \
    --log_period=20 \
    --num_passes=100 \
    --use_gpu=true \
    --test_period=0 \
    --show_parameter_stats_period=100

Model Testing:

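# Path to the configuration file of the model to evaluate (placeholder).
# --init_model_path below names the saved pass directory to load.
cfg=your_config_file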
paddle_trainer \
    --config=$cfg \
    --use_gpu=true \
    --gpu_id=0 \
    --trainer_count=1 \
    --job=test \
    --init_model_path=pass-00000 \
    --predict_output_dir=output \
    --log_period=20
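
The test command above loads parameters from a pass directory such as pass-00000. The following is a minimal sketch of how the most recent pass directory could be located before testing. It assumes that training leaves one directory named pass-NNNNN per pass under the --save_dir location; the helper name latest_pass_dir is hypothetical and not part of this repository or of PaddlePaddle.

```python
import os
import re

# Assumed naming scheme for saved passes: "pass-" followed by five digits.
PASS_DIR = re.compile(r"^pass-(\d{5})$")


def latest_pass_dir(save_dir):
    """Return the path of the highest-numbered pass directory, or None.

    Hypothetical helper: it only inspects directory names and does not
    check that the saved parameters are complete.
    """
    candidates = []
    for name in os.listdir(save_dir):
        match = PASS_DIR.match(name)
        if match and os.path.isdir(os.path.join(save_dir, name)):
            candidates.append((int(match.group(1)), name))
    if not candidates:
        return None
    # max() compares the pass number first, so the latest pass wins.
    return os.path.join(save_dir, max(candidates)[1])
```

The returned path could then be passed as the value of --init_model_path in the test command.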

Results

Model	GAP@20
Temporal CNN	0.80889
Two-stream LSTM	0.82172
Two-stream GRU	0.82366
Fast-forward LSTM	0.81885
Fast-forward GRU	0.81970
Fast-forward LSTM (depth7)	0.82750
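
GAP@20 is the Global Average Precision over the top 20 predictions per video, the metric used on the challenge leaderboard. The sketch below illustrates one way to compute it, assuming the usual definition: keep each video's 20 highest-scoring labels, pool them across all videos, sort by confidence, and average the precision at each correct prediction over the total number of ground-truth positives. The function name gap_at_k and the input layout are illustrative assumptions; this is not the evaluation code used to produce the numbers above.

```python
from __future__ import division  # keeps the ratios below floating-point on Python 2.7


def gap_at_k(scores, labels, k=20):
    """Global Average Precision at k (illustrative sketch).

    scores: list of per-video score lists, one score per class.
    labels: list of per-video 0/1 lists aligned with scores.
    """
    pooled = []
    total_positives = 0
    for video_scores, video_labels in zip(scores, labels):
        total_positives += sum(video_labels)
        # Keep only the k most confident predictions for this video.
        ranked = sorted(zip(video_scores, video_labels),
                        key=lambda pair: pair[0], reverse=True)
        pooled.extend(ranked[:k])
    if total_positives == 0:
        return 0.0
    # Rank the pooled predictions of all videos by confidence.
    pooled.sort(key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(pooled, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / total_positives
```

Because the denominator counts every ground-truth positive, a correct label that falls outside a video's top k lowers the score even though it never enters the pooled ranking.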

Citation
