  • Stars: 245
  • Rank 165,304 (Top 4 %)
  • Language: Rust
  • License: MIT License
  • Created over 5 years ago
  • Updated 10 months ago

infer

Small crate to infer file and MIME type by checking the magic number signature.

An adaptation of the filetype Go package, ported to Rust.

Does not require a magic file database (i.e. /etc/magic).
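
To illustrate what checking a magic number signature means, here is a minimal sketch that uses only the standard library. The `Signature` struct, the `SIGNATURES` table and the `sniff` function are hypothetical names chosen for this illustration and are not part of this crate's API. The table holds a few well-known signatures and is far from complete.

```rust
/// One entry in a hypothetical signature table: extension, MIME type, magic bytes.
struct Signature {
    extension: &'static str,
    mime_type: &'static str,
    magic: &'static [u8],
}

// Illustrative table only (an assumption for this sketch). A real detector
// covers many more formats, and some formats need checks at other offsets.
const SIGNATURES: &[Signature] = &[
    Signature { extension: "jpg", mime_type: "image/jpeg", magic: &[0xFF, 0xD8, 0xFF] },
    Signature { extension: "png", mime_type: "image/png", magic: &[0x89, b'P', b'N', b'G'] },
    Signature { extension: "gif", mime_type: "image/gif", magic: b"GIF8" },
    Signature { extension: "pdf", mime_type: "application/pdf", magic: b"%PDF" },
];

/// Returns the first table entry whose magic bytes are a prefix of `buf`, if any.
fn sniff(buf: &[u8]) -> Option<&'static Signature> {
    SIGNATURES.iter().find(|sig| buf.starts_with(sig.magic))
}
```

Because the signatures are compiled into the program, nothing has to be read from a system-wide database at run time.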

Features

  • Supports a wide range of file types
  • Provides file extension and MIME type
  • File discovery by extension or MIME type
  • File discovery by class (image, video, audio...)
  • Supports custom new types and matchers

Installation

This crate works with Cargo and is on crates.io. Add it to your Cargo.toml like so:

[dependencies]
infer = "0.3"

If you do not need custom matchers or file type detection from a file path, you can make this crate even lighter by importing it with no default features, like so:

[dependencies]
infer = { version = "0.3", default-features = false }

no_std and no_alloc support

This crate supports no_std and no_alloc environments. std support is enabled by default, but you can disable it by importing the crate with no default features, making it depend only on the Rust core library.

alloc has to be enabled to be able to use custom file matchers.

std has to be enabled to be able to get the file type from a file given the file path.
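
The split between buffer-based and path-based detection can be sketched as follows. This is an illustration under stated assumptions, not the crate's implementation: `read_header`, `is_jpeg` and `is_jpeg_file` are hypothetical helpers, and the 16-byte limit is an arbitrary choice. The buffer check touches only a byte slice, while the path variant needs file I/O and a growable buffer from std.

```rust
use std::fs::File;
use std::io::{self, Read};
use std::path::Path;

/// Reads at most `limit` bytes from the start of the file at `path`.
/// This part relies on `std` (file I/O) and on heap allocation (`Vec`).
fn read_header<P: AsRef<Path>>(path: P, limit: u64) -> io::Result<Vec<u8>> {
    let file = File::open(path)?;
    let mut header = Vec::new();
    file.take(limit).read_to_end(&mut header)?;
    Ok(header)
}

/// Hypothetical stand-in for a buffer-based matcher; it only inspects a slice.
fn is_jpeg(buf: &[u8]) -> bool {
    buf.starts_with(&[0xFF, 0xD8, 0xFF])
}

/// Path-based check layered on top of the buffer-based one.
fn is_jpeg_file<P: AsRef<Path>>(path: P) -> io::Result<bool> {
    Ok(is_jpeg(&read_header(path, 16)?))
}
```

Only the leading bytes are read, so the cost does not grow with the size of the file.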

Examples

Most operations can be done via top-level functions, but they are also available through the Infer struct, which must be used when dealing with custom matchers.

Get the type of a buffer

let buf = [0xFF, 0xD8, 0xFF, 0xAA];
let kind = infer::get(&buf).expect("file type is known");

assert_eq!(kind.mime_type(), "image/jpeg");
assert_eq!(kind.extension(), "jpg");

Check file type by path

let kind = infer::get_from_path("testdata/sample.jpg")
    .expect("file read successfully")
    .expect("file type is known");

assert_eq!(kind.mime_type(), "image/jpeg");
assert_eq!(kind.extension(), "jpg");

Check for specific type

let buf = [0xFF, 0xD8, 0xFF, 0xAA];
assert!(infer::image::is_jpeg(&buf));

Check for specific type class

let buf = [0xFF, 0xD8, 0xFF, 0xAA];
assert!(infer::is_image(&buf));
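
Conceptually, a class check like the one above can be thought of as "does any matcher of this class accept the buffer". The sketch below shows that idea with three hypothetical matchers and a hypothetical `IMAGE_MATCHERS` table. It is an assumption about one reasonable design, not a description of how the crate implements `is_image`.

```rust
// Hypothetical per-format matchers based on well-known signatures.
fn is_jpeg(buf: &[u8]) -> bool {
    buf.starts_with(&[0xFF, 0xD8, 0xFF])
}

fn is_png(buf: &[u8]) -> bool {
    buf.starts_with(&[0x89, b'P', b'N', b'G'])
}

fn is_gif(buf: &[u8]) -> bool {
    buf.starts_with(b"GIF8")
}

// All matchers that belong to the (illustrative) image class.
const IMAGE_MATCHERS: &[fn(&[u8]) -> bool] = &[is_jpeg, is_png, is_gif];

/// True if any matcher in the image class accepts the buffer.
fn is_image(buf: &[u8]) -> bool {
    IMAGE_MATCHERS.iter().any(|matcher| matcher(buf))
}
```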

Add a custom file type matcher

fn custom_matcher(buf: &[u8]) -> bool {
    buf.len() >= 3 && buf[0] == 0x10 && buf[1] == 0x11 && buf[2] == 0x12
}

let mut info = infer::Infer::new();
info.add("custom/foo", "foo", custom_matcher);

let buf = [0x10, 0x11, 0x12, 0x13];
let kind = info.get(&buf).expect("file type is known");

assert_eq!(kind.mime_type(), "custom/foo");
assert_eq!(kind.extension(), "foo");
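
As a rough mental model of why custom matchers need alloc, the sketch below keeps user-supplied matchers in a growable list. `Registry` and the `Matcher` alias are hypothetical names for this illustration and are not the crate's types. The method names `add` and `get` are chosen only to echo the example above.

```rust
/// Signature of a matcher: inspects the leading bytes and reports a match.
type Matcher = fn(&[u8]) -> bool;

/// Hypothetical registry of user-supplied matchers (not the crate's own type).
struct Registry {
    // A growable list is what makes this kind of feature need an allocator.
    entries: Vec<(&'static str, &'static str, Matcher)>,
}

impl Registry {
    fn new() -> Self {
        Registry { entries: Vec::new() }
    }

    /// Registers a matcher with the MIME type and extension it should report.
    fn add(&mut self, mime_type: &'static str, extension: &'static str, matcher: Matcher) {
        self.entries.push((mime_type, extension, matcher));
    }

    /// Returns `(mime_type, extension)` of the first matcher that accepts `buf`.
    fn get(&self, buf: &[u8]) -> Option<(&'static str, &'static str)> {
        self.entries
            .iter()
            .find(|(_, _, matcher)| matcher(buf))
            .map(|(mime_type, extension, _)| (*mime_type, *extension))
    }
}

/// Same check as the example matcher above, written with `starts_with`.
fn custom_matcher(buf: &[u8]) -> bool {
    buf.starts_with(&[0x10, 0x11, 0x12])
}
```

Matchers are tried in registration order in this sketch, so an earlier entry wins when two matchers accept the same buffer.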

Supported types

Image

  • jpg - image/jpeg
  • png - image/png
  • gif - image/gif
  • webp - image/webp
  • cr2 - image/x-canon-cr2
  • tif - image/tiff
  • bmp - image/bmp
  • heif - image/heif
  • avif - image/avif
  • jxr - image/vnd.ms-photo
  • psd - image/vnd.adobe.photoshop
  • ico - image/vnd.microsoft.icon
  • ora - image/openraster

Video

  • mp4 - video/mp4
  • m4v - video/x-m4v
  • mkv - video/x-matroska
  • webm - video/webm
  • mov - video/quicktime
  • avi - video/x-msvideo
  • wmv - video/x-ms-wmv
  • mpg - video/mpeg
  • flv - video/x-flv

Audio

  • mid - audio/midi
  • mp3 - audio/mpeg
  • m4a - audio/m4a
  • ogg - audio/ogg
  • flac - audio/x-flac
  • wav - audio/x-wav
  • amr - audio/amr
  • aac - audio/aac
  • aiff - audio/x-aiff
  • dsf - audio/x-dsf
  • ape - audio/x-ape

Archive

  • epub - application/epub+zip
  • zip - application/zip
  • tar - application/x-tar
  • rar - application/vnd.rar
  • gz - application/gzip
  • bz2 - application/x-bzip2
  • 7z - application/x-7z-compressed
  • xz - application/x-xz
  • pdf - application/pdf
  • swf - application/x-shockwave-flash
  • rtf - application/rtf
  • eot - application/octet-stream
  • ps - application/postscript
  • sqlite - application/vnd.sqlite3
  • nes - application/x-nintendo-nes-rom
  • crx - application/x-google-chrome-extension
  • cab - application/vnd.ms-cab-compressed
  • deb - application/vnd.debian.binary-package
  • ar - application/x-unix-archive
  • Z - application/x-compress
  • lz - application/x-lzip
  • rpm - application/x-rpm
  • dcm - application/dicom
  • zst - application/zstd
  • msi - application/x-ole-storage
  • cpio - application/x-cpio

Book

  • epub - application/epub+zip
  • mobi - application/x-mobipocket-ebook

Documents

  • doc - application/msword
  • docx - application/vnd.openxmlformats-officedocument.wordprocessingml.document
  • xls - application/vnd.ms-excel
  • xlsx - application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
  • ppt - application/vnd.ms-powerpoint
  • pptx - application/vnd.openxmlformats-officedocument.presentationml.presentation
  • odt - application/vnd.oasis.opendocument.text
  • ods - application/vnd.oasis.opendocument.spreadsheet
  • odp - application/vnd.oasis.opendocument.presentation

Font

  • woff - application/font-woff
  • woff2 - application/font-woff
  • ttf - application/font-sfnt
  • otf - application/font-sfnt

Application

  • wasm - application/wasm
  • exe - application/vnd.microsoft.portable-executable
  • dll - application/vnd.microsoft.portable-executable
  • elf - application/x-executable
  • bc - application/llvm
  • mach - application/x-mach-binary
  • class - application/java
  • dex - application/vnd.android.dex
  • dey - application/vnd.android.dey
  • der - application/x-x509-ca-cert
  • obj - application/x-executable

Known Issues

  • exe and dll files share the same magic number, so they cannot be told apart from the signature alone. exe is returned for both.
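
A small sketch of why the signature alone is ambiguous: both kinds of file are Portable Executable images that begin with the DOS header bytes "MZ". The function names below are hypothetical and only mirror the documented behaviour. They are not the crate's implementation.

```rust
/// Both .exe and .dll files begin with the DOS header bytes "MZ",
/// so a prefix check cannot tell them apart.
fn is_portable_executable(buf: &[u8]) -> bool {
    buf.starts_with(b"MZ")
}

/// Hypothetical helper mirroring the documented behaviour:
/// report "exe" for any buffer that carries the shared signature.
fn pe_extension(buf: &[u8]) -> Option<&'static str> {
    if is_portable_executable(buf) {
        Some("exe")
    } else {
        None
    }
}
```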

License

MIT
