• Stars
    star
    187
  • Rank 202,266 (Top 5 %)
  • Language
    C
  • License
    Other
  • Created about 8 years ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

SCTK, the NIST Scoring Toolkit

File: README.md

Date: October 28, 2021

SCTK Version: 2.4.12

This directory contains SCTK, the NIST Scoring Toolkit. SCTK contains the source code, documentation and example data for the following programs:

  • sclite V2.10 - "Score Lite",
  • sc_stats V1.3 - sclite's Statistical Significance Comparison tool
  • rover V0.1 - Recognition Output Voting Error Reduction
  • asclite V1.11 - Multidimensional alignment replacement for sclite

SCTK Basic Installation

To install compile and install sctk, from the main directory type the commands:

% make config
% make all
% make check
% make install
% make doc

The make config command executes the script config.sh in the src/sclite directory and initialized the makefiles in the rest of the src directories.

You can specify installation directory by changing the PREFIX variable in the src/makefile file. By default, all executables will be installed in bin of this directory. Be sure to add the installation directory to your PATH variable in order to access the programs.

The make all command will compile all toolkit executables and libraries. Compilation of rfilter1 may fail if your version of libC is very new because it now includes the strcmp* family of functions. You can use the libC versions, and avoid compilation complaints, by following the instructions in src/rfilter1/makefile.

The make check will perform self tests on the toolkit. Please report any problems to Jon Fiscus at NIST with any bug reports or comments at the email address [email protected]. Please include the version number and any other relevant information such as OS, compiler, etc.

The make install command will copy the executables to $PREFIX (see make config above.)

UNIX-style manual pages and HTML formatted manuals are provided in the doc directory.

Bundled Software

This version of SCTK includes the CMU-Cambridge Statistical Language Modeling Toolkit v2. Included in STCK is the complete source code tree and documentation. The toolkit has been included so that SCLITE can use a language model to compute conditional word weights for word-weighted scoring.

Optional Software/Notes:

sclite: To use the interactive installation for sclite, edit the file src/sclite/config.sh at line 36, set the user_interaction=1 it will ask you during the configuration (make config) if you want to use GNU diff instead of sclite and CMU-Cambridge SLM for sclite. Those choices are not recommended and not used anymore.

sclite has the ability to use GNU's diff utility as an alternate string alignment method. The installer has the choice whether or not to include this ability in the compiled version.

GNU diff: If the use of GNU's diff is desired, you will need GNU's diff utility from their diffutils distribution (version 2.7). If you cannot locate this distribution, a copy is available at ftp://jaguar.ncsl.nist.gov/pub/diffutils-2.7.tar. When you install diff, make sure the executable is installed as gdiff, gnudiff or diffgnu (config.sh expects it to be called one of these).

Special Note to SGI owners: When installing GNU's diff make sure to call it gnudiff or diffgnu. If you use gdiff the executable name may be overshadowed by a pre-existing executable.

utf_filt: utf_filt is using SGML external software and modules like SGMLS perl module and nsgmls program which have been updated and continued: OpenSP (http://sourceforge.net/project/showfiles.php?group_id=2115&package_id=2869). Version v1.5.2 of OpenSP have been tested and working with the current version of utf_filt.

Special Note to Cygwin users: it has been reported that compilation of rfilter1 can fail in some case, please read the OPTIONS part of the rfilter1/makefile and adapt accordingly before retrying compilation.

64 bits Compilation: With big alignments, sctk needs to be compiled in 64 bits. By default, the C/C++ software are compiled in 32 bits with the options (-Os) but can be compiled in 64 bits. To do so, -m64 is added to the CFLAGS variable in src/asclite/core/makefile, src/asclite/test/makefile, src/rfilter1/makefile and src/sclite/makefile.

Example of CFLAGS for OSX 10.4+: -fast -m64 -arch x86_64 -arch ppc64

Known to compile on:

The distribution was initially written in ANSI C on a SUN Workstation. To date, the code has been successfully compiled and tested on the following platforms:

Systems Version
SUN SunOS 4.1.4 using gcc
SUN Solaris SunOS 5.4 using cc
SUN Solaris SunOS 5.5 using gcc 1.0 - 1.2
SGI IRIX 5.3 using cc
SGI IRIX 4.0.3H using gcc
HP 735 using gcc
HP using "cc" 1.0 - 1.2
DEC Alpha running DEC OSF/1 V2.0 "cc -std1"
DEC Alpha running DEC OSF/1 V2.0 "gcc" 1.1
IBM Powerstation running AIX V3.2 using cc 1.0
IBM Powerstation running AIX V3.2 using gcc [*] 1.0
Linux Version 2.0.33 1.0 - 1.2
Cygwin 1.3 - 2.0B2
Linux Version 2.2.17 1.3 - 2.0B2
Linux Version 2.4.20-20-9 [**] 1.3 - 2.1.7
Linux Version 2.6.19.fc5 2.2.0 - 2.2.4
Linux Version 2.6.9-5.EL (gcc 3.4.6-3) 2.2.0 - 2.2.4
Darwin Version 7.4.0 1.3
Darwin Version 8.5.0 (gcc v4.0.1 build 5367) 2.1.2 - 2.3-rc1
Darwin Version 8.11.0 (gcc v4.0.1 build 5367) 2.3-rc2 - 2.4.0
Darwin Version 9.6.0 (gcc v4.0.1 build 5490) 2.3.6 - 2.4.0
RedHat Fedora 9 (gcc v4.3.0-8) 2.3.9 - 2.4.0
Darwin Version 10.8.0 (gcc version 4.2.1 (Apple Inc. build 5666) 2.4.2 - 2.4.8

[*] compiled with warning messages, but successful self test

[**] compiled and ran, but test6, which uses GNU's diff for alignment fails. The use of DIFF for alignment is rarely used and the failure can be safely ignored.

Docker

A Dockerfile is included to build and use SCTK without compiling the codebase for your particular platform.

To build:

docker build -t sctk .

Hypothesis Test

To run sclite on a reference file and hypothesis file, you need to map a volume to your host.

Linux or a Mac:

docker run -it -v $PWD:/var/sctk sctk sclite -i wsj -r ref.txt -h hyp.txt

Windows:

docker run -it -v /myhostpath:/var/sctk sctk sclite -i wsj -r ref.txt -h hyp.txt

sc_stats

To run more advanced features of the SDK, shell into the container with a volume mapped to your local files:

docker run -it -v $PWD:/var/sctk sctk sh
/var/sctk #

Then run sclite piped to sc_stats:

sclite -i wsj -r ref.txt -h hyp1.txt -h hyp2.txt -o sgml stdout | sc_stats -p -t mapsswe -u
sc_stats: 1.3
Beginning Multi-System comparisons and reports
    Performing the Matched Pair Sentence Segment (Word Error) Test
    Printing Unified Statistical Test Reports
        Output written to 'Ensemble.stats.unified'

Successful Completion

Contact

If you have questions, please address them to Jonathan Fiscus at [email protected]

License

See LICENSE.md for details or http://www.nist.gov/open/license.cfm

More Repositories

1

macos_security

macOS Security Compliance Project
YAML
1,603
star
2

800-63-3

Home to public development of NIST Special Publication 800-63-3: Digital Authentication Guidelines
CSS
699
star
3

OSCAL

Open Security Controls Assessment Language (OSCAL)
XSLT
572
star
4

fipy

FiPy is a Finite Volume PDE solver written in Python
Python
430
star
5

jarvis

JARVIS-Tools: an open-source software package for data-driven atomistic materials design. Publications: https://scholar.google.com/citations?user=3w6ej94AAAAJ
Python
279
star
6

jsip

JSIP: Java SIP specification Reference Implementation (moved from java.net)
Java
277
star
7

frvt

Repository for the Face Recognition Vendor Test (FRVT)
C++
259
star
8

trec_eval

Evaluation software used in the Text Retrieval Conference
C
224
star
9

oscal-content

NIST SP 800-53 content and other OSCAL content examples
Shell
218
star
10

alignn

Atomistic Line Graph Neural Network https://scholar.google.com/citations?user=9Q-tNnwAAAAJ&hl=en
Python
192
star
11

SP800-90B_EntropyAssessment

The SP800-90B_EntropyAssessment C++package implements the min-entropy assessment methods included in Special Publication 800-90B.
C++
189
star
12

PrivacyEngCollabSpace

Privacy Engineering Collaboration Space
Python
186
star
13

REFPROP-wrappers

Wrappers around NIST REFPROP for languages such as Python, MATLAB, etc.
Mathematica
160
star
14

ACVP

Industry Working Group on Automated Cryptographic Algorithm Validation
HTML
151
star
15

mobile-threat-catalogue

NIST/NCCoE Mobile Threat Catalogue
HTML
141
star
16

trojai-literature

129
star
17

NFIQ2

Optical live-scan and ink fingerprint image quality assessment tool
C++
127
star
18

MIST

Microscopy Image Stitching Tool
Java
120
star
19

applesec

Draft SP 800-179r1 macOS 10.12 Security project files: draft publication, security settings spreadsheet and Bash script implementation of settings.
Shell
116
star
20

ndn-dpdk

NDN-DPDK: High-Speed Named Data Networking Forwarder
Go
114
star
21

SFA

The NIST STEP File Analyzer and Viewer (SFA) generates a spreadsheet and a visualization from an ISO 10303 Part 21 STEP file.
Tcl
109
star
22

ARIAC

Repository for ARIAC (Agile Robotics for Industrial Automation Competition), consisting of kit building and assembly in a simulated warehouse
C++
104
star
23

NEMO

NEMO is a laboratory logistics web application. Use it to schedule reservations, control tool access, track maintenance issues, and more.
Python
98
star
24

jsfive

A pure javascript HDF5 reader
JavaScript
92
star
25

h5wasm

A WebAssembly HDF5 reader/writer library
C++
81
star
26

pyMCR

pyMCR: Multivariate Curve Resolution for Python
Python
79
star
27

Metrology

Metrology for software; software for metrology
JavaScript
65
star
28

psc-ns3

Public Safety Communication modeling tools based on ns-3
C++
62
star
29

STP2X3D

Translator from STEP format to X3D format
C++
62
star
30

combinatorial-testing-tools

Tools for combinatorial testing developed by the NIST ACTS project
Java
61
star
31

chemnlp

ChemNLP: A Natural Language Processing based Library for Materials Chemistry Text Data
Python
59
star
32

jarvis_leaderboard

Explore State-of-the-Art Materials Design Methods: https://www.nature.com/articles/s41524-024-01259-w
Jupyter Notebook
52
star
33

COSMOSAC

A Benchmark Implementation of COSMO-SAC
HTML
48
star
34

pfhub

The CHiMaD Phase Field Community Website
HTML
48
star
35

Lightweight-Cryptography-Benchmarking

C
48
star
36

SimulatedRadarWaveformGenerator

A software tool that generates simulated radar signals and creates RF datasets for developing and testing machine/deep learning detection algorithms.
MATLAB
47
star
37

REFPROP-cmake

Small repo with CMake build system for building REFPROP shared library
CMake
46
star
38

iheos-toolkit2

XDS Toolkit
Java
44
star
39

OpenSeadragonFiltering

OpenSeadragon filtering plugin
JavaScript
44
star
40

dioptra

Test Software for the Characterization of AI Technologies
Python
43
star
41

pmml_pymcBN

Jupyter Notebook
42
star
42

teqp

A highly efficient, flexible, and accurate implementation of thermodynamic EOS powered by automatic differentiation
C++
42
star
43

ActEV_Scorer

Scoring software for the TRECVID Activities in Extended Video (ActEV) evaluation
Python
41
star
44

HTGS

The Hybrid Task Graph Scheduler API
C++
40
star
45

sctools

Tools for security content automation, baseline tailoring, and overlay development.
HTML
39
star
46

hiperc

High Performance Computing Strategies for Boundary Value Problems
HTML
39
star
47

ocr-pipeline

Convert a corpus of PDF to clean text files on a distributed architecture
Python
38
star
48

OpenSeadragonScalebar

OpenSeadragon scalebar plugin
JavaScript
37
star
49

mosaic

A modular single-molecule analysis interface
Python
37
star
50

oscal-cli

A simple open source command line tool to support common operations over OSCAL content.
Java
37
star
51

ACVP-Server

A repository tracking releases of NIST's ACVP server. See www.github.com/usnistgov/ACVP for the protocol.
C#
36
star
52

vulntology

Development of the NIST vulnerability data ontology (Vulntology).
JavaScript
36
star
53

pyPRISM

A framework for conducting polymer reference interaction site model (PRISM) calculations
Python
35
star
54

DT4SM

Digital Thread for Smart Manufacturing
C#
34
star
55

OOF3D

Object Oriented for Finite Elements 3D version code.
Python
34
star
56

hugo-uswds

Implementation of the The United States Web Design System (USWDS) 2.0 using the Hugo open-source static site generator
SCSS
33
star
57

rcslib

NIST Real-Time Control Systems Library including Posemath, NML communications & Java Plotter
Java
33
star
58

PrivacyFrmwkResources

This repository contains resources to support organizations’ use of the Privacy Framework. Resources include crosswalks, Profiles, guidelines, and tools. NIST encourages new contributions and feedback on these resources as part of the ongoing collaborative effort to improve implementation of the Privacy Framework.
33
star
59

dataplot

Source code and auxiliary files for dataplot.
Fortran
32
star
60

oscal-tools

Tools for the OSCAL project
XSLT
32
star
61

pyramidio

Image pyramid reader and writer
Java
31
star
62

Voting

The NIST Voting Program repository
31
star
63

800-63-4

CSS
31
star
64

metaschema

Documentation for and implementations of the metaschema modeling language
Shell
31
star
65

MDCS

CSS
31
star
66

SDNist

SDNist: Benchmark data and evaluation tools for data synthesizers.
HTML
30
star
67

phasefield-precipitate-aging

Phase field model for precipitate aging in ternary analogues to Ni-based superalloys
Cuda
30
star
68

pySCATMECH

pySCATMECH is a Python interface to SCATMECH: Polarized Light Scattering C++ Class Library
C++
30
star
69

AGA8

Files associated with the AGA8 standard
Rust
30
star
70

feasst

The Free Energy and Advanced Sampling Simulation Toolkit (FEASST) is a free, open-source, modular program to conduct molecular and particle-based simulations with flat-histogram Monte Carlo methods.
C++
29
star
71

NetSimulyzer-ns3-module

A flexible 3D visualizer for displaying, debugging, presenting, and understanding ns-3 scenarios.
C++
28
star
72

liboscal-java

A Java library to support processing OSCAL content
Java
28
star
73

OFDM-GAN

Python
28
star
74

lantern

Interpretable genotype-phenotype landscape modeling
Python
28
star
75

ChebTools

C++ tools for working with Chebyshev expansion interpolants
C++
27
star
76

MediScore

Scoring tools for Media Forensics Evaluations
HTML
27
star
77

hedgehog

C++
27
star
78

NetSimulyzer

A flexible 3D visualizer for displaying, debugging, presenting, and understanding ns-3 scenarios.
C++
27
star
79

atomvision

Deep learning framework for atomistic image data
Python
26
star
80

REFPROP-issues

A repository solely used for reporting issues with NIST REFPROP
26
star
81

SCATMECH

SCATMECH: Polarized light scattering C++ class library
C++
26
star
82

youbot

Robotic platform for industrial control systems cybersecurity research. We use the research-grade Youbot as the robotics platform for our research. The ROS framework is used for inter-process communication, and Python is the language used for application development.
Python
26
star
83

ThreeBodyTB.jl

Accurate and fast tight-binding calculations, using pre-fit coefficients and three-body terms.
Julia
25
star
84

Circuits

Circuits for functions of interest to cryptography
C++
25
star
85

OOF2

Object Oriented for Finite Elements 2D version.
C++
25
star
86

F4DE

Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluations
Perl
24
star
87

optbayesexpt

Optimal Bayesian Experiment Design
Python
24
star
88

blockmatrix

This project is developing code to implement features and extensions to the NIST Cybersecurity Whitepaper, "A Data Structure for Integrity Protection with Erasure Capability". The block matrix data structure may have utility for incorporation into applications requiring integrity protection that currently use permissioned blockchains. This capability could for example be useful in meeting privacy requirements such as the European Union General Data Protection Regulation (GDPR), which requires that organizations make it possible to delete all information related to a particular individual, at that person's request.
Java
24
star
89

libbiomeval

Software components for biometric technology evaluations.
C++
24
star
90

ElectionResultsReporting

Common data format specification for election results reporting data
23
star
91

oscal-deep-diff

Open Security Controls Assessment Language (OSCAL) Deep Differencing Tool
TypeScript
22
star
92

IFA

The NIST IFC File Analyzer (IFA) generates a spreadsheet from an IFC file.
Tcl
22
star
93

ns3-oran

A module that can be used to model and simulate O-RAN-like behavior in ns-3.
C++
22
star
94

MUD-PD

A tool for characterizing the network behavior of IoT Devices. The primary intended use is to assist in the generation of allowlist files formatted according to the Manufacturer Usage Description specification.
Python
21
star
95

texture

Python scripts for analysis of crystallographic texture
Jupyter Notebook
21
star
96

trojai-example

Example TrojAI Submission
21
star
97

blossom-case-study

A case study for ACSAC 2022 utilizing OSCAL with a custom GitHub action to automate assessments.
HTML
21
star
98

BiometricEvaluation

NIST Image Group Biometric Repositories
20
star
99

WIPP

Web Image Processing Pipeline (WIPP)
Shell
20
star
100

CastVoteRecords

Common data format specification for cast vote records
19
star