• Stars
    star
    208
  • Rank 189,015 (Top 4 %)
  • Language
    C
  • License
    Other
  • Created over 8 years ago
  • Updated 12 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

SCTK, the NIST Scoring Toolkit

File: README.md

Date: October 28, 2021

SCTK Version: 2.4.12

This directory contains SCTK, the NIST Scoring Toolkit. SCTK contains the source code, documentation and example data for the following programs:

  • sclite V2.10 - "Score Lite",
  • sc_stats V1.3 - sclite's Statistical Significance Comparison tool
  • rover V0.1 - Recognition Output Voting Error Reduction
  • asclite V1.11 - Multidimensional alignment replacement for sclite

SCTK Basic Installation

To install compile and install sctk, from the main directory type the commands:

% make config
% make all
% make check
% make install
% make doc

The make config command executes the script config.sh in the src/sclite directory and initialized the makefiles in the rest of the src directories.

You can specify installation directory by changing the PREFIX variable in the src/makefile file. By default, all executables will be installed in bin of this directory. Be sure to add the installation directory to your PATH variable in order to access the programs.

The make all command will compile all toolkit executables and libraries. Compilation of rfilter1 may fail if your version of libC is very new because it now includes the strcmp* family of functions. You can use the libC versions, and avoid compilation complaints, by following the instructions in src/rfilter1/makefile.

The make check will perform self tests on the toolkit. Please report any problems to Jon Fiscus at NIST with any bug reports or comments at the email address [email protected]. Please include the version number and any other relevant information such as OS, compiler, etc.

The make install command will copy the executables to $PREFIX (see make config above.)

UNIX-style manual pages and HTML formatted manuals are provided in the doc directory.

Bundled Software

This version of SCTK includes the CMU-Cambridge Statistical Language Modeling Toolkit v2. Included in STCK is the complete source code tree and documentation. The toolkit has been included so that SCLITE can use a language model to compute conditional word weights for word-weighted scoring.

Optional Software/Notes:

sclite: To use the interactive installation for sclite, edit the file src/sclite/config.sh at line 36, set the user_interaction=1 it will ask you during the configuration (make config) if you want to use GNU diff instead of sclite and CMU-Cambridge SLM for sclite. Those choices are not recommended and not used anymore.

sclite has the ability to use GNU's diff utility as an alternate string alignment method. The installer has the choice whether or not to include this ability in the compiled version.

GNU diff: If the use of GNU's diff is desired, you will need GNU's diff utility from their diffutils distribution (version 2.7). If you cannot locate this distribution, a copy is available at ftp://jaguar.ncsl.nist.gov/pub/diffutils-2.7.tar. When you install diff, make sure the executable is installed as gdiff, gnudiff or diffgnu (config.sh expects it to be called one of these).

Special Note to SGI owners: When installing GNU's diff make sure to call it gnudiff or diffgnu. If you use gdiff the executable name may be overshadowed by a pre-existing executable.

utf_filt: utf_filt is using SGML external software and modules like SGMLS perl module and nsgmls program which have been updated and continued: OpenSP (http://sourceforge.net/project/showfiles.php?group_id=2115&package_id=2869). Version v1.5.2 of OpenSP have been tested and working with the current version of utf_filt.

Special Note to Cygwin users: it has been reported that compilation of rfilter1 can fail in some case, please read the OPTIONS part of the rfilter1/makefile and adapt accordingly before retrying compilation.

64 bits Compilation: With big alignments, sctk needs to be compiled in 64 bits. By default, the C/C++ software are compiled in 32 bits with the options (-Os) but can be compiled in 64 bits. To do so, -m64 is added to the CFLAGS variable in src/asclite/core/makefile, src/asclite/test/makefile, src/rfilter1/makefile and src/sclite/makefile.

Example of CFLAGS for OSX 10.4+: -fast -m64 -arch x86_64 -arch ppc64

Known to compile on:

The distribution was initially written in ANSI C on a SUN Workstation. To date, the code has been successfully compiled and tested on the following platforms:

Systems Version
SUN SunOS 4.1.4 using gcc
SUN Solaris SunOS 5.4 using cc
SUN Solaris SunOS 5.5 using gcc 1.0 - 1.2
SGI IRIX 5.3 using cc
SGI IRIX 4.0.3H using gcc
HP 735 using gcc
HP using "cc" 1.0 - 1.2
DEC Alpha running DEC OSF/1 V2.0 "cc -std1"
DEC Alpha running DEC OSF/1 V2.0 "gcc" 1.1
IBM Powerstation running AIX V3.2 using cc 1.0
IBM Powerstation running AIX V3.2 using gcc [*] 1.0
Linux Version 2.0.33 1.0 - 1.2
Cygwin 1.3 - 2.0B2
Linux Version 2.2.17 1.3 - 2.0B2
Linux Version 2.4.20-20-9 [**] 1.3 - 2.1.7
Linux Version 2.6.19.fc5 2.2.0 - 2.2.4
Linux Version 2.6.9-5.EL (gcc 3.4.6-3) 2.2.0 - 2.2.4
Darwin Version 7.4.0 1.3
Darwin Version 8.5.0 (gcc v4.0.1 build 5367) 2.1.2 - 2.3-rc1
Darwin Version 8.11.0 (gcc v4.0.1 build 5367) 2.3-rc2 - 2.4.0
Darwin Version 9.6.0 (gcc v4.0.1 build 5490) 2.3.6 - 2.4.0
RedHat Fedora 9 (gcc v4.3.0-8) 2.3.9 - 2.4.0
Darwin Version 10.8.0 (gcc version 4.2.1 (Apple Inc. build 5666) 2.4.2 - 2.4.8

[*] compiled with warning messages, but successful self test

[**] compiled and ran, but test6, which uses GNU's diff for alignment fails. The use of DIFF for alignment is rarely used and the failure can be safely ignored.

Docker

A Dockerfile is included to build and use SCTK without compiling the codebase for your particular platform.

To build:

docker build -t sctk .

Hypothesis Test

To run sclite on a reference file and hypothesis file, you need to map a volume to your host.

Linux or a Mac:

docker run -it -v $PWD:/var/sctk sctk sclite -i wsj -r ref.txt -h hyp.txt

Windows:

docker run -it -v /myhostpath:/var/sctk sctk sclite -i wsj -r ref.txt -h hyp.txt

sc_stats

To run more advanced features of the SDK, shell into the container with a volume mapped to your local files:

docker run -it -v $PWD:/var/sctk sctk sh
/var/sctk #

Then run sclite piped to sc_stats:

sclite -i wsj -r ref.txt -h hyp1.txt -h hyp2.txt -o sgml stdout | sc_stats -p -t mapsswe -u
sc_stats: 1.3
Beginning Multi-System comparisons and reports
    Performing the Matched Pair Sentence Segment (Word Error) Test
    Printing Unified Statistical Test Reports
        Output written to 'Ensemble.stats.unified'

Successful Completion

Contact

If you have questions, please address them to Jonathan Fiscus at [email protected]

License

See LICENSE.md for details or http://www.nist.gov/open/license.cfm

More Repositories

1

macos_security

macOS Security Compliance Project
YAML
1,748
star
2

800-63-3

Home to public development of NIST Special Publication 800-63-3: Digital Authentication Guidelines
CSS
702
star
3

OSCAL

Open Security Controls Assessment Language (OSCAL)
XSLT
572
star
4

fipy

FiPy is a Finite Volume PDE solver written in Python
Python
430
star
5

jarvis

JARVIS-Tools: an open-source software package for data-driven atomistic materials design. Publications: https://scholar.google.com/citations?user=3w6ej94AAAAJ
Python
289
star
6

jsip

JSIP: Java SIP specification Reference Implementation (moved from java.net)
Java
287
star
7

frvt

Repository for the Face Recognition Vendor Test (FRVT)
C++
261
star
8

trec_eval

Evaluation software used in the Text Retrieval Conference
C
224
star
9

dioptra

Test Software for the Characterization of AI Technologies
Python
220
star
10

oscal-content

NIST SP 800-53 content and other OSCAL content examples
Shell
218
star
11

alignn

Atomistic Line Graph Neural Network https://scholar.google.com/citations?user=9Q-tNnwAAAAJ&hl=en
Python
218
star
12

SP800-90B_EntropyAssessment

The SP800-90B_EntropyAssessment C++package implements the min-entropy assessment methods included in Special Publication 800-90B.
C++
200
star
13

PrivacyEngCollabSpace

Privacy Engineering Collaboration Space
Python
186
star
14

ACVP

Industry Working Group on Automated Cryptographic Algorithm Validation
HTML
163
star
15

REFPROP-wrappers

Wrappers around NIST REFPROP for languages such as Python, MATLAB, etc.
Mathematica
160
star
16

mobile-threat-catalogue

NIST/NCCoE Mobile Threat Catalogue
HTML
141
star
17

trojai-literature

131
star
18

NFIQ2

Optical live-scan and ink fingerprint image quality assessment tool
C++
130
star
19

MIST

Microscopy Image Stitching Tool
Java
130
star
20

applesec

Draft SP 800-179r1 macOS 10.12 Security project files: draft publication, security settings spreadsheet and Bash script implementation of settings.
Shell
116
star
21

ndn-dpdk

NDN-DPDK: High-Speed Named Data Networking Forwarder
Go
114
star
22

ARIAC

Repository for ARIAC (Agile Robotics for Industrial Automation Competition), consisting of kit building and assembly in a simulated warehouse
C++
110
star
23

SFA

The NIST STEP File Analyzer and Viewer (SFA) generates a spreadsheet and a visualization from an ISO 10303 Part 21 STEP file.
Tcl
109
star
24

NEMO

NEMO is a laboratory logistics web application. Use it to schedule reservations, control tool access, track maintenance issues, and more.
Python
98
star
25

jsfive

A pure javascript HDF5 reader
JavaScript
97
star
26

h5wasm

A WebAssembly HDF5 reader/writer library
C++
84
star
27

pyMCR

pyMCR: Multivariate Curve Resolution for Python
Python
80
star
28

policy-machine-core

Core components of the Policy Machine, a NGAC reference implementation.
Java
76
star
29

psc-ns3

Public Safety Communication modeling tools based on ns-3
C++
68
star
30

chemnlp

ChemNLP: A Natural Language Processing based Library for Materials Chemistry Text Data
Python
65
star
31

Metrology

Metrology for software; software for metrology
JavaScript
65
star
32

STP2X3D

Translator from STEP format to X3D format
C++
62
star
33

combinatorial-testing-tools

Tools for combinatorial testing developed by the NIST ACTS project
Java
61
star
34

jarvis_leaderboard

Explore State-of-the-Art Materials Design Methods: https://www.nature.com/articles/s41524-024-01259-w
Jupyter Notebook
55
star
35

COSMOSAC

A Benchmark Implementation of COSMO-SAC
HTML
52
star
36

ACVP-Server

A repository tracking releases of NIST's ACVP server. See www.github.com/usnistgov/ACVP for the protocol.
C#
52
star
37

pfhub

The CHiMaD Phase Field Community Website
HTML
49
star
38

REFPROP-cmake

Small repo with CMake build system for building REFPROP shared library
CMake
48
star
39

teqp

A highly efficient, flexible, and accurate implementation of thermodynamic EOS powered by automatic differentiation
C++
48
star
40

Lightweight-Cryptography-Benchmarking

C
48
star
41

SimulatedRadarWaveformGenerator

A software tool that generates simulated radar signals and creates RF datasets for developing and testing machine/deep learning detection algorithms.
MATLAB
47
star
42

iheos-toolkit2

XDS Toolkit
Java
46
star
43

OpenSeadragonFiltering

OpenSeadragon filtering plugin
JavaScript
45
star
44

pmml_pymcBN

Jupyter Notebook
42
star
45

ActEV_Scorer

Scoring software for the TRECVID Activities in Extended Video (ActEV) evaluation
Python
41
star
46

HTGS

The Hybrid Task Graph Scheduler API
C++
40
star
47

sctools

Tools for security content automation, baseline tailoring, and overlay development.
HTML
39
star
48

hiperc

High Performance Computing Strategies for Boundary Value Problems
HTML
39
star
49

OpenSeadragonScalebar

OpenSeadragon scalebar plugin
JavaScript
38
star
50

pyPRISM

A framework for conducting polymer reference interaction site model (PRISM) calculations
Python
38
star
51

ocr-pipeline

Convert a corpus of PDF to clean text files on a distributed architecture
Python
38
star
52

800-63-4

HTML
37
star
53

mosaic

A modular single-molecule analysis interface
Python
37
star
54

oscal-cli

A simple open source command line tool to support common operations over OSCAL content.
Java
37
star
55

vulntology

Development of the NIST vulnerability data ontology (Vulntology).
JavaScript
36
star
56

DT4SM

Digital Thread for Smart Manufacturing
C#
34
star
57

OOF3D

Object Oriented for Finite Elements 3D version code.
Python
34
star
58

NetSimulyzer

A flexible 3D visualizer for displaying, debugging, presenting, and understanding ns-3 scenarios.
C++
34
star
59

NetSimulyzer-ns3-module

A flexible 3D visualizer for displaying, debugging, presenting, and understanding ns-3 scenarios.
C++
33
star
60

pyramidio

Image pyramid reader and writer
Java
33
star
61

rcslib

NIST Real-Time Control Systems Library including Posemath, NML communications & Java Plotter
Java
33
star
62

AGA8

Files associated with the AGA8 standard
Rust
33
star
63

hugo-uswds

Implementation of the The United States Web Design System (USWDS) 2.0 using the Hugo open-source static site generator
SCSS
33
star
64

PrivacyFrmwkResources

This repository contains resources to support organizationsโ€™ use of the Privacy Framework. Resources include crosswalks, Profiles, guidelines, and tools. NIST encourages new contributions and feedback on these resources as part of the ongoing collaborative effort to improve implementation of the Privacy Framework.
33
star
65

dataplot

Source code and auxiliary files for dataplot.
Fortran
32
star
66

oscal-tools

Tools for the OSCAL project
XSLT
32
star
67

SDNist

SDNist: Benchmark data and evaluation tools for data synthesizers.
HTML
31
star
68

Voting

The NIST Voting Program repository
31
star
69

metaschema

Documentation for and implementations of the metaschema modeling language
Shell
31
star
70

MDCS

CSS
31
star
71

pySCATMECH

pySCATMECH is a Python interface to SCATMECH: Polarized Light Scattering C++ Class Library
C++
31
star
72

phasefield-precipitate-aging

Phase field model for precipitate aging in ternary analogues to Ni-based superalloys
Cuda
30
star
73

atomvision

Deep learning framework for atomistic image data
Python
29
star
74

OFDM-GAN

Python
29
star
75

feasst

The Free Energy and Advanced Sampling Simulation Toolkit (FEASST) is a free, open-source, modular program to conduct molecular and particle-based simulations with flat-histogram Monte Carlo methods.
C++
29
star
76

liboscal-java

A Java library to support processing OSCAL content
Java
28
star
77

lantern

Interpretable genotype-phenotype landscape modeling
Python
28
star
78

ns3-oran

A module that can be used to model and simulate O-RAN-like behavior in ns-3.
C++
28
star
79

ChebTools

C++ tools for working with Chebyshev expansion interpolants
C++
27
star
80

MediScore

Scoring tools for Media Forensics Evaluations
HTML
27
star
81

hedgehog

C++
27
star
82

REFPROP-issues

A repository solely used for reporting issues with NIST REFPROP
26
star
83

SCATMECH

SCATMECH: Polarized light scattering C++ class library
C++
26
star
84

youbot

Robotic platform for industrial control systems cybersecurity research. We use the research-grade Youbot as the robotics platform for our research. The ROS framework is used for inter-process communication, and Python is the language used for application development.
Python
26
star
85

ThreeBodyTB.jl

Accurate and fast tight-binding calculations, using pre-fit coefficients and three-body terms.
Julia
25
star
86

Circuits

Circuits for functions of interest to cryptography
C++
25
star
87

OOF2

Object Oriented for Finite Elements 2D version.
C++
25
star
88

libbiomeval

Software components for biometric technology evaluations.
C++
25
star
89

F4DE

Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluations
Perl
24
star
90

optbayesexpt

Optimal Bayesian Experiment Design
Python
24
star
91

blockmatrix

This project is developing code to implement features and extensions to the NIST Cybersecurity Whitepaper, "A Data Structure for Integrity Protection with Erasure Capability". The block matrix data structure may have utility for incorporation into applications requiring integrity protection that currently use permissioned blockchains. This capability could for example be useful in meeting privacy requirements such as the European Union General Data Protection Regulation (GDPR), which requires that organizations make it possible to delete all information related to a particular individual, at that person's request.
Java
24
star
92

texture

Python scripts for analysis of crystallographic texture
Jupyter Notebook
23
star
93

ElectionResultsReporting

Common data format specification for election results reporting data
23
star
94

oscal-deep-diff

Open Security Controls Assessment Language (OSCAL) Deep Differencing Tool
TypeScript
22
star
95

IFA

The NIST IFC File Analyzer (IFA) generates a spreadsheet from an IFC file.
Tcl
22
star
96

MUD-PD

A tool for characterizing the network behavior of IoT Devices. The primary intended use is to assist in the generation of allowlist files formatted according to the Manufacturer Usage Description specification.
Python
21
star
97

trojai-example

Example TrojAI Submission
21
star
98

NIST-Tech-Pubs

XML metadata for NIST Technical Series Publications
HTML
21
star
99

blossom-case-study

A case study for ACSAC 2022 utilizing OSCAL with a custom GitHub action to automate assessments.
HTML
21
star
100

atomgpt

AtomGPT: Atomistic Generative Pretrained Transformer for Forward and Inverse Materials Design
Python
21
star