• Stars
    star
    147
  • Rank 242,545 (Top 5 %)
  • Language
    Objective-C
  • License
    BSD 3-Clause "New...
  • Created about 2 years ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Command line speech recognition and transcription for macOS

License Language Release Build

hear

hear

macOS speech recognition via the command line

hear is a command line interface for the built-in speech recognition capabilities in macOS. This makes it possible to transcribe spoken audio input or entire audio files.

Apple's macOS has long shipped with the say tool, a command line interface for the operating system's venerable speech synthesis capabilities. As of 10.15 "Catalina", it also ships with highly capable speech recognition for a variety of languages. This project is an attempt to provide a robust CLI for this functionality since no such tool is provided by Apple. hear supports speech recognition and transcription of both live microphone input and audio files in a variety of formats. All formats supported by CoreAudio should work (e.g. WAV, MP3, AIFF, AAC, CAF, ALAC, etc.).

See the man page for further details.

Download

hear is free, open source software written in Objective-C. The source code is freely available under a BSD license. If you find this program useful, please make a donation.

Installation

After downloading and expanding the archive, change to the resulting directory and run the following command:

bash install.sh

This will install the binary into /usr/local/bin and the man page into /usr/local/share/man/man1/.

Usage

Transcribe audio from microphone (or default audio input device)

hear

From microphone in single line output mode

hear -m

Transcribe audio file

hear -d -i /path/to/someone_speaking.mp3 > transcribed_text.txt

The -d flag specifies that only on-device speech recognition capabilities should be used. Otherwise, data may be sent to Apple servers, which as of writing (2022) have a hard limit of something like 500 characters before quitting.

Build

To build hear from source, run the following command from the repository root (requires Xcode command line build tools):

make build_unsigned

The resulting binary is created in products/.

Test

To run tests on the built command line tool, execute the following command from the repository root:

bash test/test.sh

Troubleshooting

If running the binary leads to an abort signal, try running the binary by right-clicking, and selecting open. This should automatically open the binary with the regular terminal and prompt you to allow the required permissions.

BSD License

Copyright (c) 2022-2023 Sveinbjorn Thordarson <[email protected]>

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

  1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.

  2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

  3. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

More Repositories

1

Sloth

Mac app that shows all open files, directories, sockets, pipes and devices in use by all running processes. Nice GUI for lsof.
Objective-C
7,949
star
2

Platypus

Create native Mac applications from command line scripts.
Objective-C
2,517
star
3

STPrivilegedTask

An NSTask-like wrapper around the macOS Security Framework's AuthorizationExecuteWithPrivileges function to run shell commands with root privileges in Objective-C / Cocoa.
Objective-C
386
star
4

ManDrake

Native Mac man page editor with syntax coloring, mandoc syntax checking & live preview.
Objective-C
142
star
5

DataURLToolkit

Various tools for working with Data URLs, incl. web application (http://dataurl.net), Mac OS X GUI app, command line tool, Perl modules and Apache module.
Perl
107
star
6

osxiconutils

A suite of command line utilities for working with Mac icons
Objective-C
104
star
7

searchfs

Command line tool to quickly search by filename on entire APFS and HFS+ volumes using the file system catalog.
Objective-C
98
star
8

KatSearch

Fast, simple, powerful filesystem search for macOS, just like the good ol' days.
Objective-C
97
star
9

FastDMG

Fast, no-nonsense disk image mounting for macOS
Objective-C
89
star
10

Phew

FLIF image viewer and QuickLook plugin for macOS
Objective-C
82
star
11

lemur2013

WordPress theme for lemurinn.is
PHP
47
star
12

Dim

User-friendly, high quality document icon maker for OS X
Objective-C
42
star
13

SnapTools

Tools to integrate the macOS window environment and command line interface
Objective-C
40
star
14

IconScanner

Find and inspect macOS icon assets (.icns) on your system
Objective-C
30
star
15

ensk.is

A free and open English-Icelandic dictionary. https://ensk.is
Python
27
star
16

PixlView

Mac tool to view raw pixel data in a variety of formats
Objective-C
24
star
17

iceaddr

Python package to look up information about Icelandic street addresses, postcodes and placenames
Python
23
star
18

STPathTextField

Subclass of NSTextField that supports shell and browser-style path autocompletion
Objective-C
20
star
19

sagadb.org

Icelandic Saga Database
HTML
20
star
20

ochre

macOS optical character recognition via the command line
Objective-C
9
star
21

SnowboyFramework

Snowboy hotword detection engine packaged as an iOS framework
C++
8
star
22

docerator

GUI for the docerator.py program
Objective-C
7
star
23

ManOpen-Intel

Intel-buildable package of Carl Lindberg's ManOpen
C
7
star
24

BrowserPick

Control which macOS browsers should open which URLs using rules and hotkeys
Objective-C
7
star
25

cityloc

Python package to look up world city coordinates and country codes
Python
5
star
26

Pizarro

Quirky music-oriented iOS game implemented using Cocos2D and the Chipmunk physics engine
Objective-C
3
star
27

vafuglinn

Váfuglinn website
HTML
2
star
28

MalidMacService

Mac Service to look up Icelandic words on málið.is
Objective-C
2
star