ADINTOOL(1)
adintool - a tool to record / split / send / receive audio streams
adintool {-in inputdev} {-out outputdev} [options...]
adintool analyzes speech input, finds speech segments while skipping silence, and records the detected segments in various ways. It detects speech based on the zero-crossing count and power (level), and records the detected parts to files or other output devices successively.
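The detection idea described above can be sketched in a few lines of Python. This is only an illustrative sketch, not adintool's actual implementation: the function names, the fixed frame length, and the default thresholds are assumptions chosen for the example, and the real tool also applies head and tail margins (see -headmargin and -tailmargin).

```python
def count_level_crossings(frame, level):
    """Count sign changes between samples whose magnitude reaches `level`."""
    crossings = 0
    last_sign = 0
    for s in frame:
        if s >= level:
            sign = 1
        elif s <= -level:
            sign = -1
        else:
            continue  # below the level threshold: ignored
        if last_sign != 0 and sign != last_sign:
            crossings += 1
        last_sign = sign
    return crossings


def find_segments(samples, frame_len=160, level=2000, min_crossings=2):
    """Return (start, end) sample index pairs of runs of active frames.

    frame_len, level and min_crossings are hypothetical defaults.
    """
    segments = []
    start = None
    for pos in range(0, len(samples), frame_len):
        frame = samples[pos:pos + frame_len]
        active = count_level_crossings(frame, level) >= min_crossings
        if active and start is None:
            start = pos  # a segment begins at this frame
        elif not active and start is not None:
            segments.append((start, pos))  # the segment ended before this frame
            start = None
    if start is not None:
        segments.append((start, len(samples)))
    return segments
```

A frame counts as speech only when the signal is both loud enough (the level threshold) and oscillating (the crossing count), so a constant offset or low-level noise does not trigger a segment.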
adintool is an enhanced version of adinrec with additional functions. Supported input devices are: microphone input, a speech file, standard tty input, and a network socket (called adin-net server mode). Julius plugins can also be used. Detected speech segments are saved to output devices: speech files, standard tty output, or a network socket (called adin-net client mode). For example, you can split incoming speech into segments and send them to Julius to be recognized.
The output format is WAV, 16-bit (signed short), monaural. If the file already exists, it will be overwritten.
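As a rough illustration of the output format described above, the following Python sketch writes one segment as a 16-bit signed, single-channel WAV file using the standard wave module. The helper names are hypothetical, the 16000 Hz default rate is an assumption for the example, and the "base.NNNN.wav" naming follows the file names shown in the examples of this page.

```python
import struct
import wave


def segment_filename(base, start_id, index):
    """Build a name such as "data.0000.wav" (four-digit ID is assumed)."""
    return "%s.%04d.wav" % (base, start_id + index)


def write_segment(path, samples, rate=16000):
    """Write `samples` (ints in the signed 16-bit range) as mono WAV.

    Opening in "wb" mode replaces an existing file of the same name.
    `rate` is an assumed default, not taken from this page.
    """
    with wave.open(path, "wb") as w:
        w.setnchannels(1)   # monaural
        w.setsampwidth(2)   # 16 bit = 2 bytes per sample
        w.setframerate(rate)
        # WAV stores 16-bit samples as little-endian signed shorts
        w.writeframes(struct.pack("<%dh" % len(samples), *samples))
```

For instance, write_segment(segment_filename("data", 0, 0), samples) would store the first detected segment in "data.0000.wav".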
All Julius options are accepted. Only options related to audio input take effect; the others are silently ignored. Below is a list of options.
-freq Hz
-in inputdev
Alternatively, the input device can be set with the "-input" option, in which case plugin input can be used.
-out outputdev
-inport num
-server [host] [,host...]
-port [num] [,num...]
-filename file
-startid number
-oneshot
-nosegment
-raw
-autopause
-loosesync
-rewind msec
-input {mic|rawfile|adinnet|stdin|netaudio|esd|alsa|oss}
'mic' takes audio input from the default live microphone device, and 'adinnet' receives waveform data over a TCP/IP network from an adinnet client. 'netaudio' takes input from DatLink/NetAudio, and 'stdin' reads data from standard input.
On Linux, you can choose the audio API at run time by specifying alsa, oss, or esd.
-chunk_size samples
-lv thres
-zc thres
-headmargin msec
-tailmargin msec
-zmean
-smpFreq Hz
-48
-NA devicename
-adport port_number
-nostrip
-C jconffile
-plugindir dirlist
ALSADEV
AUDIODEV
LATENCY_MSEC
Record microphone input to files: "data.0000.wav", "data.0001.wav" and so on:
Split a long speech file "foobar.raw" into "foobar.1500.wav", "foobar.1501.wav" ...:
Copy an entire audio file via a network socket:
Detect speech segment, send to Julius via network and recognize it:
julius(1), adinrec(1)
Copyright (c) 1997-2000 Information-technology Promotion Agency, Japan
Copyright (c) 1991-2008 Kawahara Lab., Kyoto University
Copyright (c) 2000-2005 Shikano Lab., Nara Institute of Science and Technology
Copyright (c) 2005-2008 Julius project team, Nagoya Institute of Technology
The same as Julius.
10/02/2008