RSC-4128
Speech Recognition Processor
Data Sheet
General Description
The RSC-4128 represents Sensory’s next generation
speech and analog I/O mixed signal processor. The
RSC-4128 is designed to bring advanced speech I/O
features to cost sensitive embedded and consumer
products. Based on an 8-bit microcontroller, the RSC-
4128 integrates speech-optimized digital and analog
processing blocks into a single chip solution capable
of accurate speech recognition; high quality, low
data-rate compressed speech; and advanced music.
Products can use one or all features in a single
application.
The RSC-4128 supports Sensory Speech™ 7
technology, which includes advanced speech
algorithms that add features and improve
performance. Capable of running both new HMM and
enhanced neural network technologies, accuracy in
all kinds of noise is dramatically improved. New
Speaker Verification technology is perfect for voice
password security applications that must work in
noisy environments. New high quality compressed
speech technology reduces data rates by 5 times.
New 8 voice MIDI-compatible music includes drum
tracks, effectively increasing instruments beyond 8.
Simultaneous music and speech rounds out the
Sensory Speech™ 7 technology.
The RSC-4128 also supports the revolutionary
capability
of
creating
speaker
independent
recognition sets by simply typing in the desired
recognition vocabulary! A few keystrokes creates a
recognition set in seconds without the wait or cost of
recording sessions to train the recognizer, speeding
time to sales.
A new and unique Audio Wakeup feature listens
while the RSC-4128 is in power down mode. When
an audio event such as a clap or whistle occurs,
Audio Wakeup will wakeup the RSC-4128 for speech
or application tasks. Audio Wakeup is perfect for
battery applications that require continuous listening
and long battery life.
In addition to improved recognition performance, the
RSC-4128 provides further on-chip integration of
features. A complete speech I/O application can be
built with as few additional parts as a clock crystal,
speaker, microphone, and few resistors and
capacitors.
Moreover, the RSC-4128 provides an unprecedented
level of cost effective system-on-chip (SOC)
integration, enabling many applications that require
DSP and/or audio processing. The RSC-4128 may
be used as a general-purpose mixed signal
processor
platform
for
custom
algorithms,
technologies and applications.
Features
Full Range of Sensory Speech™ 7 Capabilities
Enhanced Word Spotting capability (10 SI or 4 SD
words) in parallel
Noise robust Speaker Independent, Dependent &
Continuous Listening recognition
Speaker Verification (SVWS) – Noise robust voice
biometric security
High quality, 3.7-7.8 kbps speech synthesis & sound
effects with Sensory “SX” synthesis technology
8 voice MIDI-compatible music synthesis coincident with
speech; drum track feature enables additional voices
Voice record & playback
Audio Wakeup from sleep
Integrated Single-Chip Solution
8-bit microcontroller
ROMless, 128KByte and 256KByte ROM options
16 bit ADC, 10 bit DAC and microphone pre-amplifier
Independent, programmable Digital Filter engine
4.8 KBytes total RAM (256Bytes “user” application RAM)
Five timers (3 GP, 1 Watchdog, 1 Multi Tasking)
Twin-DMA, Vector Math accelerator, and Multiplier
Built-in Analog Comparator Unit (4 inputs)
External memory bus: 20-bit Address(1Mbyte), 8-bit Data
On chip storage for SD, SV, templates (10 templates)
Code security through no ROM dump capability
Uses low cost 3.58MHz crystal (internal PLL)
Low EMI design for FCC and CE requirements
24 configurable I/O lines with 10 mA (typical) outputs
Fully nested interrupt structure with up to 8 sources
Optional Real Time Clock
Long Battery Life
2.4 – 3.6V operation
12mA (typical) operating current at 3V
2 low power modes; 1
µ
A typical sleep current
Full Suite of Quick & Powerful Tools
Quick Text-to-SI (T2SI) text entry to build noise robust SI
recognition sets – low cost & push-button – no recording!
Quick Synthesis for push-button speech compression
Integrated Development Environment, C Compiler,
Debugger & In Circuit Emulator from Phyton, Inc.
© 2004 Sensory Inc.
P/N 80-0206-J
1
RSC-4128
Data Sheet
Table of Contents
General Description ........................................................................................................................................................................ 1
RSC-4128 Overview ....................................................................................................................................................................... 4
Speech Technologies ..................................................................................................................................................................... 5
Speech Recognition...............................................................................................................................................................................................5
Speaker Verification...............................................................................................................................................................................................5
Speech and Music Synthesis.................................................................................................................................................................................5
Record and Playback.............................................................................................................................................................................................5
RSC-4128 Architecture ................................................................................................................................................................... 6
Reference Schematics.................................................................................................................................................................... 8
Using the RSC-4128 ..................................................................................................................................................................... 10
Instruction Set ......................................................................................................................................................................................................10
Stack ....................................................................................................................................................................................................................11
Register RAM.......................................................................................................................................................................................................11
L1 Vector Accelerator/Multiplier ...........................................................................................................................................................................12
Power and Wakeup Control .................................................................................................................................................................................12
General Purpose I/O ............................................................................................................................................................................................13
Memory Addressing .............................................................................................................................................................................................14
Wait States...........................................................................................................................................................................................................17
On-Chip ROM ......................................................................................................................................................................................................18
Oscillators ............................................................................................................................................................................................................18
Clocks ..................................................................................................................................................................................................................19
Timers/Counters ..................................................................................................................................................................................................20
Interrupts..............................................................................................................................................................................................................23
Analog input .........................................................................................................................................................................................................25
Audio Wakeup......................................................................................................................................................................................................26
Microphones ........................................................................................................................................................................................................27
Reset....................................................................................................................................................................................................................28
Digital-to-Analog-Converter (DAC) Output...........................................................................................................................................................28
Pulse Width Modulator (PWM) Analog Output.....................................................................................................................................................30
Comparator Unit...................................................................................................................................................................................................31
Instruction Set Opcodes and Timing Details ................................................................................................................................. 33
MOVE Group Instructions ....................................................................................................................................................................................33
ROTATE Group Instructions ................................................................................................................................................................................34
BRANCH Group Instructions ...............................................................................................................................................................................34
ARITHMETIC/LOGICAL Group Instructions ........................................................................................................................................................34
MISCELLANEOUS Group Instructions ................................................................................................................................................................35
Special Functions Registers (SFRs) Summary............................................................................................................................. 36
DC Characteristics ........................................................................................................................................................................ 38
A.C. Characteristics (External memory accesses) ........................................................................................................................ 38
Timing Diagrams........................................................................................................................................................................... 39
Absolute Maximum Ratings .......................................................................................................................................................... 39
Package Options .......................................................................................................................................................................... 40
Die Pad Ring................................................................................................................................................................................. 43
RSC-4128 Die Bonding Pad Locations ......................................................................................................................................... 44
2
P/N 80-0206-J
© 2004 Sensory Inc.
Data Sheet
RSC-4128
Mechanical Data ........................................................................................................................................................................... 45
Ordering Information..................................................................................................................................................................... 48
The Interactive Speech™ Product Line ........................................................................................................................................ 49
3
P/N 80-0206-J
© 2004 Sensory Inc.
RSC-4128
Data Sheet
RSC-4128 Overview
The RSC-4128 is a member of the Interactive Speech™ line of products from Sensory. It features a high-
performance 8-bit microcontroller with on-chip ADC, DAC, preamplifier, RAM, ROM (except on ROM-less version),
and optimized audio processing blocks. The RSC-4128 is designed to bring a high degree of integration and
versatility into low-cost, power-sensitive applications. Various functional units have been integrated onto the CPU
core in order to reduce total system cost and increase system reliability.
The RSC-4128 operates in tandem with Sensory Speech™ 7 firmware, an ultra compact suite of recognition and
synthesis technologies. This reduced software footprint enables, for example, products with over 150 seconds of
compressed speech, multiple speaker dependent and independent vocabularies, speaker verification, and all
application code built into the RSC-4128 as a single chip solution. Revolutionary Text-to-Speaker-Independent
(T2SI) technology allows the creation of SI recognition sets by simply entering text.
The CPU core embedded in the RSC-4128 is an 8-bit, variable-length-instruction microcontroller. The instruction
set is similar to the 8051 microcontroller, and has a variety of addressing mode,
MOV
and 16 bit instructions. The
RSC-4128 processor avoids the limitations of dedicated A, B, and DPTR registers by having completely
symmetrical sources and destinations for all instructions.
The RSC-4128 provides a high level of on-chip features and special DSP engines, providing a very cost effective
mixed signal platform for general-purpose applications and development of custom algorithms. The full suite of
industry standard tools for easy product development makes the RSC-4128 an ideal platform for consumer
electronics.
RSC-4128 Block Diagram
3 .5 8 M H z O s c illa t o r
o r R e s o n a to r
3 2 K H z O s c illa t o r ( o p t io n a l)
ADC
D ig ita l
F ilte r s
A u d io
W akeup
AGC
T im e r s ( 3 )
W a tc h d o g
T im e r
DAC
DAC Out
M ic r o p h o n e
P re -A m p
and
G a in
C o n tr o l
PW M
RSC
G e n e ra l P u rp o s e
M ic r o c o n tr o lle r
Speaker O ut
In te r n a l R o m S p a c e
(0 K , 1 2 8 K o r 2 5 6 K )
4 .8 K S R A M
L o w B a tte r y
D e te c tio n
R S C - 4 x S e r ie s
C o m p a r a to r s
( 4 In p u t)
V e c to r A c c e le r a to r
w ith T w in D M A
G e n e r a l P u r p o s e I/O
3 x 8 - b it p o r ts ( 2 4 I/O )
E x t e r n a l M e m o r y In t e r f a c e
8 - b it d a ta , 2 0 - b it A d d r e s s
4
P/N 80-0206-J
© 2004 Sensory Inc.
Data Sheet
RSC-4128
Speech Technologies
Speech Recognition
The RSC-4128 is designed to support HMM (Hidden Markov Modeling) as well as Neural Network technologies
provided in Sensory Speech™ 7 firmware, to perform speaker independent (SI) speech recognition. Speaker
independent recognition requires on-chip or off-chip ROM to store the words to be recognized.
Speaker dependent (SD) recognition requires programmable memory to store personalized speech templates.
This programmable memory may be on-chip SRAM or off-chip Serial EEPROM, Flash Memory, or SRAM.
The RSC-4128 has several additional speech recognition features as described below:
Speaker Independent
recognition requires no user training. The RSC-4128 can recognize up to 20 words in an
active set (number of sets is limited only by internal ROM or external memory size). Text-to-SI (T2SI) recognition,
based on HMM technology, allows creation of SI recognition sets in seconds by simply typing in the vocabulary
desired, with no costs or delays associated with recording and training the recognizer.
Speaker Dependent
recognition allows the user to create names for products or customize vocabularies. Up to
100 words can be recognized in an active set (number of sets is limited only by internal ROM or external memory
size). The RSC-4128 can store up to 10 SD words in on-chip SRAM.
Continuous Listening
allows the chip to continuously listen for a specific trigger word. With this feature, a product
“activates” when a specific word is spoken, framed by quiet before and after. Continuous listening provides the
lowest false fire rate for trigger words.
Word Spotting
allows the chip to continuously recognize for up to 10 SI or 5 SD words at a time. In word spotting
mode, the word(s) to be recognized may be spoken in the middle of speech.
Speaker Verification
The RSC-4128 also supports Sensory’s speaker verification (SV) technology – the most successful biometric
security on the market. After a speaker trains the chip on a specific word or words, the chip is able to identify
whether a particular word is spoken by the original speaker. The RSC-4128 can store up to 10 SV templates on-
chip, or more with external programmable memory.
Speech and Music Synthesis
The RSC-4128 provides high-quality speech synthesis using state-of-the-art frequency domain techniques in
Sensory’s new “SX” synthesis technology. Typical data rates for SX are approximately 6000 bits per second. One
may select various data rates from approximately 3.7 to 7.8Kbps to manage speech quality versus allotted memory.
Speech, music and sound effects may also be produced using the RSC-4128 8 bit, 58Kbps or 4 bit, 30Kbps
compression technologies.
The RSC-4128 provides high-quality, eight-voice, wave table music synthesis which allows multiple, simultaneous
instruments for harmonizing. The RSC-4128 uses a MIDI-like system to generate music. One or more of the eight
voices may be speech playback instead of music. One or more of the eight voices may be a drum track comprising
multiple drums. In effect, this allows the number of simultaneous instruments to exceed 8.
Speech and music synthesis requires on-chip or off-chip ROM to store data for synthesis playback. Easy to use
tools allow the developer to record and compress their own voice talents and create with the push of a button, or to
create their own MIDI scores and instruments.
Record and Playback
The RSC-4128 can perform speech record and playback (sometimes called “voice memo”) at various compression
levels depending on the quantity and quality of playback desired. Data rates less than 14,000 bits per second are
achievable while maintaining very high quality reproduction. The record and playback technology also performs
silence removal to improve sound quality and reduce memory requirements.
5
P/N 80-0206-J
© 2004 Sensory Inc.