RSC-464
Speech Recognition Processor
Preliminary Data Sheet
General Description
The RSC-464 is the newest member of Sensory’s
RSC-4x Family of microcontrollers with on-chip
speech I/O capabilities. The RSC-464 has many
features of the RSC-4128, but reduced in cost by
integrating less memory. The RSC-464 is designed to
bring high performance speech I/O features to cost
sensitive embedded and consumer products. Based
on an 8-bit microcontroller, the RSC-464 integrates
speech-optimized digital and analog processing
blocks into a single chip solution capable of accurate
speech recognition; high quality, low data-rate
compressed speech; and advanced music. Products
can use one or all features in a single application.
The RSC-464 operates in tandem with the radically
new FluentChip™ technology, offering the best
speech recognition technologies in the industry.
FluentChip™ includes Hidden Markov Model-Neural
Net hybrid speech recognition. Accuracy in all kinds
of noise is dramatically improved. New Speaker
Verification technology is perfect for voice password
security applications that must work in noisy
environments. New high quality compressed speech
technology reduces data rates by 5 times. New 8-
voice MIDI-compatible music includes drum tracks,
effectively increasing instruments beyond 8.
Simultaneous music and speech rounds out the
FluentChip™ technology.
FluentChip™ technology tools also support the
revolutionary capability of creating speaker
independent recognition sets by simply typing in the
desired recognition vocabulary! A few keystrokes
creates a recognition set in seconds without the wait
or cost of recording sessions to train the recognizer,
speeding time to sales.
The Audio Wakeup feature listens while the RSC-464
is in power down mode. When an audio event such
as a clap or whistle occurs, Audio Wakeup will
wakeup the RSC-464 for speech or application tasks.
Audio Wakeup is perfect for battery applications that
require continuous listening and long battery life.
The RSC-464 provides further on-chip integration of
features. A complete speech I/O application can be
built with as few additional parts as a clock crystal,
speaker, microphone, and few resistors and
capacitors.
Moreover, the RSC-464 provides an unprecedented
level of cost effective system-on-chip (SOC)
integration, enabling many applications that require
DSP and/or audio processing. The RSC-464 may be
used as a general-purpose mixed signal processor
platform for custom algorithms, technologies and
applications.
Features
Full Range of FluentChip™ Capabilities
Noise-robust Speaker Independent and Speaker
Dependent recognition
Many languages now available for international use
Speaker Verification – voice password biometric security
Word Spotting and Continuous Listening recognition
options
High quality, 2.4-10.8 kbps speech synthesis & sound
effects, with Sensory SX
™
synthesis technology
8 voice MIDI-compatible music synthesis coincident with
speech; drum track feature enables additional voices
Voice Record & Playback (voice memo)
Audio Wake Up from sleep with whistles or claps
Touch Tone (DTMF) output
Integrated Single-Chip Solution
8-bit microcontroller
64K bytes ROM
16 bit ADC, 10 bit DAC & PWM, and microphone pre-
amplifier; PWM 30% louder than before!
Independent, programmable Digital Filter engine
2.8 KBytes total RAM (262 bytes “user” application RAM)
Five timers (3 GP, 1 Watchdog, 1 Multi Tasking)
Twin-DMA, Vector Math accelerator, and Multiplier
Built-in Analog Comparator Unit (4 inputs)
On chip storage for SD, SV, templates
16 configurable I/O lines with 10 mA (typical) outputs
Uses low cost 3.58MHz crystal (internal PLL)
Low EMI design for FCC and CE requirements
Fully nested interrupt structure with up to 8 sources
Optional Real Time Clock
Long Battery Life
2.4 – 3.6V operation
10mA (typical) operating current at 3V during
2 low power modes; 1
µ
A typical sleep current
Full Suite of Quick & Powerful Tools
Quick Text-to-SI (T2SI) text entry to build noise robust SI
recognition sets – low cost & push-button – no recording!
Quick Synthesis for push-button speech compression
Integrated Development Environment, C Compiler,
Debugger & In Circuit Emulator from Phyton, Inc.
© 2005 Sensory Inc.
P/N 80-0282-A
1
RSC-464
Preliminary Data Sheet
Table of Contents
General Description ........................................................................................................................................................................ 1
Speech Technologies ..................................................................................................................................................................... 4
Speech Recognition .............................................................................................................................................................................................. 4
Speech and Music Synthesis ................................................................................................................................................................................ 4
Record and Playback ............................................................................................................................................................................................ 4
RSC-464 Architecture..................................................................................................................................................................... 5
Reference Schematics.................................................................................................................................................................... 7
Using the RSC-464......................................................................................................................................................................... 8
Instruction Set ....................................................................................................................................................................................................... 8
Stack ..................................................................................................................................................................................................................... 9
Register and User RAM ........................................................................................................................................................................................ 9
L1 Vector Accelerator/Multiplier .......................................................................................................................................................................... 10
Power and Wakeup Control ................................................................................................................................................................................ 10
General Purpose I/O ........................................................................................................................................................................................... 11
Memory Addressing ............................................................................................................................................................................................ 13
Oscillators ........................................................................................................................................................................................................... 13
Clocks ................................................................................................................................................................................................................. 14
Timers/Counters.................................................................................................................................................................................................. 15
Interrupts ............................................................................................................................................................................................................. 18
Audio Wakeup..................................................................................................................................................................................................... 21
Microphones........................................................................................................................................................................................................ 22
Reset................................................................................................................................................................................................................... 23
Digital-to-Analog-Converter (DAC) Output.......................................................................................................................................................... 23
Pulse Width Modulator (PWM) Analog Output.................................................................................................................................................... 25
Comparator Unit.................................................................................................................................................................................................. 26
Instruction Set Opcodes and Timing Details ................................................................................................................................. 28
MOVE Group Instructions ................................................................................................................................................................................... 28
ROTATE Group Instructions ............................................................................................................................................................................... 29
BRANCH Group Instructions............................................................................................................................................................................... 29
ARITHMETIC/LOGICAL Group Instructions ....................................................................................................................................................... 29
MISCELLANEOUS Group Instructions ............................................................................................................................................................... 30
Special Functions Registers (SFRs) Summary............................................................................................................................. 31
DC Characteristics........................................................................................................................................................................ 33
Absolute Maximum Ratings .......................................................................................................................................................... 33
Package Options .......................................................................................................................................................................... 34
Die Pad Ring ................................................................................................................................................................................ 37
RSC-464 Die Bonding Pad Locations........................................................................................................................................... 38
Mechanical Data ........................................................................................................................................................................... 39
Ordering Information..................................................................................................................................................................... 40
The Interactive Speech™ Product Line ........................................................................................................................................ 41
2
P/N 80-0282-A
© 2005 Sensory Inc.
Preliminary Data Sheet
RSC-464
RSC-464 Overview
The RSC-464 is a member of the Interactive Speech™ line of products from Sensory. It features a high-
performance 8-bit microcontroller with on-chip ADC, DAC, preamplifier, RAM, ROM, and optimized audio
processing blocks. The RSC-464 is designed to bring a high degree of integration and versatility into low-cost,
power-sensitive applications. Various functional units have been integrated onto the CPU core in order to reduce
total system cost and increase system reliability.
The RSC-464 operates in tandem with FluentChip™ firmware, an ultra compact suite of recognition and synthesis
technologies. This reduced software footprint enables, for example, products with 60 seconds of compressed
speech, multiple speaker dependent and independent vocabularies, speaker verification, and all application code
built into the RSC-464 as a single chip solution. Revolutionary Text-to-Speaker-Independent (T2SI) technology
allows the creation of SI recognition sets by simply entering text.
The CPU core embedded in the RSC-464 is an 8-bit, variable-length-instruction microcontroller. The instruction set
is similar to the 8051 microcontroller, and has a variety of addressing mode,
MOV
and 16 bit instructions. The RSC-
464 processor avoids the limitations of dedicated A, B, and DPTR registers by having completely symmetrical
sources and destinations for all instructions.
The RSC-464 provides a high level of on-chip features and special DSP engines, providing a very cost effective
mixed signal platform for general-purpose applications and development of custom algorithms. The full suite of
industry standard tools for easy product development makes the RSC-464 an ideal platform for consumer
electronics.
RSC-464 Block Diagram
3
P/N 80-0282-A
© 2005 Sensory Inc.
RSC-464
Preliminary Data Sheet
Speech Technologies
Speech Recognition
The RSC-464 is designed to operate in tandem with the FluentChip™ technology library, including speaker
independent (SI), speaker dependent (SD), and speaker verification (SV) speech recognition. Combinations of
these technologies may used to create applications that are rich in features. These are described below:
Speaker Independent
recognition requires no user training. The RSC-464 can recognize up to 15 commands in
an active set (number of sets is limited only by internal ROM size). Text-to-SI (T2SI), based on a hybrid of
Hidden Markov Modeling and Neural Net technologies, allows creation of accurate SI recognition sets in
seconds. SI requires on-chip ROM.
Speaker Dependent
recognition allows the user to create names for products or customize recognition sets. SD
is implemented with DTW (dynamic time warping) pattern matching technology. SD requires programmable
memory to store the personalized speech templates(trained patterns) that may be on-chip SRAM, or off-chip
serial EEPROM, Flash Memory, or SRAM. Up to 50 templates can be recognized in an active set (the number of
unique sets is limited only by programmable memory capacity). The RSC-464 can store 1 SD templates in on-
chip SRAM.
Speaker Verification
enables the RSC-464 to authenticate when a previously trained password is spoken by the
target user. SV is also implemented with DTW technology. 1 SV template can be stored in on-chip SRAM, or
more with external programmable memory such as delineated in SD above.
Word Spotting
enables the RSC-464 to spot a specific word surrounded by other speech within a phrase. This
can be quite effective when the users response may vary (e.g. spotting “telephone” in the phrases “ummm
telephone”, or “telephone call”). This option is available for SI and SD.
Continuous Listening
allows the chip to continuously listen for a specific word. This may be used as a trigger
word to request a device to listen for a command. This option is available for SI and SD.
Speech and Music Synthesis
The RSC-464 provides high-quality speech compression using Sensory SX™ technology. One may select various
data rates from approximately 2.4 to 10.8Kbps to manage speech quality versus allotted memory. The highest data
rates use 16KHz sample rates to provide high quality reproduction of high pitched voices. Speech and sound
effects may also be compressed using 8-bit PCM (64Kbps) or 4-bit ADPCM (32Kbps) technologies.
The RSC-464 also provides eight-voice, wave table music synthesis which allows multiple, simultaneous
instruments for harmonizing. The RSC-464 uses a MIDI-like system to generate music. One or more of the eight
voices may be speech playback instead of music. One or more of the eight voices may be a drum track comprising
multiple drums. In effect, drum tracks allow the number of simultaneous instruments to exceed 8.
Speech and Music data may be stored in on-chip ROM. Speech data may alternatively be stored in off-chip serial
data ROM or serial data Flash for extended durations.
Easy to use tools allow the developer to record and compress their own voice talents and create with the push of a
button, or to create their own MIDI scores and instruments.
Record and Playback
The RSC-464 can perform speech record and playback (sometimes called “voice memo”) using either 8 bits
(64Kbps) or 4 bits (32Kbps) per sample, depending on the quantity and quality of playback desired. The record and
playback technology also optionally performs silence removal to reduce memory requirements.
External serial Flash or SRAM is required to store the compressed speech.
4
P/N 80-0282-A
© 2005 Sensory Inc.
Preliminary Data Sheet
RSC-464
RSC-464 Architecture
The RSC-464 is a highly integrated speech and analog I/O mixed signal processor that combines:
8-bit microcontroller with enhanced instructions and interrupt control, superior register architecture, independent
Digital Filter engine and “L1” Vector Math Accelerator
On-chip ROM and RAM (2.8 Kbytes).
Input microphone preamp and 16 bit Analog-to-Digital Converter (ADC) for speech and audio/analog input
10 bit Digital-to-Analog Converter (DAC), and 10 bit Pulse Width Modulator (PWM) to directly drive a speaker or
other analog device
Low power Audio Wakeup from power down mode, when a selected audio event, such as clap or whistle, occurs
RSC-464 Internal Block Diagram
Two bi-directional ports provide 16
configurable, general-purpose I/O
pins to communicate with or control
external devices with a variety of
source and sink currents. Up to 4 of
these I/O may be used as
programmable Analog Comparator
inputs. 16 may be used as I/O
wakeup.
The RSC-464 has a high frequency
(14.32 MHz) clock as well as a low
frequency (32,768 Hz) clock. The
processor clock can be selected
from either source, with a selectable
divider value. The device performs
speech recognition when running at
14.32 MHz.
OSC1 is a very low-cost 3.58 MHz
crystal oscillator that is used by a
4X PLL to generate the 14.32MHz
clock. The OSC2 oscillator provides
the options of using an external
crystal or its own internal RC
devices (no external components
required for the internal RC mode).
There are three programmable,
general-purpose 8-bit counters /
timers – Timers 1 and 3 are derived
from OSC1, and Timer2 from
OSC2. There is also a Watchdog
timer that may be used to exit an
undesired condition in program flow,
and Multi-tasking timer to allow chip
operations to share resources in
parallel.
A single chip speech I/O solution may be created with the RSC-464. An external microphone passes an audio
signal to the preamplifier and ADC to convert the incoming speech signal into digital data. Speech features are
extracted using the Digital Filter engine. The microcontroller CPU processes these speech features using speech
recognition algorithms in firmware, with the help of the “L1” Vector Accelerator and enhanced instruction set. The
resulting speech recognition results may be used to control the consumer product application code, or to output
5
P/N 80-0282-A
© 2005 Sensory Inc.