KimJinkee1
JangYunyoung1
KimJong Pal1
-
(Advanced Research Center for Mechatronics Engineering, School of Mechatronics Engineering,
Korea University of Technology and Education, Cheonan 31253, Korea)
Copyright © The Institute of Electronics and Information Engineers(IEIE)
Index Terms
Microphone, low power, sound activity monitor, wake-up time
I. INTRODUCTION
MEMS technology has replaced a microphone with a MEMS type from a conventional electret
condenser (ECM) thanks to the advantages of mass production and miniaturization using
a semiconductor process [1]. Subsequently, MEMS microphones are emerging as application examples that are combined
with artificial intelligence [2]. Recent great advances in artificial intelligence have brought voice recognition
performance to human levels [3].
Intelligent MEMS microphones are turned on 24 hours to recognize user’s voice commands,
so power consumption becomes one of key competition factors. A readout integrated
circuit (ROIC) can achieve low power consumption by averaging power consumption during
long low power mode periods and short normal power mode periods. Most of the time
when there is no voice command, the ROIC operates in a low power mode. If an external
sound is generated while the ROIC is in low power mode, the ROIC immediately switches
to normal power mode. To enable this operating scenario, a SAM that detects external
sound in low power mode and controls switching between power modes is required. Therefore,
SAMs are required to have extremely low power. Fig. 1 shows an operating scenario for a microphone utilizing SAM. The power mode of a source
follower (SF) is controlled according to the output signal (WUP) of the SAM. When
there is no sound input, the SAM generates a LOW state in the WUP signal and the SF
stays in low power mode. When an external sound input occurs, the SAM outputs a HIGH
state signal (WUP) and the SF switches to normal power mode.
It is recommended that the wake-up time from low power mode to normal power mode should
be as short as possible to fully detect the initiation of a user command. Therefore,
SAM should quickly detect the voice signal and switch the SF’s power mode. When switching
the power mode, it affects the shift of the operating point of the circuit, so the
power mode may not switch to the desired mode and return to the previous mode. In
the next chapters, a novel SAM circuit is presented to improve issues such as low
power, fast wake-up and power mode switching robustness.
Fig. 1. Operation scenario of microphone with SAM: (a) SF operates in low power mode when no external sound exists; (b) SF operates in normal power mode when external sound exists.
II. ARCHITECTURE AND CIRCUITS
Fig. 2 shows the microphone ROIC consisting of a source follower (SF) and a low-power, fast
wake-up SAM. SF works as an analog buffer whose power mode is controlled by the WUP
signal. When WUP is LOW, SF stands by in low power mode, and operates in normal power
mode when WUP is HIGH. SAM consists of open amplifier (OA), 2$^{\mathrm{nd}}$amplifier
(AMP2), comparator (CMP), and decision logic circuit (LOGIC). OA and AMP2 amplify
the output signal of SF, CMP converts the presence or absence of an AC signal corresponding
to sound input into a digital signal, and LOGIC outputs the control signal WUP. In
Fig. 2, there are two SFs and they output differential signals (SF$_{OP}$/SF$_{ON}$). If
the MEMS transducer is a differential type, all SF inputs (SF$_{IP}$/SF$_{IN}$) are
connected to the MEMS transducer, and if the MEMS transducer is a single type, only
the SF input SF$_{IP}$ is connected to the MEMS transducer and the SF input SF$_{IN}$
is connected to a fixed bias.
Fig. 3 shows the detailed circuit diagram of the SF and how it is connected to the MEMS
transducer. MEMS transducers can be modeled with a capacitance C$_{\mathrm{mems}}$
that fluctuates with sound pressure and a fixed parasitic capacitor C$_{\mathrm{p}}$.
When a sound pressure is generated from the outside, an ac voltage signal corresponding
to V$_{m}$ (= V$_{dr}$ ${\times}$ C$_{\mathrm{mems}}$/(C$_{\mathrm{mems}}$ + C$_{\mathrm{p}}$))
is generated. SF analog buffers this V$_{m}$ voltage.
The input signal SF$_{IP}$ is buffered to SF$_{OP}$ by an MN30 operating as source
follower. The SF$_{IP}$ is connected to V$_{bsf}$ using a PMOS pseudo resistor R$_{bsf}$,
to make same the DC voltage as V$_{bsf}$. In Fig. 3, V$_{bsf}$ is 0.9 V, and R$_{bsf}$ has a value of 146.7 G${\Omega}$ from simulation.
When the WUP signal is LOW, only 100 nA of current (I$_{\mathrm{LP}}$) flows in the
source follower MN30 and the SF operates in low power mode. When the WUP signal is
HIGH, 7.6 ${\mu}$A of current (I$_{\mathrm{LP}}$ + I$_{\mathrm{NP}}$) flows in the
source follower MN30 and the SF operates in normal power mode.
The SAM for intelligence microphone should have low power consumption. For low power
and fast wake-up characteristics, the envelope detector used in the conventional SAM
structure in the previous work is removed [4].
Fig. 4(a) shows the detail circuit diagram of OA. The OA consists of a high pass filter (HPF),
an inverter-based amplifier and common mode feedback. The amplification gain of OA
is based on the open-loop type rather than the closed-loop type because the application
does not require precise gain. In the case of an open loop type amplifier, compensation
is not required to secure stability, so it has advantages in terms of power consumption
and area [5,6].
Fig. 4(b) shows the circuit diagram of AMP2. The AMP2 have charge amplifier structure to remove
the DC offset and common-mode signal from OA outputs [7,8]. The resistors used in the HPF of OA and the feedback loop of AMP2 are adjustable
pseudo resistors using the PMOS referenced in [9].
The CMP is designed with positive feedback structure. The CMP makes digital 1 bit
signal by comparing the differential output of AMP2. A detailed description of the
analog circuits can be found in the author’s previous paper on the design of the SAM
[10].
Fig. 5 shows the circuit diagram of LOGIC. SAM should make the WUP signal rising edge when
the voice signal occurs to switch the SF to normal power mode. When a rising edge
occurs in CMP due to a voice signal, the WUP signal immediately becomes HIGH through
D flip-flops (DFF#1, DFF#2).
When no more voice signal is input to the SAM, the WUP signal should go from HIGH
to LOW. The WUP signal goes LOW when the low active reset signal (RST_FALLING) resets
DFF#2. Low active reset signals are generated in the auto pulse block (AP) when a
rising edge occurs on the input. If there is no sound from the outside for a certain
period of time, the CALM signal changes from LOW to HIGH, and a reset signal is generated
on AP#2 output. When there is a signal change in input signal CMP$_{out}$, DIV is
continuously reset by AP#1 and output CALM maintains LOW state. If there is no more
signal change in the input signal CMP$_{out}$, DIV is no longer reset and the output
value is converted from LOW to HIGH after and certain period of time (T$_{wait}$).
The T$_{wait}$ time can be adjusted by register setting.
At the moment WUP transitions from HIGH to LOW, the current flowing through SF decreases,
causing the output DC value to step up. This DC step in SF acts as an artifact, leading
to false positive detection of sound. This artifact changes the power mode back to
the normal power mode, and thus falls into an infinite circulation loop in the normal
power mode. Therefore, after WUP changes from HIGH to LOW, DFF#1 responds to the input
signal CMP$_{out}$ after half cycle of CLK. DFF#1 does not transfer CMP$_{out}$ change
to output until it is reset by EN_INPUT signal. The EN_INPUT signal is generated by
combining the CALM signal and the CLK signal.
Fig. 2. Block diagram of ROIC.
Fig. 3. Detailed circuit diagram of SF and connection to MEMS.
Fig. 4. Detailed circuit diagram of (a) OA; (b) AMP2.
Fig. 5. Detailed circuit diagram of LOGIC.
III. MEASUREMENT RESULTS
Fig. 6 show a picture of a chip fabricated based on the CMOS 0.18 ${\mu}$m process. Regions
corresponding to blocks in the schematic are indicated by rectangular boxes.
Fig. 7(a) shows the frequency response of the SF, OA, and AMP2. The SF has a bandwidth of 5.2
kHz in low power mode and 35.3 kHz in normal power mode. The OA block has a high-pass
cutoff frequency of 2.1 Hz, a low-pass cutoff frequency of 1.7 kHz and a peak gain
of 44 dB (158.5 V/V) at 100 Hz. The AMP2 block has a high-pass cutoff frequency of
1.5 Hz, a low-pass cutoff frequency of 3.1 kHz, with a peak gain of 19.5 dB (9.4 V/V).
Therefore, the overall gain of the analog amplifiers (OA + AMP2) is 63.5 dB. The output
signal of AMP2 may be saturated when a large input signal is supplied to the SAM.
However, since SAM detects only the presence of sound, it does not matter even if
the signal is saturated. Since the audio signal is an ac type signal, the saturated
signal will be in the form of a digital clock, and in this case, it will be a more
favorable condition for detecting the presence or absence of sound in the comparator
and logic.
Fig. 7(b) shows the noise characteristics of the combined blocks of SF, OA and AMP2. Input
referred noise at 1 kHz has 351.4 nV$_{\mathrm{rms}}$/${\sqrt{}}$Hz in low power mode
and 277.8 nV$_{\mathrm{rms}}$/${\sqrt{}}$Hz in normal power mode.
Fig. 8 shows the signals of successive blocks (SF, OA, AMP2, and CMP) and the WUP signal
of LOGIC block. When EN_SOURCE is HIGH, a sinusoidal signal with a magnitude of 150
${\mu}$V$_{\mathrm{pk}}$ and a frequency of 1 kHz was fed to the SF input. While the
sinusoidal wave input is applied from time t$_{1}$ to time t$_{2}$, the successively
amplified differential output signal OA$_{OPN}$ of OA and the differential output
signal AMP2$_{OPN}$ of AMP2 can be seen. It can also be seen that LOW and HIGH signals
appear at the comparator output CMP$_{OUT}$ in response to the amplified analog signal.
As soon as the input signal appears, it can be verified that the WUP signal changes
from LOW to HIGH at time t$_{1}$. As the WUP signal goes HIGH, the SF operates in
normal power mode, and it can be seen that the operation DC level of SF$_{OP}$ changes
at t$_{1}$. This is caused by the increased current flowing in the SF block. The sinusoidal
input is cut off at t$_{2}$ and the WUP signal goes LOW at t$_{3}$ after a T$_{wait}$
period. Since command input may be interrupted for a while depending on syllables
and words, it is necessary to wait for T$_{wait}$ without changing the power mode
even if there is no sound for a while.
As WUP changes from HIGH to LOW at time t$_{3}$, it can be seen that the output value
SF$_{OP}$ of SF steps up. This is because the flowing current of SF is reduced from
7.6 ${\mu}$A to 100 nA as the SF switches from normal power mode to low power mode.
This SF output step signal change is amplified and transmitted to the input signal
of LOGIC, causing a false recognition reaction as if external sound was re-input.
In order to prevent the WUP signal from malfunctioning from these artifacts, the LOGIC
input was enabled after half a clock cycle after the WUP signal falling edge. Since
the anti-artifact function is implemented, we can see that WUP signal does not bounce
back to HIGH and WUP stays LOW at t$_{3}$.
Fig. 9 shows the wake-up time of the ROIC. When a signal is input to the SF, the wake-up
time T$_{WUP}$ is the sum of the time interval T$_{sense}$ and the time interval T$_{set}$.
A 1 kHz, 3 mV$_{\mathrm{pk}}$ sine waveform is input to the SF to measure the wake-up
time. The time interval T$_{sense}$ is the time consumed for WUP to become HIGH after
the signal is input. The time interval T$_{sense}$ was measured as 61.3 ${\mu}$s.
The time interval T$_{set}$ is the time it takes for the SF to switch power modes
and output signals to reach 90% of the input signal. The time interval T$_{set}$ was
measured as 35 ${\mu}$s.
In Table 1, the performance of the proposed SAM is summarized and compared with previous work.
The proposed SAM aims at fast wake-up and achieved a wake-up time of 96.3 ${\mu}$s,
which is much faster than the existing wake-up time of 760 ${\mu}$s to 4 ms. In reference
[4], Yang uses two envelope detectors for the anti-artifact function, so the wake-up
time is as slow as 4 ms and the power consumption is 8.3 uW. On the other hand, in
this paper, the wake-up time was reduced to 96.3us and power consumption to 2.5 uW
because the anti-artifact function was handled by LOGIC without an envelope detector.
In addition, an active mode hold function has been added to suppress frequent power
mode changes by considering short breaks between syllables as command end.
Fig. 6. Fabricated chip photograph.
Fig. 7. Measured analog characteristics: (a) Frequency response; (b) Input referred noise of SF, OA, and AMP2.
Fig. 8. Measured key signals with and without input signal.
Fig. 9. Measured wakeup time.
Table 1. Performance summary and comparison with previous work
Parameter
|
This work
|
TCAS-II’20 [4]
|
ISPLED’19
[11]
|
Fabrication technology
|
0.18 μm
|
0.18 μm
|
0.18 μm
|
Supply voltage
|
1.8 V
|
3.3 V
|
3.3 V
|
SAM power (current) consumption
|
2.5 μW
(1.4 μA)
|
8.3 μW
(2.5 μA)
|
46.2 μW
(14 μA)
|
Wake-up time
|
96.3 μs
|
4 ms
|
760 μs
|
Chip Area
|
2.5 mm2
|
3.7 mm2
|
3.1 mm2
|
Anti-artifact
|
YES
|
YES
|
NO
|
Active mode holding
|
YES
|
NO
|
NO
|
IV. CONCLUSION
A low power, fast wake-up SAM with a novel structure is proposed. SAM reduces the
average temporal power consumption of ROIC by controlling the power of the main channel
according to the presence or absence of sound. Low power consumption was realized
by removing the envelope detector in the previous structure, and artifacts were overcome
by using comparator's hysteresis characteristics and LOGIC. The ROIC using the proposed
SAM was fabricated with a standard CMOS 0.18 ${\mu}$m process and verified through
functional and performance measurements. The total gain and noise of the analog amplifier
(OA + AMP2) measured 63.5 dB and 351.4 nV$_{\mathrm{rms}}$/${\sqrt{}}$Hz respectively
at 1 kHz in low power mode. The power consumption and wake-up time of the fabricated
SAM were measured as 2.5 ${\mu}$W and 96.3 ${\mu}$s, respectively. In addition, it
was confirmed that the proposed immune function works well in response to power mode
change artifact.
ACKNOWLEDGMENTS
This paper was supported by Education and Research promotion program of KOREATECH
in 2021. This paper was supported by "Leaders in INdustry-university Cooperation 3.0
(1345356194)" project grant funded by the Ministry of Education and the National Research
Foundation of Korea (LINC3.0-2022-31). The EDA tool was supported by the IC Design
Education Center (IDEC), Korea.
This paper was supported by Korea Institute for Advancement of Technology (KIAT)
grant funded by the Korea Government (MOTIE) (P0008458, The Competency Development
Program for Industry Specialist, 2023).
References
Alice, M., “Audio Market Expected to Be Worth $20bn in 2022,” electronic specifier,
Accessed May 13, 2023. https://www.electronicspecifier.com/ products/micros/audio-market-expected-to-be-worth-20bn-in-2022.
A, Pele., “Voice Assistants Are Benefiting the MEMS Industry,” EE Times EUROPE, Accessed
May 14, 2023. https://www.eetimes.eu/voice-assistants-are-benefiting-the-mems-industry/.
Meeker, M., “Internet Trends 2018,” KLEINER PERKINS. Accessed May 13, 2023. https://www.kleinerperkins.com/perspectives/internet-trends-report-2018/.
Yang, Y., Lee, B., Cho, J., Kim, S. and Lee, H., “A Digital Capacitive MEMS Microphone
for Speech Recognition With Fast Wake-Up Feature Using a Sound Activity Detector,”
IEEE Transactions on Circuits and Systems II 67, no. 9, 2020, 1509-1513.
Figueiredo, M., Santos-Tavares, R., Santin, E., Ferreira J., Evans, G., and Goes,
J., “A Two-Stage Fully Differential Inverter-Based Self-Biased CMOS Amplifier With
High Efficiency,” IEEE Transactions on Circuits and Systems I 58, no. 7, 2011, 1591-1603.
Sherif, S., “Design of the CMOS Inverter-Based Amplifier: A Quantitative Approach,”
International Journal of Circuit Theory and Applications 47, no. 7, 2019, 1006-1036.
Reid, R. H., Cameron C., “A Low-Power Low-Noise CMOS Amplifier for Neural Recording
Applications,” IEEE Journal of Solid-State Circuits 38, no. 6, 2003, 958-965.
Ye, S. and Xiong, B. Y., “Capacitive Biopotential Measurement for Electrophysiological
Signal Acquisition: A Review,” IEEE Sensors Journal 16, no. 9, 2016, 2832-2853.
Kim, J., “Temperature-insensitive Pseudo-resistor,” Journal of Semiconductor Technology
and Science 22, no. 1, 2022, 47-51.
Kim, J., “Sound Activity Monitor Circuit for Low Power Consumption of Always-On Microphone
Applications,” Applied Sciences 12, no. 23, 2022, 11947.
Yang, Y., Cho, J., Lee, B. and Kim, S., “A Sound Activity Detector Embedded Low-Power
MEMS Microphone Readout Interface for Speech Recognition,” 2019 IEEE/ACM International
Symposium on Low Power Electronics and Design (ISLPED), Lausanne, Switzerland, 2019,
pp. 1-6, doi: 10.1109/ISLPED.2019.8824939.
Jinkee Kim received his B.S. degree in Mechatronics Engineering from Korea University
of Technology and Education, Cheonan, Korea, in 2021, where he is currently pursuing
M.S. degree. His research interests include low power bio-applicable circuit.
Yunyoung Jang received his B.S. degree in Mechatronics Engineering from Korea
University of Technology and Education, Cheonan, Korea, in 2021, where he is currently
pursuing M.S. degree. His research interests include low power bio-applicable circuit.
Jong Pal Kim received his B.S. degree in mechanical design from the Department
of Mechanical Design, Chung-Ang University, Seoul, Korea, M.S. degree in mechanical
engineering from KAIST, Daejon, Korea, and Ph.D. degrees in electrical engineering
and computer science from Seoul National University, Seoul, Korea, in 1995, 1997,
and 2003, respectively. He was a member of research staff at Samsung Advanced Institute
of Technology (SAIT) from 2001 to 2019. In 2020, he joined the Faculty of School of
Mechatronics Engineering, Korea University of Technology and Education, Cheonan, Korea.
His research interests include low power and low noise analog integrated circuits
for biomedical and MEMS applications.