New Low-level Microphone API

Open Source & Free

DEVELOPERS

Getting Started

Docs

Demos

Plugins

Dashboard

RESOURCES

Academy/Training

Tutorials

Videos

Templates

Compare

Search

Blog

COMMUNITY

GitHub

Stack Overflow

Reddit

Forum/Help

PRICING

SUPPORT

Services

FAQ

Contact Us

Menu

Search

Search
Close this search box.

Login/Signup

Login/Signup

New Low-level Microphone API

Home

Blog

New Low-level Microphone API

New Low-level Microphone API

Steve Hannah

January 2, 2020

No Comments

Today’s blog post will delve further into our new media features. We’ve recently added an API to access raw PCM data from the device’s microphone. Previously, the media recording API could only be configured to save audio to a file. This is fine for most use cases, but sometimes it is necessary to access the the raw PCM stream directly. For example for voice recognition, or audio processing, or audio visualization.

How it works

In order to access an audio PCM stream, you need to create an AudioBuffer object, which will be used as a destination for microphone input.

AudioBuffer buffer = MediaManager.getAudioBuffer("mybuffer.pcm", true, 4096);

A couple of points here:

The “mybuffer.pcm” is the virtual path to the buffer. Think of it like a file path that doesn’t correspond to an actual file. This can be any arbitrary string. We will be referencing it later when we construct the media recorder, to redirect its output to this audio buffer.

The 2nd parameter (true) says to “create” the audio buffer object if it doesn’t already exist in the central registry.

The 3rd parameter, is the buffer size. You can put anything here, and the API will adapt. I’m using 4096 here, but that was chosen rather arbitrarily.

Next, you add a callback which will be executed whenever the buffer’s contents are changed. This happens when a new chunk of PCM data is available from the microphone.

final float[] sampleData = new float[buffer.getMaxSize()]; buffer.addCallback(buf->{ buf.copyTo(sampleData); int sampleRate = buf.getSampleRate(); int numChannels = buf.getNumChannels(); int len = buf.getSize(); sendDataToServerForProcessing(sampleData, 0, len, sampleRate, numChannels); });

Some key points here:

The callback does NOT run on the EDT. It runs on its own thread.

The buf.copyTo() method will copy all new data from the buffer into our own float[] array. It will only write values in the range [0, buf.getSize()). Each entry will be a float between -1 and 1.

buf.getSize() may return a different value in each invocation, as the “size” of the buffer reflects the current data in the buffer. Not to be confused with the maxSize of the buffer, which is the original size of the buffer, as it was created in the getAudioBuffer() method.

If you are processing the data in any way, you’ll need to know both the sampleRate, and the number of channels of the input. It is important to collect this data from the audioBuffer inside this callback, and not depend on the settings you provided originally to createMediaRecorder().

We’ll use MediaRecorderBuilder to construct our media recorder now as follows:

MediaRecorderBuilder mrb = new MediaRecorderBuilder() .path("mybuffer.pcm") .redirectToAudioBuffer(true); Media microphone = MediaManager.createMediaRecorder(mrb);

Notice that, for the path() parameter of the builder, we use the same value we used in getAudioBuffer(). This is extremely important, otherwise the media recorder won’t run the callback in your AudioBuffer instance.

We can start recording now using microphone.play(), and pause using microphone.pause(). Or use the new async APIs, playAsync() and pauseAsync() to gain more clarity about the recording state.

Saving PCM Stream to a WAV File

In order to test the AudioBuffer class, we needed to be able to play the PCM stream that we capture to make sure that it is working correctly, and that it hasn’t been corrupted in any way. We added a class, WAVWriter, for writing a PCM stream to a WAV file to facilitate this testing. A WAV file, after all, just contains a raw stream of PCM data, with some headers to declare the data format, so this class is pretty minimal.

The following example records directly from the PCM stream to a WAV file in file system storage.

WAVWriter wavWriter; AudioBuffer audioBuffer; private void record() throws IOException { audioBuffer = MediaManager.getAudioBuffer(bufferPath, true, 4096); final float[] floatSamples = new float[audioBuffer.getMaxSize()]; audioBuffer.addCallback(buf->{ synchronized(clipLock) { if (wavWriter == null) { try { wavWriter = new WAVWriter( new File(fileName), buf.getSampleRate(), buf.getNumChannels(), 16 ); } catch (IOException ex) { Log.e(ex); return; } } buf.copyTo(floatSamples); try { wavWriter.write(floatSamples, 0, buf.getSize()); } catch (IOException ex) { Log.e(ex); } } }); } MediaRecorderBuilder builder = new MediaRecorderBuilder() .audioChannels(1) .path(bufferPath) .redirectToAudioBuffer(true); MediaManager.createMediaRecorder(builder)); synchronized(clipLock) { wavWriter = null; } } // … And when you’re finished recording, just close the WAVWriter // for the file to be written. wavWriter.close();

The key parts of this example are:

We don’t necessarily need to instantiate the WAVWriter object inside the AudioBuffer callback, but we do need some information that the callback provides: the sample rate, and number of channels. This information is supplied in the audiobuffer callback, and won’t change, so you could also just fetch this information in the first callback, and store it for when and where you do instantiate the WAVWriter object.

The WAVWriter.write(float[] samples, int offset, int len) method is where you can pass the PCM samples directly to WAV file.

Remember to call close() on the WAVWriter to ensure that the file is written.

You can find some examples using the AudioBuffer and WAVWriter classes to write PCM streams to a WAV File in the Samples project. Specifically, the AsyncMediaSample and the AudioBufferSample.

Sample Rates and Downsampling

A PCM data stream is a digital approximation of a sound wave form. The sample rate, usually expressed in Hz (hertz) is the number of samples we extract per second. A sample rate of 16000 Hz indicates that we are extracting 16000 floating point values (per channel) per second. The higher the sample rate, the better wave approximation will be, and therefore the better quality the sound will be. But higher sample rates also correspond to larger file sizes.

When you construct a media recorder, you can request a specific sample rate, but there is no guarantee that the underlying platform will comply with your request. Some platforms only support the native sample rate of the audio hardware, so you’re at the mercy of the audio chip to a certain extent. You can find out the actual sample rate by calling audioBuffer.getSampleRate(), any time after the first callback is executed – as this is where the platform informs the audio buffer about the underlying sample rate.

Some common sample rates you’ll see are 16000, 22050, 44100, and 48000. If you are passing the PCM data stream to service that only accepts a certain sample rate, then you may need to downsample the data. The AudioBuffer class includes a downsample() method with a rudimentary algorithm that may be sufficient for some cases. It is lacking some of the features of high-end down-sampling algorithms, such as low pass filtering, and it does noticeably lower the audio quality, but if your application doesn’t need “perfect sound”, then it might be appropriate for your needs. If you do need perfect sound, you should either perform the downsampling server-side, or use a 3rd-party sound library.

The downSample() method works as follows:

audioBuffer.downSample(16000); // downsample to 16000Hz)

You should call this method inside your callback, before copying the data to your float samples buffer. This is because it will modify the data in the audio buffer, and update both the “size” property, and the “sampleRate” property of the buffer, to accurately reflect the new sample rate.

The AudioBufferSample includes an example usage of this method.

Leave a Reply Cancel reply
You must be logged in to post a comment.

SUBSCRIBE TO OUR NEWSLETTER

Email

Name

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

TWITTER

Tweets Liked by @Codename_One

Follow Us on Twitter

Tags
Android App Bundle API Level 30 App Security Async Debugging Build-Hints Editor Code Freeze Codename One 7.0 CodeRAD Control Center CSS Units data processing debugging Demos Hacktoberfest Inspect Component IntelliJ iOS Certificate Jailbreak JavaDocs Java iOS Development Javascript Port Kitchen Sink Learning Codename One Lightrun Local Builds Maven Moving to Xcode 12 New Website Paddle Pricing Change Property Sheet Reddit Support Rooted Simulator Single-Sign-On Spring Boot SSO Themes Top 10 Best Cross Platform App Development Frameworks VSCode

OTHER RESOURCES

STACKOVERFLOW

REDDIT

GITHUB

Quick Start with Codename One initializr

Get Started

Learn all about building
native mobile apps using Java

CN1 Academy

Newsletter

Subscribe to our Newsletter to get important News & Updates:

Email

Name

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Github Stack-overflow Twitter Facebook Linkedin Instagram

Important Links

Search This Site

Documentation

Support Forums

Plugins/cn1libs

Blog

Recommended Sites

About Us

Affiliate Program

Press

Privacy Policy

Terms of Service

Site Map

Reinventing mobile development.

~
0
M

Apps Installed

0
k+

Developers

Codename One LTD © 2022. All Rights Reserved.

The Java® logo and name are trademarks of Oracle corp. Facebook and the Facebook logo are trademarks of Facebook. Uber and the Uber logo are trademarks of Uber Corp.
Terms of Use

[x]
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
REJECT ACCEPT
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.

Necessary
Necessary
Always Enabled

Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.

Non-necessary
Non-necessary

Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.

SAVE & ACCEPT

Open Source & Free

New Low-level Microphone API

New Low-level Microphone API

How it works

Saving PCM Stream to a WAV File

Sample Rates and Downsampling

Leave a Reply Cancel reply

SUBSCRIBE TO OUR NEWSLETTER

TWITTER

Tags

OTHER RESOURCES

Quick Start with Codename One initializr

Learn all about building native mobile apps using Java

Newsletter

Important Links

Reinventing mobile development.

Learn all about building
native mobile apps using Java