Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!

We spend hours on Instagram and YouTube and waste money on coffee and fast food, but won’t spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!

Learn from Guru Rajesh Kumar and double your salary in just one year.

Get Started Now!

Google AI Team Explains How Its Audio Recorder App Leverages On-Device Machine Learning

Source: digitalinformationworld.com

At the beginning of this month, the Recorder app of Pixel 4 was made available for older Google phones as well. The company has now explained the machine learning behind the on-device transcription tool.

A post on Google AI blog describes the rationale for creating the Recorder app. Speech is the most effective way of communication but there not sufficient ways for capturing and organizing it. The company wants to make ideas and conversations easy to search and be accessible.

According to Google, in the last two decades, they have made the search easier to be in the form of text, visual content, maps, videos or even jobs. Still, most of the important information is recorded and shared in the form of speech, like conversations, lectures, interviews and more. Though it is often difficult to extract the required information from the hours of recording.

The Recorder app has three parts. An automatic speech recognition model, which was first introduced in Gborad in March this year, is powered by transcription that is built on the all-neural on-device system. A “Faster voice typing” is included in the Android keyboard that can work offline after downloading and transcribes character by character.

Hours-long sessions can be recorded on the Recorder and the word mapping to timestamps has been computed by the speech recognition model. Through this, users can click on the word of their choice from the transcript and listen from where they want to.

Though the text is a convenient form to present information but visual and sounds at times are more useful. Every bar of a waveform is of 50 milliseconds and is colored with the dominant sound in that period.

Audio is presented in a colored waveform in which each color identifies a different sound category. Convolutional Neural Networks (CNNs) are used to differentiate the sounds by comparing it with the already published datasets and classify each audio frame.

Also, Google allows three tags every time a recording ends that can be used to form a title for the video instead of using day and time. It helps the Recorder recognize the kind of content at the time of transcribing it.

Related Posts

Google fires second AI ethics leader

Source – https://www.itnews.com.au/ As dispute over research, diversity grows. Google fired staff scientist Margaret Mitchell on Saturday, they both said, a move that fanned company divisions on Read More

Read More

Total and Google to launch AI tool Solar Mapper in Europe

Source: solarpowerportal.co.uk O&G giant Total and Google Cloud are launching a new artificial intelligence (AI) tool to help accelerate the deployment of residential solar panels. Together they Read More

Read More

Unlock a new career in Google Cloud with this mastery bundle

Source: androidguys.com You may not realize this, but you interact with AI technology on a consistent, if not daily basis. And if you do recognize it, chances Read More

Read More

Cloud computing is betting on outer space

Source: livemint.com Microsoft CEO Satya Nadella announced the preview of Azure Orbital at Microsoft Ignite 2020 in New Orleans. According to Microsoft, Orbital is ‘Ground Station as Read More

Read More

Google Cloud And Anaplan Innovate To Transform Enterprise Planning

Source: aithority.com Google Cloud and Anaplan, Inc. announced a strategic partnership to offer Anaplan’s platform for enterprise planning and business performance on Google Cloud. As Anaplan’s first public cloud Read More

Read More

HOW DEEPMIND ALGORITHMS HELPED IMPROVE THE ACCURACY OF GOOGLE MAPS?

Source: analyticsinsight.net DeepMind is one of the companies that are leading the AI charge and coming up with innovative uses of AI. This London-based AI lab has been Read More

Read More
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x