<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom"><title>UH Biocomputation Group - cepstral coefficients</title><link href="http://biocomputation.herts.ac.uk/" rel="alternate"/><link href="http://biocomputation.herts.ac.uk/feeds/tags/cepstral-coefficients.atom.xml" rel="self"/><id>http://biocomputation.herts.ac.uk/</id><updated>2020-12-16T12:28:04+00:00</updated><entry><title>Speech Emotion recognition using deep neural networks using Mel Frequency Cepstral coefficients</title><link href="http://biocomputation.herts.ac.uk/2020/12/16/speech-emotion-recognition-using-deep-neural-networks-using-mel-frequency-cepstral-coefficients.html" rel="alternate"/><published>2020-12-16T12:28:04+00:00</published><updated>2020-12-16T12:28:04+00:00</updated><author><name>Shreyah Iyer</name></author><id>tag:biocomputation.herts.ac.uk,2020-12-16:/2020/12/16/speech-emotion-recognition-using-deep-neural-networks-using-mel-frequency-cepstral-coefficients.html</id><summary type="html">&lt;p class="first last"&gt;Shreyah Iyer's Journal Club session where she will talk about a project&amp;quot;Speech Emotion recognition using deep neural networks using Mel Frequency Cepstral coefficients&amp;quot;.&lt;/p&gt;
</summary><content type="html">&lt;p&gt;This week on Journal Club session Shreyah Iyer will talk about her project &amp;quot;Speech Emotion recognition using deep neural networks using Mel Frequency Cepstral coefficients&amp;quot;.&lt;/p&gt;
&lt;hr class="docutils" /&gt;
&lt;p&gt;Speech is a very important context in understanding human emotions for example
in psychology and criminology as the effects of emotions in voice can be
recognized by all people irrespective of the language of speech. In this
presentation I will talk about an ongoing KTP Project on Speech Emotion
Recognition system.&lt;/p&gt;
&lt;p&gt;The aim of the project is to build a system which can interpret the underlying
emotion from an audio/speech signal. So far, I have worked on using Deep
Learning architectures, i.e CNN’s with most widely used features for emotion
detection such as MFCC’s and Mel-spectrograms. I have especially investigated
what the best way is to use the coefficients extracted from MFCC’s. I have
worked on are two publicly available Speech Emotion Corpus i.e., TESS and
RAVDESS. Results conducted on these experiments show that the MFCC’s features
with an optimal stack length supersedes the other CNN architectures used.&lt;/p&gt;
&lt;p&gt;In this presentation I will also talk about the challenges and future work for
this project.&lt;/p&gt;
&lt;div class="line-block"&gt;
&lt;div class="line"&gt;&lt;br /&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;p&gt;&lt;strong&gt;Date:&lt;/strong&gt; 16/12/2020 &lt;br /&gt;
&lt;strong&gt;Time:&lt;/strong&gt; 16:00 &lt;br /&gt;
&lt;strong&gt;Location&lt;/strong&gt;: online&lt;/p&gt;
</content><category term="Seminars"/><category term="Speech recognition"/><category term="deep learning"/><category term="cepstral coefficients"/></entry></feed>