APPLICATION OF SPEECH PROCESSING TO SWIFTLET SOUND

by Watashino Inochi on 20 April 2015


Transcript of APPLICATION OF SPEECH PROCESSING TO SWIFTLET SOUND

RESEARCH OBJECTIVE
APPLICATION OF SPEECH PROCESSING TO SWIFTLET SOUND
PROBLEM STATEMENT


Speech processing analysis frameworks have not yet been applied to swiftlet sounds.

Swiftlet attraction recordings are evaluated by human experts using trial and error, without any specific analysis or synthesis of the sound characteristics that attract swiftlets.

At the end of the research, we achieved our research objectives:

Implemented the speech processing techniques MFCC and HMM on swiftlet sound.

Analyzed the features of three types of sound (baby sound, adult sound and colony sound) used for swiftlet attraction in swiftlet husbandry premises.

Applied the combination of feature extraction and classification, reaching 88.7% classification accuracy and 11.3% error.
research background
The swiftlet industry has become a source of economic income in Malaysia, driven by the potential of and high demand for bird nests for health care (Azman, 2011).
By SITI NURZALIKHA ZAINI BT HUSNI ZAINI (MEL12004)
Supervisor: Dr. Sunardi (FKEE)
Co-Supervisor: PM Dr. Kamarul Hawari (FKEE)
Co-Supervisor: PM Dr. Saiful Nizam (FIST)
Previously, the sound played at swiftlet husbandry premises was simply recorded swiftlet audio, used without analysis.

For more than a decade, entrepreneurs have explored various methods and new technologies to increase production. Research and development on swiftlet attraction sounds is therefore needed to advance the technology of the swiftlet industry.


To implement speech processing techniques on swiftlet sounds.

To analyze the three different types of swiftlet sounds used in swiftlet houses.

To apply a combination of feature extraction and recognition techniques to classify the swiftlet sounds.
LITERATURE REVIEW
speech processing
The sound data for baby, adult and colony swiftlets were bought from a swiftlet farming company, TCL Resources Sdn Bhd, which supplies various swiftlet sounds, through the Faculty of Science and Technology (FIST), Universiti Malaysia Pahang.
1. sample of sound
3. feature extraction
4. classification
conclusion
RESEARCH METHODOLOGY

Sample of Sound
Pre-processing
Feature Extraction
Classification
Decision
At frame 1, the baby swiftlet sound has the smallest feature vector value, about -36.94; the adult swiftlet sound has the highest, -16.90; and the colony swiftlet sound has -30.1.

At frame 2, the baby swiftlet sound has the higher feature vector value, reaching -4.72; the adult swiftlet sound has the smaller value, -1.45; and the colony swiftlet sound has 0.27.

The graph pattern at frames 2, 3 and 4 clearly shows the highest values for the baby swiftlet sound, followed by the colony sound, with the smallest values for the adult sound.
From frame 5 to frame 20, the feature vector values of the three types of sound are close to one another.

We can conclude that the three types of swiftlet sound share the same pattern at frames 2, 3 and 4, where their values increase, but the pattern is inverted at frame 1.
From frame 5 to frame 20 the sounds are difficult to differentiate, because their feature vector values differ by only about 0.10.
Features Analysis
Take a sample of sound for each of the 3 types of sound:
Baby swiftlet sound
Adult swiftlet sound
Colony swiftlet sound
Convert all sounds from .mp3 to .wav format, cut each sound into two-second segments and filter out the environmental sound
Extract features using Mel Frequency Cepstral Coefficients (MFCC)
Classify using a Hidden Markov Model (HMM)
Match the type of swiftlet sound
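The segmentation part of the pre-processing step can be sketched as follows. This is a minimal illustration only, assuming the audio has already been converted to .wav and loaded as a 1-D NumPy array. The function name split_into_segments and its parameters are hypothetical, and the environmental-sound filter is left out because the slides do not say which filter was used.

```python
import numpy as np

def split_into_segments(signal, sample_rate, segment_seconds=2.0):
    """Cut a 1-D signal into non-overlapping fixed-length segments.

    Any tail shorter than one full segment is dropped.
    Returns an array of shape (n_segments, samples_per_segment).
    """
    signal = np.asarray(signal)
    seg_len = int(round(segment_seconds * sample_rate))
    n_segments = len(signal) // seg_len
    return signal[: n_segments * seg_len].reshape(n_segments, seg_len)

# Usage with a synthetic 5-second signal sampled at 8 kHz (assumed values).
sample_rate = 8000
recording = np.zeros(5 * sample_rate)
segments = split_into_segments(recording, sample_rate)
```

Each row of segments is then one two-second clip that can be passed on to feature extraction.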
scope of research
Three types of sound were selected in this project for recognition at the end of the research:

Baby Sound
Produced by baby swiftlets; used inside the house to make young birds comfortable there.


Adult Sound
Produced by adult swiftlets when mating to produce eggs; used inside the house.


Colony Sound
Produced by a group of swiftlets; used in the puller house to call swiftlets flying nearby to come and build their nests in the swiftlet house.
Result & discussion
MFCCs of Baby Sound
MFCCs of Adult Sound
MFCCs of Colony Sound
This research achieved 88.7% accuracy and 11.3% error.
By comparison, Clemins (2003) obtained 83.8% accuracy and 16.2% error using MFCC and HMM on elephant sounds.
This shows that MFCC and HMM are reasonable for animal sound applications, not only for human speech.
Robert et al. (2012) also used the same technique (MFCC and HMM) for the detection of bird species and obtained a best accuracy of 92.1% with an error of only 7.9%.
Although Robert et al. obtained a higher accuracy than this research, 88.7% is still within the acceptable accuracy range.
Feature Extraction

Desai et al. (2013) found in their experiments that the MFCC technique is superior to other techniques when the results are compared.

In animal sound applications, Zeppelzauer (2005) stated that MFCCs are well suited to discriminating the classes of animal sound.
Previous research thus shows that MFCC gives remarkable results in the field of speech recognition. Therefore, MFCC was chosen for this research.

Klautau (2005) identified the main steps of MFCC as pre-emphasis, framing, windowing, Discrete Fourier Transform (DFT), Mel filter bank, logarithm and Discrete Cosine Transform (DCT).
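The seven MFCC steps listed above can be sketched in NumPy as follows. This is an illustrative sketch, not the implementation used in this research: the function names, the 25 ms frame length, 10 ms step, 26 filters, 13 coefficients and 0.97 pre-emphasis factor are all assumed, commonly used defaults, and the input is assumed to be at least one frame long.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale.

    Returns an array of shape (n_filters, n_fft // 2 + 1).
    """
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for m in range(1, n_filters + 1):
        left, center, right = bins[m - 1], bins[m], bins[m + 1]
        for k in range(left, center):      # rising edge of the triangle
            fbank[m - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            fbank[m - 1, k] = (right - k) / (right - center)
    return fbank

def mfcc(signal, sample_rate, frame_len=0.025, frame_step=0.010,
         n_filters=26, n_ceps=13, pre_emph=0.97):
    """Return an array of shape (n_frames, n_ceps)."""
    signal = np.asarray(signal, dtype=float)
    # 1. Pre-emphasis: boost high frequencies.
    emphasized = np.append(signal[0], signal[1:] - pre_emph * signal[:-1])
    # 2. Framing: overlapping short frames (incomplete last frame is dropped).
    flen = int(round(frame_len * sample_rate))
    fstep = int(round(frame_step * sample_rate))
    n_frames = 1 + (len(emphasized) - flen) // fstep
    idx = np.arange(flen)[None, :] + fstep * np.arange(n_frames)[:, None]
    frames = emphasized[idx]
    # 3. Windowing: Hamming window on every frame.
    frames = frames * np.hamming(flen)
    # 4. DFT: power spectrum, FFT size = next power of two >= frame length.
    n_fft = 1 << (flen - 1).bit_length()
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # 5. Mel filter bank energies.
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    # 6. Logarithm (floored to avoid log(0)).
    log_energies = np.log(np.maximum(energies, 1e-10))
    # 7. DCT-II (unnormalized), keeping the first n_ceps coefficients.
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps),
                                    np.arange(n_filters) + 0.5) / n_filters)
    return log_energies @ basis.T
```

Each row of the result is the feature vector of one frame, so a two-second clip becomes a sequence of feature vectors that a classifier can consume.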

Gaikwad et al. (2010), in their journal article on speech recognition techniques, describe MFCC as the most popular feature extraction technique for speech.

Classification

There are two main stages in a speech recognition system, which are
training and testing stages.
Statistical representations of the temporal structure as well as spectral variation using Hidden Markov Models (HMMs) are the main technologies that have
contributed to the improvement of the recognition performances
(Podder, 1997).

Rabiner (1989) stated two strong reasons for the importance of HMMs. Firstly, HMMs are very rich in mathematical structure and hence can form the theoretical basis for use in a wide range of applications.
Secondly, HMMs work very well in practice for several important applications, provided that they are applied properly.
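The testing stage of an HMM classifier can be sketched as follows: one model per sound class, each scored with the forward algorithm, and the class with the highest log-likelihood wins. This is a simplified sketch under stated assumptions. It uses discrete observation symbols (for example, MFCC vectors quantized to codebook indices), which may differ from the models used in this research; training is omitted; and make_model, classify and all the probabilities below are hypothetical toy values, not results.

```python
import numpy as np

def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    out = m + np.log(np.sum(np.exp(a - m), axis=axis, keepdims=True))
    return np.squeeze(out, axis=axis)

def log_likelihood(obs, log_pi, log_A, log_B):
    """Forward algorithm in the log domain for a discrete-observation HMM.

    obs: sequence of symbol indices
    log_pi: (S,) initial state log-probabilities
    log_A: (S, S) transition log-probabilities, rows = from-state
    log_B: (S, V) emission log-probabilities
    """
    alpha = log_pi + log_B[:, obs[0]]
    for o in obs[1:]:
        alpha = logsumexp(alpha[:, None] + log_A, axis=0) + log_B[:, o]
    return float(logsumexp(alpha, axis=0))

def make_model(pi, A, B):
    """Convert strictly positive probabilities to log-probabilities."""
    return tuple(np.log(np.asarray(x, dtype=float)) for x in (pi, A, B))

def classify(obs, models):
    """Return the best-scoring class name and all class scores."""
    scores = {name: log_likelihood(obs, *params) for name, params in models.items()}
    return max(scores, key=scores.get), scores

# Toy two-state models over a three-symbol codebook (made-up numbers).
models = {
    "baby": make_model([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]],
                       [[0.8, 0.1, 0.1], [0.6, 0.3, 0.1]]),
    "adult": make_model([0.5, 0.5], [[0.6, 0.4], [0.5, 0.5]],
                        [[0.1, 0.8, 0.1], [0.2, 0.6, 0.2]]),
    "colony": make_model([0.5, 0.5], [[0.5, 0.5], [0.5, 0.5]],
                         [[0.1, 0.1, 0.8], [0.2, 0.2, 0.6]]),
}
label, scores = classify([0, 0, 1, 0, 0], models)
```

In the training stage, the parameters of each class model would be estimated from labeled recordings instead of being written by hand.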
The main producers of edible nests in Asian countries are
White-nest Swiftlets (Aerodramus fuciphagus)

Black-nest Swiftlets (Aerodramus maximus)
Baby sound
Adult Sound
Colony Sound
swiftlet
Birds produce sounds for various reasons, with the majority falling in the categories of songs and calls.
Birds sing to attract mates or define territory.
Fagerlund
(2004)
Shaw
(2011)
Jabatan Veterinar
Malaysia (2012)
Animals generate sounds to communicate with members of the same species (Lee et al., 2006).
Environmental factors such as temperature, light intensity, humidity and sound are the key to a successful site for swiftlets, because swiftlets are comfortable in an environment like their original habitat in caves.
Swiftlet vocalizations have proven very effective at attracting swiftlets to nest in bird houses for swiftlet farming.
The most interesting feature of swiftlets is that they utilize a sonar-like system to attract other swiftlets.
Roger found that swiftlet hearing responds to frequencies of 1 to 16 kHz, with most energy at 2 to 5 kHz, as shown by James et al. (1993) in their paper.
This frequency range also falls within human hearing (20 Hz to 20 kHz).
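A band-energy check of this kind can be sketched with an FFT. The function below is a hypothetical helper, not part of the original research, and the test tone is synthetic. It measures what fraction of a signal's spectral energy lies between two frequencies, such as 2 kHz and 5 kHz.

```python
import numpy as np

def band_energy_fraction(signal, sample_rate, low_hz, high_hz):
    """Fraction of total spectral energy inside [low_hz, high_hz]."""
    spectrum = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    in_band = (freqs >= low_hz) & (freqs <= high_hz)
    return spectrum[in_band].sum() / spectrum.sum()

# Usage with a synthetic 3 kHz tone, one second long at 44.1 kHz (assumed values).
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 3000 * t)
fraction = band_energy_fraction(tone, sample_rate, 2000, 5000)
```

Applied to a real swiftlet recording, a value close to 1 would indicate that most of its energy sits in the 2 to 5 kHz band.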
The original habitat of swiftlets is the cave, with its natural environment, so man-made swiftlet farms are built to resemble that habitat and encourage swiftlets to build their nests for the industry.
Entrepreneurs build swiftlet houses with environmental factors such as aroma, light, temperature, humidity and sound in mind (Roger et al., 1987).
However, swiftlets are sensitive to sound. Sound is the main factor that brings swiftlets into the swiftlet house (Mulia, 2007).
Birds and humans both produce sound with a complex acoustic signal nature, so speech processing can be used for both humans and birds.
Allinson and Patrica (1999)
Roger et al. (1987)
Henri (2005)