My Profile

My photo
He graduated from the Dept. of Mathematics of the Aristotle University of Thessaloniki in 2001. He continued his studies at the School of Medicine of the same University until 2003, where he obtained the M.Sc. in Medical Informatics. In 2008, he obtained the Ph.D. in Informatics entitled as "Digital Processing Techniques in Speech Emotion Recognition" at the Computer Science faculty of the same University. He has been awarded the ERCIM fellowship for 2009-2011. In 2009, he was with VTT Technical Research Center of Finland working on Alzheimer's disease and Neuraly Adjusted Ventilation Assist (NAVA). In 2010-2011, he was with IAIS Fraunhofer Institute in Bonn working on Speech Analysis. From 2012 until now he is a researcher and software developer in Centre for Research and Technology Hellas (CERTH). In the 15 years of his professional career, he has experience in signal processing and statistical pattern recognition with Python and Matlab, Android development, Javascript-PHP development for WordPress, Joomla, Three.js frameworks, Augmented Reality with Layar-Wikitude frameworks, Virtual Reality with Unity3D, dance recognition with Kinect, and gesture recognition with Myo.

Friday, January 1, 2010

Speaker Segmentation Demo for Matlab

The target of the software is to divide speech into 3 classes: Silence, Male, Female. In Stage 1, speech is classified into voiced or unvoiced frames by applying Gabor filtering and energy tracking by a method of G. Evangelopoulos. In Stage 2, it is assumed that if two speakers exist, then they would have significant different fundamental frequency and energy below 150 Hz regions, i.e. one actor would tend to be bass and the other will tend to be soprano, these differences are tracked again with the GMM algorithm. This method can be found in [5] at Journal Publications Section.
Video: dimitriosververidis.blospot.com
video