Aplicação do método de fusão para verificação de locutor independente de texto

Export this record:

Please use this identifier to cite or link to this item: https://tede2.pucrs.br/tede2/handle/tede/6452

Full metadata record

DC Field	Value	Language
dc.creator	Silva, Mayara Ferreira da	-
dc.creator.Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4355201T7	por
dc.contributor.advisor1	Castro, Maria Cristina Felippetto de	-
dc.contributor.advisor1Lattes	http://buscatextual.cnpq.br/buscatextual/visualizacv.do?id=K4763071T9	por
dc.date.accessioned	2016-01-04T17:56:48Z	-
dc.date.issued	2015-07-10	-
dc.identifier.uri	http://tede2.pucrs.br/tede2/handle/tede/6452	-
dc.description.resumo	Este trabalho apresenta uma visão geral acerca de verificação de locutor independente de texto, demonstrando o funcionamento básico do sistema e as principais referências de métodos já utilizados ao longo de anos para extração de características da fala e modelamento do locutor. Detectado um ponto a ser trabalhado dentro da etapa de extração de características, objetiva-se determinar coeficientes ou um conjunto destes relevantes para discriminação do locutor, com o intuito de minimizar a EER (Equal Error Rate). A proposta consiste em substituir os coeficientes delta(Δ) e double-delta(Δ2) por coeficientes de um preditor LPC (Linear Predictor Coding) o qual realiza a predição dos coeficientes MFCC (Mel Frequency Cepstral Coeficients). Além disso, aplica-se uma fusão a nível de score em função de sistemas baseados em MFCC e LPC. Outra análise discutida no trabalho é a fusão de um sistema MFCC com Δ e Δ². Um tópico também avaliado é com relação a variações de SNRs (Signal to Noise Ratios) nos áudios testados. Além disso, é elaborado um banco de falas em português brasileiro. Por fim, são expostos os resultados obtidos e é feita a análise dos mesmos, a fim de refletir sobre o que era esperado e levantar alguns comentários. Enfim, são feitas as considerações a respeito do trabalho, e elencadas as perspectivas futuras em torno das pesquisas de verificação de locutor independente de texto. Com este trabalho atingiu-se uma redução de 4% na taxa de erro igual (EER) em comparação ao sistema de referência, sendo que os melhores resultados foram apresentados pelo sistema que realiza um fusão do sistema MFCC com o Δ e Δ².	por
dc.description.abstract	This work presents an overview of text independent speaker verification, describing the basic operation of the system and the reviewing some important developments in speaker modeling and feature extraction from speech. Following, a point of improvement identified within the feature extraction stage leads to the main objective of this work: to determine one or more sets of coefficients relevant to speaker discrimination while minimizing the equal error rate (EER). The proposal is to replace the delta(Δ) and double-delta(Δ²) coefficients by a linear predictor code (LPC) for the mel frequency cepstral coefficients (MFCC). In addition, score level fusion is employed to combine the ouputs of MFCC-only and MFCC-LPC systems, as well as MFCC-only and MFCC-Δ-Δ² systems. In all cases, performance is evaluated with respect to variations of the signal to noise-ratio (SNR) in the tested audio. In addition, the work introduces a new Brazilian Portuguese speech repository containing free-speech from 155 males. Results and discussions are presented with a reflection on the expected outcomes, as well as general comments and observations. Finally, concludings remarks are made about the work, featuring future prospects regarding text independent speaker verification research. This work attained a 4% reduction in the EER compared to the reference system (MFCC-only), with best results occuring in the case fusion of MFCC-only and MFCC-Δ-Δ² scores.	eng
dc.description.provenance	Submitted by Setor de Tratamento da Informação - BC/PUCRS ([email protected]) on 2016-01-04T17:56:48Z No. of bitstreams: 1 DIS_MAYARA_FERREIRA_DA_SILVA_COMPLETO.pdf: 2803272 bytes, checksum: 9305b74451ec83ddca38d1c444ffb3dd (MD5)	eng
dc.description.provenance	Made available in DSpace on 2016-01-04T17:56:48Z (GMT). No. of bitstreams: 1 DIS_MAYARA_FERREIRA_DA_SILVA_COMPLETO.pdf: 2803272 bytes, checksum: 9305b74451ec83ddca38d1c444ffb3dd (MD5) Previous issue date: 2015-07-10	eng
dc.description.sponsorship	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES	por
dc.format	application/pdf	*
dc.thumbnail.url	http://tede2.pucrs.br:80/tede2/retrieve/163908/DIS_MAYARA_FERREIRA_DA_SILVA_COMPLETO.pdf.jpg	*
dc.language	por	por
dc.publisher	Pontifícia Universidade Católica do Rio Grande do Sul	por
dc.publisher.department	Faculdade de Engenharia	por
dc.publisher.country	Brasil	por
dc.publisher.initials	PUCRS	por
dc.publisher.program	Programa de Pós-Graduação em Engenharia Elétrica	por
dc.rights	Acesso Aberto	por
dc.subject	ENGENHARIA ELÉTRICA	por
dc.subject	REDES NEURAIS (COMPUTAÇÃO)	por
dc.subject	RELAÇÃO HOMEM-MÁQUINA	por
dc.subject	RECONHECIMENTO DE VOZ (INFORMÁTICA)	por
dc.subject	SINTETIZADORES DE VOZ (INFORMÁTICA)	por
dc.subject	PROCESSAMENTO DE SINAIS - TÉCNICAS DIGITAIS	por
dc.subject	PROCESSAMENTO DE VOZ - TÉCNICAS DIGITAIS	por
dc.subject.cnpq	ENGENHARIAS	por
dc.title	Aplicação do método de fusão para verificação de locutor independente de texto	por
dc.type	Dissertação	por
Appears in Collections:	Programa de Pós-Graduação em Engenharia Elétrica

Files in This Item:

File	Description	Size	Format
DIS_MAYARA_FERREIRA_DA_SILVA_COMPLETO.pdf	Texto Completo	2.74 MB	Adobe PDF	Download/Open Preview ×

Show simple item record Recommend this item

PUCRS

Digital Library of Theses and Dissertations