du.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
En undersökning och jämförelse av två röststyrningsramverk för Android i bullriga miljöer
Dalarna University, School of Technology and Business Studies, Information Systems.
Dalarna University, School of Technology and Business Studies, Information Systems.
2017 (Swedish)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
An examination and comparison of two speech recognition frameworks for Android in noisy environments (English)
Abstract [en]

Voice control is a technology that most people encounter or use on a daily basis. The voice control technology can be used to interpret voice commands and execute tasks based on the command pronounced. According to previous studies problems arise with the precision when the voice control technologies are used in noisy environments. This study has been conducted as an experiment where the precision in two voice control frameworks for Android has been examined. The purpose with this study is to examine the precision in these two frameworks to assist a decision making for an organisation who has developed an application which will be used by midwives in low and middle income countries. Two prototypes was developed using the two voice control frameworks PocketSphinx and iSpeech. The precision of these frameworks was tested in three different surroundings. The surroundings the frameworks was tested in had the decibel levels 25, 60, and 80. The result shows that the number of correctly registered voice commands reduces considerably depending on which sound level the frameworks are being tested in. The framework who got the most voice commands correctly registered was PocketSphinx, but even this framework had a big margin of error.

Abstract [sv]

Röststyrning är idag en teknologi som de flesta människor någon gång stöter på eller använder sig av dagligen. Röststyrningsteknologin kan användas för att tolka vissa kommandon som sedan utför en uppgift baserat på det kommando som uttalas. Enligt tidigare studier uppkommer det problem med precisionen hos de röststyrningsramverk som används i bullriga miljöer. Denna studie har utförts som ett experiment där precisionen hos två stycken röststyrningsramverk för Android har undersökts. Syftet med denna studie var att undersöka precisionen hos dessa ramverk för att bistå med underlag till en organisation som utvecklat en applikation som används av barnmorskor i låg- och medelinkomstländer. Två stycken prototyper utvecklades med hjälp av röststyrningsramverken PocketSphinx och iSpeech. Dessa ramverks precision testades i tre stycken olika miljöer. De miljöer som prototyperna testades i hade ljudnivåerna 25dB, 60dB samt 80dB. Resultatet påvisar att antalet korrekt registrerade kommandon minskar avsevärt beroende på vilken ljudnivå som ramverken testas i. Det ramverk som korrekt registrerade flest röstkommandon var PocketSphinx men även denna hade en stor felmarginal.

Place, publisher, year, edition, pages
2017.
Keyword [en]
Speech Recognition in noisy environments, Robust speech recognition.
National Category
Information Systems
Identifiers
URN: urn:nbn:se:du-25579OAI: oai:DiVA.org:du-25579DiVA: diva2:1123287
Available from: 2017-07-13 Created: 2017-07-13

Open Access in DiVA

fulltext(954 kB)3 downloads
File information
File name FULLTEXT01.pdfFile size 954 kBChecksum SHA-512
68b03f014440da63c04881c3034db8e9a3d2226e468e9f0b5cd02ca4fa97c9ab019f4e5228179235f1d138c527dfaf89b1c3dbe2756b5ab8d0f3ac26a3359df3
Type fulltextMimetype application/pdf

By organisation
Information Systems
Information Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 3 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 11 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf