{"id":673,"date":"2023-06-29T10:39:02","date_gmt":"2023-06-29T10:39:02","guid":{"rendered":"https:\/\/www.fidelsoftech.com\/case-studies\/?p=673"},"modified":"2025-08-13T12:41:53","modified_gmt":"2025-08-13T12:41:53","slug":"voice-data-processing-for-asr-engine","status":"publish","type":"post","link":"https:\/\/www.fidelsoftech.com\/case-studies\/voice-data-processing-for-asr-engine\/","title":{"rendered":"Voice Data Processing for ASR Engine and Extracting Value from Spoken Words &#8211; Case Study"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_fullwidth_image src=&#8221;https:\/\/www.fidelsoftech.com\/case-studies\/wp-content\/uploads\/2023\/06\/voice-data-processing-for-asr-engine-and-extracting-value-from-spoken-words-.jpg&#8221; alt=&#8221;Voice Data Processing for ASR Engine and Extracting Value from Spoken Words&#8221; _builder_version=&#8221;4.27.4&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;||||false|false&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_fullwidth_image][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; fullwidth=&#8221;on&#8221; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; custom_margin=&#8221;2px||0px||false|false&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_fullwidth_post_title meta=&#8221;off&#8221; featured_image=&#8221;off&#8221; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; title_text_align=&#8221;left&#8221; custom_margin=&#8221;0px||||false|false&#8221; custom_padding=&#8221;30px||||false|false&#8221; global_colors_info=&#8221;{}&#8221;][\/et_pb_fullwidth_post_title][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;25px||||false|false&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row module_class=&#8221;inner-page&#8221; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; custom_padding=&#8221;0px||||false|false&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;4_4&#8243; _builder_version=&#8221;4.18.0&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.27.4&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<h2>About the client<\/h2>\n<p>The client, a prominent company, has transformed the shopping experience for all of us through the innovative use of cloud technology and Speech Recognition Engine (ASR).<\/p>\n<h3>Requirements:<\/h3>\n<p>The client wanted to establish a process with minimum human intervention for transcribing the source audio data in multiple Indian and foreign languages.<\/p>\n<h3>Challenges :<\/h3>\n<p>The client wanted us to develop a tool with an interface to process the source audio data and also to set up an automation process to perform quality assurance on the transcribed version. The greatest challenge was first to convert analog to digital signal processing, then to identify and classify speech and non-speech segments, and finally to preserve the important audio contents without any data loss for further text analysis.<\/p>\n<h3>Solution Provided :<\/h3>\n<p>Fidel analyzed the requirements and identified key areas which play an important role in processing audio data and performs an analysis on transcribed audio contents.<\/p>\n<ol>\n<li><a href=\"https:\/\/www.fidelsoftech.com\/ai-ml-development-python\/\">Python<\/a> GUI which supports AI modules.<\/li>\n<li><a href=\"https:\/\/www.filose.com\/nlp-services\" target=\"_blank\" rel=\"noopener\">NLP<\/a> to identify language detection and classification on various target languages.<\/li>\n<\/ol>\n<p>Based on this analysis, Fidel laid out the flow to achieve the audio signal processing with easy use of technology and programming languages like Python and its supported AI modules like NLP and NumPy.<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/www.fidelsoftech.com\/case-studies\/wp-content\/uploads\/2023\/06\/analog-to-digital-signal-processing.png\" alt=\"analog to digital signal processing\" \/><\/p>\n<p><strong>Analog to Digital Signal Processing:<\/strong><\/p>\n<ul>\n<li>Process audio files using Python \u2013 AI modules.<\/li>\n<li>Algorithms provide optimum results on digital signals.<\/li>\n<li>Filter creation based upon input digital signals for noise reduction and to extract average audible signals.<\/li>\n<\/ul>\n<h4><strong>AI Supported Modules used for complete process:<\/strong><\/h4>\n<p><strong>Automated Quality Checks System:<\/strong><\/p>\n<ul>\n<li>Recognize non-linguistics issues from intermediate files.<\/li>\n<li>Define 18 quality check parameter rules to identify issues.<\/li>\n<li>Language detection, sentiment analysis and POS tagging using NLP AI module.<\/li>\n<\/ul>\n<p><strong>Data Creation for ASR Engine:<\/strong><\/p>\n<ul>\n<li>Chunking speech audio data to train ASR Engine.<\/li>\n<li>Using speech and time parameters for chunking audio data.<\/li>\n<\/ul>\n<p><strong>NLP \u2013 Classification of transcribed contents:<\/strong><\/p>\n<ul>\n<li>POS tagging is the process of marking up a word in a text (corpus) as corresponding to a particular part of speech, based on both its definition and its context.<\/li>\n<li>Its relationship with adjacent and related words in a phrase, sentence, or paragraph.<\/li>\n<li>A simplified form in the identification of words as nouns, verbs, adjectives, adverbs.<\/li>\n<\/ul>\n<h3>Result:<\/h3>\n<ul>\n<li>Client is able to classify the speech and non-speech segments and introduce quality checks on the transcribed data.<\/li>\n<li>Client is able to process high-volume transcribed data across multiple languages<\/li>\n<li>Client was able to effectively process their audio data while identifying and classifying the contents which is vital to train their ASR engines.<\/li>\n<\/ul>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section][et_pb_section fb_built=&#8221;1&#8243; admin_label=&#8221;Section&#8221; module_class=&#8221;footer-cnt&#8221; _builder_version=&#8221;4.19.5&#8243; _module_preset=&#8221;default&#8221; background_color=&#8221;#103e66&#8243; custom_margin=&#8221;||0px||false|false&#8221; custom_padding=&#8221;||0px||false|false&#8221; locked=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_row column_structure=&#8221;1_2,1_2&#8243; _builder_version=&#8221;4.17.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.17.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][et_pb_text _builder_version=&#8221;4.19.5&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<\/p>\n<h2 style=\"color: #ffffff;\">Is this Case Study interests you?<\/h2>\n<p style=\"color: #ffffff;\">If you found this case study similar to your requirement OR interested to get our services, please connect with us using form. We will be happy to respond.<\/p>\n<p>[\/et_pb_text][et_pb_text _builder_version=&#8221;4.19.5&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;]<span><\/span><\/p>\n<h3 style=\"color: #ffffff;\"><i id=\"et-info-phone\"><\/i>Call us<\/h3>\n<p><a href=\"tel:+91-20-49007800\" onclick=\"gtag('event', 'Phone', {'event_category': 'engagement','event_label': 'Voice Data Processing for ASR Engine and Extracting Value from Spoken Words case study'});\">+91-20-49007800<\/a><\/p>\n<p><span><\/span><\/p>\n<h3 style=\"color: #ffffff;\"><i id=\"et-info-email\"><\/i> Email us<\/h3>\n<p><a href=\"mailto:sales@fidelsoftech.com\" onclick=\"gtag('event', 'Email', {'event_category': 'engagement','event_label': 'Voice Data Processing for ASR Engine and Extracting Value from Spoken Words Case Study'});\">sales@fidelsoftech.com<\/a>[\/et_pb_text][\/et_pb_column][et_pb_column type=&#8221;1_2&#8243; _builder_version=&#8221;4.17.1&#8243; _module_preset=&#8221;default&#8221; global_colors_info=&#8221;{}&#8221;][dvppl_cf7_styler cf7=&#8221;1670&#8243; form_background_color=&#8221;#FFFFFF&#8221; _builder_version=&#8221;4.27.4&#8243; _module_preset=&#8221;default&#8221; form_field_font_font=&#8221;Arial||||||||&#8221; background_enable_color=&#8221;off&#8221; global_colors_info=&#8221;{}&#8221; locked=&#8221;off&#8221;][\/dvppl_cf7_styler][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>About the client The client, a prominent company, has transformed the shopping experience for all of us through the innovative use of cloud technology and Speech Recognition Engine (ASR). Requirements: The client wanted to establish a process with minimum human intervention for transcribing the source audio data in multiple Indian and foreign languages. Challenges : [&hellip;]<\/p>\n","protected":false},"author":6,"featured_media":680,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"2880","footnotes":""},"categories":[21],"tags":[],"class_list":["post-673","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai"],"_links":{"self":[{"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/posts\/673","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/comments?post=673"}],"version-history":[{"count":17,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/posts\/673\/revisions"}],"predecessor-version":[{"id":1709,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/posts\/673\/revisions\/1709"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/media\/680"}],"wp:attachment":[{"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/media?parent=673"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/categories?post=673"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.fidelsoftech.com\/case-studies\/wp-json\/wp\/v2\/tags?post=673"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}