Malik Talha

Journey into GSoC 2023

GSoC 2023 Journey: Week 14 Report

02 September 2023

4 Minutes

My contribution details and experiences during the fourteenth week of coding period of Google Summer of Code (GSoC) 2023.

Introduction

Welcome to my weekly report documenting my journey during Google Summer of Code 2023 with the Linux Foundation! In this project, I am working on enhancing the existing speech-to-text feature of Automotive Grade Linux (AGL) by introducing a Natural Language Intent engine and implementing software daemons/controllers to execute the extracted intent. This endeavor aims to significantly improve the user experience and functionality of the speech-to-text feature in automotive environments. Throughout this report, I will share my progress, challenges faced, and achievements made as I contribute to the development of AGL and pave the way for more intuitive and intelligent voice interactions in automobiles.

Summary of the week

This week's progress was marked by significant achievements in various aspects of the project. Notably, I successfully designed and developed a mechanism to map intents with Kuksa Vehicle Signal Specification (VSS). Additionally, a new Flutter application was created to seamlessly interact with our voice agent service. Furthermore, I accomplished the bundling and packaging of the voice agent service as a Python package, complete with the ability to function as a background service.

Tasks completed

  • Designed and developed a mechanism to map intents with Kuksa VSS. This mechanism forms the foundation for translating user intents into actionable signals within the vehicle, enhancing the overall user experience.
  • Created a new Flutter app specifically tailored to work seamlessly with our voice agent service. This application serves as a critical interface between users and the voice agent service, enabling voice command input and providing a responsive user interface.
  • Bundled and packaged the voice agent service as a Python package with the capability to run as a background service. This packaging ensures the service's stability and its ability to function reliably, even in resource-constrained environments.

Flutter App UI

Flutter App UI 1

Flutter App UI 2

Tasks leftover

No tasks were leftover this week.

Next steps

In the coming week, my focus will be on the following tasks:

  • Add comprehensive documentation for the intent engines and voice agent service. Thorough documentation is essential for users and developers to understand and effectively utilize these components.
  • Integrate and test the newly created Flutter app with the voice agent service on the AGL target machine (x86). This testing phase will ensure that the entire system functions seamlessly and reliably in a real-world environment, bringing us closer to our project's goals.

Conclusion

Overall, this week was productive, and I am satisfied with the progress made in achieving the goals outlined for the week. I am excited to continue my GSoC journey and further enhance the speech-to-text feature in Automotive Grade Linux.

Resources

There were no resources found attached to this post.

TwitterGitHubLinkedIn

© 2023 Malik Talha, All rights reserved.