Orizzonte Sistemi Navali proposes a three-year thesis project on generating and optimizing transcripts of technical and training seminars recorded on the Teams platform. The objective is to obtain texts that are accurate, readable, and consistent in their technical terminology, so that the company's information assets gain value and become easier to index and consult.
The candidate will carry out the following activities:
· Testing and comparing different Speech-to-Text systems on a set of company videos
· Defining a pipeline: audio extraction → transcription → post-processing → validation
· Assessing quality through metrics and error analysis (reliability, terminology consistency, readability)
· Producing guidelines and best practices to make the process replicable
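
As an illustration of the quality assessment activity, the sketch below shows two simple measures that could be computed in the validation step. The proposal does not name specific metrics, so both are assumptions: word error rate (WER), a common measure of transcription accuracy, and a glossary-based recall for technical terms. The function names, the glossary, and the sample sentences are hypothetical and serve only as an example.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"[\w']+", text.lower())


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = tokenize(reference), tokenize(hypothesis)
    if not ref:
        raise ValueError("reference transcript is empty")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def _padded(text: str) -> str:
    """Normalized text with surrounding spaces, for whole-word matching."""
    return " " + " ".join(tokenize(text)) + " "


def term_recall(reference: str, hypothesis: str, glossary: list[str]) -> float:
    """Share of glossary terms present in the reference that the
    hypothesis transcript also contains (hypothetical metric)."""
    ref_norm, hyp_norm = _padded(reference), _padded(hypothesis)
    expected = [t for t in glossary if _padded(t) in ref_norm]
    if not expected:
        return 1.0  # nothing to recover, so nothing was lost
    found = [t for t in expected if _padded(t) in hyp_norm]
    return len(found) / len(expected)


if __name__ == "__main__":
    # Invented example: a manual reference and a Speech-to-Text output.
    reference = "The sonar array is calibrated"
    hypothesis = "The sonar ray is calibrated"
    print(word_error_rate(reference, hypothesis))
    print(term_recall(reference, hypothesis, ["sonar array", "calibrated"]))
```

Computing both values for each Speech-to-Text system on the same manually corrected reference transcripts would give a like-for-like basis for the comparison and for the error analysis.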