A subtitle segmentation system employs a neural network model to find good segment boundaries. The model may be trained on millions of professionally segmented subtitles, and implicitly learns from that data the guidelines that professionals follow. To control different characteristics of the output subtitles, the neural model may be combined with a number of heuristic features. To find the best segmentation according to the model combination, a dedicated beam search decoder may be implemented. The segmentation system incorporates program instructions for segmenting text into subtitles and a trained neural model comprising a word embedding layer, at least two bi-directional LSTM layers, and a softmax layer.
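The combination of model scores and heuristic features in a beam search can be illustrated with a minimal sketch. It is not the system's actual decoder: the function name `beam_search_segment`, the parameters `max_chars`, `length_weight` and `beam_size`, and the single line-length heuristic are all assumptions made for illustration. The list `break_probs` stands in for the softmax output of the neural model (one break probability per word) and is supplied by hand here rather than computed by a network.

```python
import math
from typing import List, Sequence, Tuple


def beam_search_segment(
    words: Sequence[str],
    break_probs: Sequence[float],
    max_chars: int = 42,
    length_weight: float = 5.0,
    beam_size: int = 4,
) -> List[str]:
    """Split words into lines; break_probs[i] is P(break after words[i]).

    Hypothetical sketch: break_probs is assumed to come from a trained model.
    """
    if len(words) != len(break_probs):
        raise ValueError("need one break probability per word")
    eps = 1e-9  # guards against log(0)
    # A hypothesis is (score, finished lines, words of the current open line).
    beam: List[Tuple[float, Tuple[str, ...], Tuple[str, ...]]] = [(0.0, (), ())]
    for i, (word, p) in enumerate(zip(words, break_probs)):
        is_last = i == len(words) - 1
        candidates = []
        for score, lines, current in beam:
            current = current + (word,)
            text = " ".join(current)
            # Heuristic feature (assumed): penalise characters beyond the limit.
            overflow = max(0, len(text) - max_chars)
            if not is_last:
                # Option 1: no break after this word, the line stays open.
                no_break = math.log(max(1.0 - p, eps))
                candidates.append((score + no_break, lines, current))
            # Option 2: break after this word (forced after the last word).
            brk = 0.0 if is_last else math.log(max(p, eps))
            closed_score = score + brk - length_weight * overflow
            candidates.append((closed_score, lines + (text,), ()))
        # Keep only the best-scoring hypotheses.
        candidates.sort(key=lambda h: h[0], reverse=True)
        beam = candidates[:beam_size]
    return list(beam[0][1])


if __name__ == "__main__":
    words = ["aaaa", "bbbb", "cccc"]
    print(beam_search_segment(words, [0.2, 0.4, 0.4], max_chars=9))
```

In this sketch the length penalty is applied only when a line is closed, which keeps the scoring simple; a fuller decoder would likely score several heuristic features and could penalise overlong open lines earlier so that they are pruned sooner.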
AppTek.ai is a global leader in artificial intelligence (AI) and machine learning (ML) technologies for automatic speech recognition (ASR), neural machine translation (NMT), natural language processing/understanding (NLP/U), large language models (LLMs) and text-to-speech (TTS). The AppTek platform delivers industry-leading solutions for organizations across a breadth of global markets such as media and entertainment, call centers, government, enterprise business, and more. Built by scientists and research engineers who are recognized among the best in the world, AppTek’s solutions cover a wide array of languages/dialects, channels, domains and demographics.