 

FOR IMMEDIATE RELEASE

 UISPEECH INTRODUCES ENHANCED "MEDIA MINING SYSTEM" TO BROADCAST INDUSTRY AT THIS YEAR'S NATIONAL ASSOCIATION OF BROADCASTERS CONFERENCE

Revolutionary New Speech Recognition and Archive Retrieval Technology Set To Reduce Editing Costs and Crucial Man Hours Through Innovative Technological Platform

March 11, 2003 - Los Angeles, CA - UISpeech, a division of Uptime Integrated Systems (UIS), will introduce an enhanced Media Mining Voice-Recognition Technology System at this year’s National Association of Broadcasters Conference, it was announced today by Ken Stirbl, President of Uptime Integrated Systems. The platform is set to transform the way business is conducted within the broadcasting, newsgathering, motion picture, and cable and satellite industries, among others, through more efficient, cost-cutting methods of handling media content.

This technology was previously used within the government intelligence community for surveillance and intelligence gathering, but commercial applications have recently been identified, including broadcast monitoring, broadcast news indexing and editing, digital media asset management, media and financial analysis, and interactive news services. The Media Mining System was created by Sail Labs Technology and is being launched within the United States by UISpeech. UISpeech is responsible for integrating and implementing the Media Mining Technology into new or existing systems, corporate networks and Oracle Data Centers within the broadcast industry. Additionally, UISpeech will identify and develop new commercial applications for this system.

“UISpeech and Uptime Integrated Systems have determined after years of testing and refining that this product is ready to go. We are confident that this technology will add substantial value to organizations from diverse industry backgrounds and now is the time to run with it,” said Stirbl.  

The Media Mining System receives media streams (audio, video, satellite, etc.) through the Media Indexer, which passes them through a series of technologies including speaker change detection, speech recognition, speaker identification, named entity detection and topic detection. Speech recognition technologies are applied initially and break down the transcripts so that the input can be converted to an indexed XML output file for archiving and retrieval as needed. All files are uploaded to the Media Mining Server and are accessible through the Media Mining Explorer, the graphical user interface that allows the user to search, summarize, and replay audio/video content.
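As a rough illustration of what an indexed XML output file could look like, the following Python sketch converts a few transcript segments into XML. The element names, attribute names, file name, and sample segments are all assumptions made for this example; the actual Media Mining schema is not described in this release.

```python
import xml.etree.ElementTree as ET

# Hypothetical segments, as an indexer might emit them after speech
# recognition, speaker identification, and topic detection.
# All field names and values here are assumptions for illustration.
segments = [
    {"start": 0.0, "end": 4.2, "speaker": "Anchor", "topic": "weather",
     "text": "Heavy rain is expected across the region tonight."},
    {"start": 4.2, "end": 9.8, "speaker": "Reporter", "topic": "weather",
     "text": "Local officials are urging residents to stay indoors."},
]


def build_index(source, segments):
    """Return an XML string with one <segment> element per transcript segment."""
    root = ET.Element("media", source=source)
    for seg in segments:
        el = ET.SubElement(
            root, "segment",
            start=f"{seg['start']:.1f}", end=f"{seg['end']:.1f}",
            speaker=seg["speaker"], topic=seg["topic"],
        )
        el.text = seg["text"]
    return ET.tostring(root, encoding="unicode")


xml_index = build_index("evening_news.mpg", segments)
print(xml_index)
```

Storing the time offsets, speaker, and topic as attributes of each segment is one way to make the transcript searchable by those fields and to link search results back to the right point in the audio/video.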

The speech recognition technology within the system utilizes a context-sensitive statistical model of natural language to identify the individual voice characteristics of different speakers. Spoken words are analyzed and evaluated according to their position within a sentence rather than relying on the phonetic spelling of a word. The system is equipped with a vocabulary of between 65,000 and 500,000 words, depending on language selection, as well as over 400 million probability algorithms of broadcast-based text sentences, which ensures high recognition accuracy. The UISpeech system has an average accuracy rate of 87.5% for general live broadcast news, and accuracy as high as 98% has been seen for specific speeches in controlled recorded environments, compared with other systems, whose accuracy rates range from 40% to 50%. These added features can help a user search for and identify speaker-specific sound bites that are easily accessible through a user-friendly graphical user interface. UISpeech has also developed a new learning tool that is able to increase the recognition accuracy for specific vertical market segments.

This platform provides significant benefits to the broadcasting industry, including indexing and archiving live broadcasts in real time and recorded media in real time or better, closed captioning, automatic keyword translation, advanced searching capabilities and the creation of edit decision lists (EDLs). The timing of information retrieval is crucial for breaking news stories, and the Media Mining Technology expedites this process by enabling the end user to search for pertinent audio/video content in multimedia sources in real time.

 ARCHIVING AND INDEXING

Broadcast news organizations handle immense volumes of information on a daily basis. Indexing and annotating this information manually consumes several productive hours of manpower. The Media Mining System automates this process. Information from multiple sources can be indexed and archived in real time or better, 24 hours a day, 7 days a week, resulting in higher productivity and cost savings for the broadcasting company. The result is more effective management of a company’s media.

 CLOSED CAPTIONING

At present, the closed captioning business is constrained by a few factors. Manual closed captioning of news broadcasts, television series or movies requires considerable time for transcribing scripts, timing and placing captions, and encoding. Inaccuracies in captioning can result from human error as well as from reliance on pre-scripted teleprompter coverage. The closed captioning solution offered by the Media Mining System greatly reduces the need for manual processing. The UISpeech system automates this process, annotating and archiving information continuously while simultaneously displaying closed captioning. Automating this procedure allows more information to be processed quickly and accurately, and ensures that no spoken data is lost within newscasts, sporting events, and other breaking stories because that spoken text was not included in the original script.

 KEYWORD TRANSLATION

Keyword translation is an integral tool for monitoring international broadcasts. The Media Mining System allows users to create English language queries for Arabic, Chinese, French, German, and Spanish, and vice versa. These queries search for specific words within the foreign language text and alert a user to the appearance of each word within the foreign text according to a color key. No prior knowledge of these foreign languages is needed, as a user can search for translations of specific keywords within the broadcast. For security purposes, this information can be tracked and logged.
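The idea of an English query matching words in a foreign language transcript can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the keyword table, the function name, and the sample sentence are hypothetical, and the actual system's translation lexicon and alerting mechanism are not described in this release.

```python
# Hypothetical English-to-foreign keyword table, stored in lowercase.
# A production system would rely on a far larger translation lexicon.
KEYWORD_TABLE = {
    "election": {"fr": "élection", "de": "wahl", "es": "elección"},
    "strike": {"fr": "grève", "de": "streik", "es": "huelga"},
}


def find_keyword_hits(english_keyword, language, transcript):
    """Return (word_index, word) for each transcript word that matches
    the translation of the English keyword in the given language."""
    target = KEYWORD_TABLE[english_keyword.lower()][language]
    hits = []
    for i, word in enumerate(transcript.split()):
        # Compare case-insensitively, ignoring surrounding punctuation.
        if word.strip(".,;:!?\"'").lower() == target:
            hits.append((i, word))
    return hits


transcript = "La grève des transports continue. Une nouvelle grève est prévue."
hits = find_keyword_hits("strike", "fr", transcript)
print(hits)
```

The returned word positions could then drive highlighting in a user interface, or be written to a log for later review.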

 ADVANCED SEARCHES

Information storage is structured according to predefined topics and subcategories within those topic areas, which are displayed in an archive tree format. The advanced search function allows a user to search by person, place, topic, location, organization, date, time, percentage and text. For the broadcast monitoring industry, searches can be refined further by narrowing the focus to a channel, time, or language. Based on the search criteria selected by the user, the search engine will search through folders within the archive and return a list of matching stories and their associated media files.

 EDIT DECISION MODULE

UISpeech was directly responsible for developing the Edit Decision Module, a major front end of this system that has proven beneficial to the editing, production and post-production processes. Formerly, searches for sound bites and spoken dialog were conducted manually: transcripts had to be read, time codes had to be researched, and words and video had to be separated and logged manually into the existing editing tools. Utilizing the Edit Decision Module, an Edit Decision List (EDL) can be created, which allows a user to cut and paste audio/video content with time codes for live-to-air playback capabilities. This has proven to be highly valuable for producing movie trailers, spots and news pieces. Story creation can be facilitated by virtually anyone from anywhere using this application.
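To make the idea of an edit decision list concrete, the following Python sketch lays selected clips end to end and prints one line per edit event with source and record time codes. It is a simplified illustration only: the line layout, the 30 frames-per-second non-drop-frame rate, and the names `Clip`, `to_timecode`, and `build_edl` are assumptions for this example, and it does not reproduce the Edit Decision Module's actual output or any specific industry EDL format.

```python
from dataclasses import dataclass

FPS = 30  # assumed non-drop-frame rate, for illustration only


def to_timecode(frames, fps=FPS):
    """Convert a frame count to an HH:MM:SS:FF time code string."""
    seconds, ff = divmod(frames, fps)
    minutes, ss = divmod(seconds, 60)
    hh, mm = divmod(minutes, 60)
    return f"{hh:02d}:{mm:02d}:{ss:02d}:{ff:02d}"


@dataclass
class Clip:
    source: str  # hypothetical source tape or file name
    start: int   # first frame of the clip in the source
    end: int     # frame just after the last frame of the clip


def build_edl(clips):
    """Lay clips end to end and return one text line per edit event."""
    lines, record_pos = [], 0
    for n, clip in enumerate(clips, start=1):
        length = clip.end - clip.start
        lines.append(
            f"{n:03d} {clip.source} "
            f"{to_timecode(clip.start)} {to_timecode(clip.end)} "
            f"{to_timecode(record_pos)} {to_timecode(record_pos + length)}"
        )
        record_pos += length
    return lines


edl = build_edl([Clip("TAPE01", 900, 1050), Clip("TAPE02", 3600, 3690)])
for line in edl:
    print(line)
```

Each line pairs where a clip comes from (source in and out) with where it lands in the finished piece (record in and out), which is the information an editing tool needs to assemble the story without the manual logging described above.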

About UISpeech

UISpeech, a division of Uptime Integrated Systems, Inc. (UIS), is at the forefront of speech communications technology for the broadcasting industry, in addition to other diverse industry sectors. UISpeech is able to customize and integrate Sail Labs technology with other companies’ technologies to satisfy the unique requirements of each client in media mining. Its primary function is to integrate Media Mining and Speech Recognition Technology into new or existing systems, corporate networks and Oracle Data Centers. UISpeech will create custom interfaces and enhanced support function modules that get the most out of the Media Mining Technology. For more information, visit www.uispeech.com.
