Q 017

Extracting and Translating Speech from Random Video

Extracting information from video is arguably one of the most anticipated capabilities of AI, but the technology is not yet fully mature. No single tool can yet "read" a long video with very high accuracy.

In this AI Challenge, you are given a long video of approximately 30 minutes. Your task is to use a combination of tools, AI or non-AI, to quickly extract the spoken content from the video (i.e., transcribe it) and translate it into another language. The final result should be a well-structured, readable article in the target language. You should not need to spend 30 minutes working on a 30-minute video: do not play the video and transcribe it by ear. Instead, use tools that extract the content without requiring you to watch it in real time.
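One possible way to organize the workflow is sketched below. It is an illustration only, not the challenge's reference solution. It assumes the ffmpeg command-line tool for pulling the audio track out of the video (the challenge does not name any specific tool), and the function names, file names, chunk size, and prompt wording are all hypothetical. The sketch only prepares the inputs: a command that extracts a mono 16 kHz WAV file (a format many speech-to-text tools accept), a helper that splits a long transcript into pieces small enough to paste into a translation tool, and a helper that builds a translation prompt for each piece.

```python
from typing import List


def build_audio_extract_cmd(video_path: str, audio_path: str) -> List[str]:
    """Return an ffmpeg command that writes the audio track as mono 16 kHz audio.

    Assumption: ffmpeg is installed; the paths are placeholders.
    """
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it exists
        "-i", video_path,
        "-vn",           # drop the video stream
        "-ac", "1",      # one audio channel (mono)
        "-ar", "16000",  # 16 kHz sample rate
        audio_path,
    ]


def chunk_transcript(text: str, max_chars: int = 4000) -> List[str]:
    """Split a transcript on whitespace into chunks of at most max_chars.

    A single word longer than max_chars is kept whole in its own chunk.
    """
    chunks: List[str] = []
    current = ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_translation_prompt(chunk: str, target_language: str) -> str:
    """Build a hypothetical prompt asking a language model to translate one chunk."""
    return (
        f"Translate the following transcript excerpt into {target_language}. "
        "Rewrite it as clear, well-structured article prose and do not add "
        "information that is not in the excerpt.\n\n"
        f"{chunk}"
    )
```

To actually extract the audio, the command list can be passed to `subprocess.run(cmd, check=True)`. The resulting audio file then goes to whichever transcription tool you choose, and each chunk of the transcript is translated separately before the pieces are assembled into the final article.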

Your submission must include the AI tools you used, the exact prompt you entered, and the steps you followed to complete the challenge. Provide the link to the original video, the final transcribed and translated article, and attach a screenshot of a captured frame from the original video.
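The captured frame required for the submission can also be produced without playing the video. The sketch below assumes the ffmpeg command-line tool (not named by the challenge); the function name, timestamp, and file names are hypothetical.

```python
from typing import List


def build_frame_capture_cmd(video_path: str, timestamp: str, image_path: str) -> List[str]:
    """Return an ffmpeg command that saves a single frame at the given timestamp.

    Assumption: ffmpeg is installed; timestamp uses the HH:MM:SS form.
    """
    return [
        "ffmpeg",
        "-y",               # overwrite the output image if it exists
        "-ss", timestamp,   # seek to this position before decoding
        "-i", video_path,
        "-frames:v", "1",   # write exactly one video frame
        image_path,
    ]
```

For example, `build_frame_capture_cmd("talk.mp4", "00:05:00", "frame.png")` yields a command that, when run with `subprocess.run(cmd, check=True)`, saves the frame at the five-minute mark.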

