The first homework assignment does not involve any programming. Instead, you will take a closer look at the quality of todays’s machine translation systems.
Try to find challenging cases, either due to the language (not one of the top 100 languages in terms of resources), specialized domain (e.g., technical jargon), linguistic constructions, or writing style (e.g., social media with creative and ungrammatical expressions).
Write a report about the quality of the machine translation.
Go over at least 20 sentences, manually correct each sentence, and report for each sentence:
You may do step 4 in any way you want. For instance, you could classify errors as “reordering errors”, “word sense error for a noun”, or any other type of error you can think of.
For instance:
Conclude your report with a summary of your impression of the major quality problems in the machine translation system that you analysed.
Turn in a written report on Sunday, September 7 by midnight, on Gradescope.