The expansive dataset is designed to translate 55 languages, including Cobol, Java, C++, and Python IBM unveiled a suite of new artificial intelligence (AI) services on Monday ahead of its virtual IBM Think 2021 conference. The new features include software that teaches AI how to translate code, a cloud migration service, and more functions for the firm's Watson Assistant. The most interesting of the announcements was a Rosetta Stone-like service for programming code called 'CodeNet', which is an expansive dataset designed to teach AI and machine learning systems how to translate code. It comprises some 14 million snippets and 500 million lines of code spread across more than 55 legacy and active languages, such as Cobol, Java, C++, and Python. The dataset is constructed in a way that allows for bidirectional translation, enabling Cobol legacy code to be translated into Java, for example.