1. Japanese Credit Screening: Includes a domain theory (generated by talking to Japanese domain experts); data in Lisp format
2. Reuters Transcribed Subset: Created by reading aloud 200 files from the 10 largest Reuters
classes and using an automatic speech recognition (ASR) system to produce the
corresponding transcriptions.