fb_corpus.txt is the raw corpus and fb_eng_beng_corpus_tagged.txt is the tagged corpus with only words.