This dataset is Portuguese journalistic corpus with 180 million words that spans eight years of news, from 1991 to 1998.