Datasets
BIG-Corpus-PT
Portuguese corpus composed of news articles, news comments, blog posts and twitter messages crawled during a period of 7 months.
DBpediaEntities-PT
Corpus of entities(persons, places, organisations) extracted from the Portuguese DBpedia.
DBpediaRelations-PT
Corpus of semantic relationships(persons, places, organisations) extracted from the Portuguese Wikpedia and DBpedia.
NomesLex-PT
Lexicon of person names from Portugal.
POWER-PT ontology
Portuguese politics.
SentiCorpus-PT
Comments in Portuguese manually annotated with sentiment and opinions about politicians.
SentiLex-PT
Sentiment lexicon for Portuguese.
SentiTuites-PT
Corpus of tweets posted by Portuguese users during the 2011 election campaign.
Twitter-BrownClusters-PT
Word clusters induced from Portuguese Twitter messages