Skip to content

Instantly share code, notes, and snippets.

@joragupra
Created March 5, 2016 20:18
Show Gist options
  • Save joragupra/6cd7ed1798a240fe9f8c to your computer and use it in GitHub Desktop.
Save joragupra/6cd7ed1798a240fe9f8c to your computer and use it in GitHub Desktop.
Usando palabras filtro cuando creamos el clasificador
prepositions =['a','ante','bajo','cabe','con','contra','de','desde','en','entre','hacia','hasta','para','por','según','sin','so','sobre','tras']
prep_alike = ['durante','mediante','excepto','salvo','incluso','más','menos']
adverbs = ['no','si','sí']
articles = ['el','la','los','las','un','una','unos','unas','este','esta','estos','estas','aquel','aquella','aquellos','aquellas']
aux_verbs = ['he','has','ha','hemos','habéis','han','había','habías','habíamos','habíais','habían']
tfid = TfidfVectorizer(stop_words=prepositions+prep_alike+adverbs+articles+aux_verbs)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment