sadatrafsanjani
To use the vectorizer, the data must first be preprocessed. The dataset used here contains non-standard characters. The added code snippet ensures that all data read from the IMDb dataset contains only UTF-8 characters.
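
For readers without the diff open, here is a minimal sketch of the kind of cleanup described above. It is not the snippet from this PR: the helper names `clean_review` and `clean_reviews` are hypothetical, and it assumes the reviews are available as raw bytes and that dropping invalid bytes (rather than replacing them) is acceptable.

```python
from typing import Iterable, List


def clean_review(raw: bytes) -> str:
    """Decode one review as UTF-8, dropping bytes that are not valid UTF-8.

    Hypothetical helper for illustration; the PR's own code may differ.
    """
    # errors="ignore" silently discards undecodable byte sequences.
    # Use errors="replace" instead to keep a U+FFFD marker in their place.
    return raw.decode("utf-8", errors="ignore")


def clean_reviews(raws: Iterable[bytes]) -> List[str]:
    """Apply clean_review to every raw review, preserving order."""
    return [clean_review(raw) for raw in raws]


if __name__ == "__main__":
    samples = [b"A great \xff film", "caf\u00e9 scene".encode("utf-8")]
    for text in clean_reviews(samples):
        print(text)
```

The resulting list of strings can then be handed to whichever vectorizer the project uses. Dropping bytes loses information, so `errors="replace"` may be preferable when it matters where the bad characters were.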
