Day38 - bad data in csv file #8
Comments
Thanks @alt3red, must indeed be some weird character in
If I use ISO-8859-1 it works. See also: https://intellij-support.jetbrains.com/hc/en-us/community/posts/360000004284-File-was-loaded-in-the-wrong-encoding-UTF-8- @mikeckennedy maybe something to add to the README?
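For anyone hitting the same error, here is a minimal sketch of the ISO-8859-1 workaround. It tries UTF-8 first and only falls back if decoding fails. The helper name `read_rows` and the fallback order are my own assumptions, not something from the course code.

```python
import csv


def read_rows(path, encodings=("utf-8", "ISO-8859-1")):
    """Hypothetical helper: try each encoding in turn.

    Returns (rows, encoding_used), where rows is a list of dicts
    keyed by the CSV header.
    """
    last_error = None
    for enc in encodings:
        try:
            # newline="" is what the csv module docs recommend for file objects
            with open(path, newline="", encoding=enc) as f:
                return list(csv.DictReader(f)), enc
        except UnicodeDecodeError as err:
            last_error = err  # remember the failure and try the next encoding
    raise last_error
```

Note that ISO-8859-1 accepts every byte value, so the fallback never raises; it only gives the right characters if the file really was saved in that encoding.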
Thanks for the heads up, guys! I fixed the file and cloned the repo. There is a new link in the readme.md now. I also did a PR back to FiveThirtyEight fixing their data. We'll see if they accept it.
PR here:
Hello,
On Day 38, when loading the Thanksgiving data from the file, an error occurs:
'utf-8' codec can't decode byte 0xed in position 3599: invalid continuation byte
The file gets processed until the record with 'RespondentID':'4335955206' then it errors out.
Also, when opening the CSV file in PyCharm, I get "File was loaded in the wrong encoding: 'UTF-8'"
The error goes away when removing the line with RespondentID: '4335955152'
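To pin down which record is the culprit without deleting lines by trial and error, something like the sketch below may help. It reads the file as raw bytes and reports every line that does not decode. The function name `find_bad_lines` is hypothetical, and it assumes the file is meant to be UTF-8.

```python
def find_bad_lines(path, encoding="utf-8"):
    """Hypothetical helper: list the lines that fail to decode.

    Returns a list of (line_number, reason, raw_bytes) tuples,
    with line numbers starting at 1.
    """
    bad = []
    # Open in binary mode so nothing is decoded until we ask for it
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            try:
                raw.decode(encoding)
            except UnicodeDecodeError as err:
                bad.append((lineno, err.reason, raw))
    return bad
```

Printing the `raw_bytes` of each hit shows the RespondentID on that line along with the offending byte.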