You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
From your paper it seems like the byte-level classification decomposes a character i.e. 'C' into its binary representation, something like 000101110, but your code gives back 68, which I think it's not what you intended, cause that is simply a char level representation.
Am I wrong?
Your dataset would be still fulfilling its purpose of using very long sequences, but I think it's not char-byte-level, but char-level.
The text was updated successfully, but these errors were encountered:
From your paper it seems like the byte-level classification decomposes a character i.e. 'C' into its binary representation, something like 000101110, but your code gives back 68, which I think it's not what you intended, cause that is simply a char level representation.
Am I wrong?
Your dataset would be still fulfilling its purpose of using very long sequences, but I think it's not char-byte-level, but char-level.
The text was updated successfully, but these errors were encountered: