How to ignore line separator character like <0x2028> when using smart_open and unzipping the file #761
-
I have file in which there exits event like this, Line 1 On unzipping and reading this file using smart_open, the above event is separated into two events(Line 1 and Line2) like this:
What I am looking for is to ignore this line separator so that single event won't get separated into two events. Any help would be appreciated? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Read the file as binary and perform the line splitting yourself (on the newline byte). |
Beta Was this translation helpful? Give feedback.
Read the file as binary and perform the line splitting yourself (on the newline byte).