Code to preserve original linefeeds (issue #121) #131

techtonik · 2015-10-19T10:30:59Z

This is pseudo code - it may work or may not. I haven't got chance to test it yet

takluyver · 2015-10-19T10:34:42Z

Nice, I like this idea

takluyver · 2015-10-19T10:35:24Z

libmodernize/main.py

+        # detect linefeeds
+        lineends = {'\n':0, '\r\n':0, '\r':0}
+        lines = []
+        for line in open(filename, 'rb'):


To make this work the same way on Python 2 and 3, use:

io.open(filename, newline='')

But file needs to be opened in binary more or else the information about lineends will be lost.

No, the newline='' means it will do no conversion of line endings. Binary mode has a much bigger effect on Python 3, because it means you're dealing with bytes rather than strs.

But binary mode guarantees that Python 3 won't bail out with UnicodeDecodeError. How to address that? I can't know the file encoding before opening it as Python 3 requires.

Fair point. There are ways around that, like using errors='ignore' to skip characters that can't be decoded, but it may be easier to use binary mode here. In that case, though, you'll need to update the checks below to use bytes, e.g. if line.endswith(b'\r\n').

techtonik · 2015-10-19T10:37:09Z

Well, Python 3 is a pain. But it should work for now on Python 2. I may be able to finish this later.

takluyver · 2015-10-19T10:38:21Z

libmodernize/main.py

+            newline = [x for x in lineends if lineends[x] != 0][0]
+            if os.linesep != newline:
+                with open(filename, 'wb') as f:
+                    for line in lines:


You're using the lines that you read before the file was rewritten, so this will undo the changes modernize made. You need to base it on new_text, or re-read the file after it is changed.

Right! Fixed in 7787615

@takluyver

Thanks @takluyver for review

techtonik · 2015-10-19T11:56:42Z

I am not sure I can work on a test. Not today. Basically, the test need to create two files - one with LF linefeeds and another with CRLF. And after fix is applied, check that linefeeds didn't change. So either produce 4 files (input and output) or copy/paste the function being tested to the test case (which is no good). The files are also can not be committed to Git, because it messes with linefeeds.

takluyver · 2015-10-19T12:33:36Z

I added machinery for testing line endings in #130 (actually, it's a modification of @daira's machinery in #129). We can reuse that for this, although the actual tests will need to be slightly different.

takluyver · 2015-10-19T12:35:16Z

libmodernize/main.py

+    CRLF = '\r\n'
+    CR = '\r'
+else:
+    LF = bytes('\n', encoding='ascii')


Just use the b prefix, like b'\n'. It's valid syntax on Python 2.6 and above, which is what we support. And then you don't need an if/else, because on Python 2, b'\n' == '\n'.

daira · 2015-10-20T12:43:33Z

This isn't correct as-is, but I like the basic approach. I will work on it tomorrow.

graingert · 2020-08-23T22:00:32Z

@techtonik I think this feature would be best in https://github.com/jreese/fissix

techtonik · 2020-09-05T09:35:14Z

@graingert I see this project became derived from fissix, which need more docs to understand what is latest lib2to3 and what are enhancements.

graingert · 2020-09-05T09:57:17Z

@techtonik not much has changed, but you can see what enhancements are applied here:
https://github.com/jreese/fissix/blob/main/scripts/update.sh

Everything goes via black + a git merge from CPython master

@jreese maybe a changelog will help?

techtonik added 2 commits October 19, 2015 13:27

Code to preserve original linefeeds (issue PyCQA#121)

b5e8eb5

Typos

d410995

techtonik mentioned this pull request Oct 19, 2015

Line ending options #130

Closed

import os is needed

cb502ae

takluyver reviewed Oct 19, 2015
View reviewed changes

daira mentioned this pull request Oct 19, 2015

121 add line ending options #129

Closed

daira added the needs test label Oct 19, 2015

techtonik added 4 commits October 19, 2015 13:51

Make sure to write new_text on rewriting newlines

7787615

Thanks @takluyver for review

Convert \r\n to constants for Python 3

3d6a520

Reread source file as binary for Python 3

b7c81aa

Prevent ResourceWarning's

ff8f479

takluyver reviewed Oct 19, 2015
View reviewed changes

Use bytes literals which are compatible back to Python 2.6

e51e19f

Fix indentation

4e45b5a

graingert added the problem with fissix label Sep 3, 2020

graingert mentioned this pull request Sep 3, 2020

121 add line ending options 1 #132

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code to preserve original linefeeds (issue #121) #131

Code to preserve original linefeeds (issue #121) #131

techtonik commented Oct 19, 2015

takluyver commented Oct 19, 2015

takluyver Oct 19, 2015

techtonik Oct 19, 2015

takluyver Oct 19, 2015

techtonik Oct 19, 2015

takluyver Oct 19, 2015

techtonik commented Oct 19, 2015

takluyver Oct 19, 2015

techtonik Oct 19, 2015

techtonik commented Oct 19, 2015

takluyver commented Oct 19, 2015

takluyver Oct 19, 2015

techtonik Oct 19, 2015

daira commented Oct 20, 2015

graingert commented Aug 23, 2020

techtonik commented Sep 5, 2020

graingert commented Sep 5, 2020

Code to preserve original linefeeds (issue #121) #131

Are you sure you want to change the base?

Code to preserve original linefeeds (issue #121) #131

Conversation

techtonik commented Oct 19, 2015

takluyver commented Oct 19, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

techtonik commented Oct 19, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

techtonik commented Oct 19, 2015

takluyver commented Oct 19, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daira commented Oct 20, 2015

graingert commented Aug 23, 2020

techtonik commented Sep 5, 2020

graingert commented Sep 5, 2020