Remove the ability to use SliceReader with raw bytes #436

dralley · 2022-07-23T03:13:22Z

In the near future, decoding will be performed automatically as the input is read. If the input has an unknown encoding, it must be decoded first, necessitating a buffer. Therefore only the buffered implementation can be used for Reader::from_bytes()

If the encoding of the bytes is known up-front, you can decode them up-front and subsequently use Reader::from_str() for borrowing behavior if desired.

Made some utilities such as detect_encoding(), decode(), and decode_with_bom_removal() available as standalone functions so that they can be used on user-provided data.

This commit only moves code without significant changes (the only changes is: - corrected imports - add imports to the doc comments which have become inaccessible )

Main code moved from `read_namespaced_event_into` to `resolve_namespaced_event_inner`

This also changes the test cases in the `reader::test::check` macro to allow for reader-specific tests.

…he input slice

dralley · 2022-07-23T03:44:40Z

Waiting on #425

Next: Decoding into an internal buffer, then parsing the decoded data

After: Evaluate whether user-provided buffers are still useful once we already have one internally anyway - if the benefit is minimal or nonexistent the API can be collapsed back into borrowing-based APIs which would allow us to deduplicate a bunch of code again.

After: Swap the internals of Event, Attribute, etc. and remove the decoding functionality and wrappers

dralley · 2022-07-23T04:01:00Z

src/reader/buffered_reader.rs

+        Reader::from_reader_internal(BufferedReader(bytes))
+    }
+
+    #[cfg(feature = "encoding")]


Moving these because it doesn't make sense to check these for both when SliceReader can't work with encoded bytes.

codecov-commenter · 2022-07-23T04:03:05Z

Codecov Report

Merging #436 (cc17a44) into master (ebbcce0) will increase coverage by 2.53%.
The diff coverage is 79.22%.

@@            Coverage Diff             @@
##           master     #436      +/-   ##
==========================================
+ Coverage   49.51%   52.04%   +2.53%     
==========================================
  Files          22       26       +4     
  Lines       13847    13453     -394     
==========================================
+ Hits         6856     7002     +146     
+ Misses       6991     6451     -540

Flag	Coverage Δ
unittests	`52.04% <79.22%> (+2.53%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
benches/macrobenches.rs	`0.00% <0.00%> (ø)`
benches/microbenches.rs	`0.00% <0.00%> (ø)`
examples/read_buffered.rs	`0.00% <0.00%> (ø)`
examples/read_texts.rs	`0.00% <0.00%> (ø)`
src/de/escape.rs	`21.05% <ø> (ø)`
src/de/seq.rs	`91.83% <ø> (ø)`
src/de/simple_type.rs	`90.63% <ø> (ø)`
src/events/mod.rs	`68.20% <ø> (ø)`
src/lib.rs	`12.26% <0.00%> (ø)`
src/reader/buffered_reader.rs	`76.87% <76.87%> (ø)`
... and 7 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ebbcce0...cc17a44. Read the comment docs.

In the near future, decoding will be performed automatically as the input is read. If the input has an unknown encoding, it must be decoded first, necessitating a buffer. Therefore only the buffered implementation can be used for `Reader::from_bytes()` If the encoding of the bytes is known up-front, you can decode them up-front and subsequently use `Reader::from_str()` if desired.

Mingun and others added 9 commits July 20, 2022 12:00

Move buffered and borrowing parts of reader to separate files

ded1b77

This commit only moves code without significant changes (the only changes is: - corrected imports - add imports to the doc comments which have become inaccessible )

Implement reading namespaced events for borrowing reader

7aba3dd

Main code moved from `read_namespaced_event_into` to `resolve_namespaced_event_inner`

Change the check! macro to more flexibly define buffers

10c736e

Introduce SliceReader and BufferedReader

a018ada

Split reader into BufferedReader and SliceReader

9eb0d9b

This also changes the test cases in the `reader::test::check` macro to allow for reader-specific tests.

Remove buffered access for SliceReader as events always borrow from t…

b6a2af1

…he input slice

Add example for buffered access when reading from a file

c972101

Add changelog entry

c3a07b6

Add debug_assert! in a few places to protect invariants

559d0e8

dralley requested a review from Mingun July 23, 2022 03:13

dralley marked this pull request as draft July 23, 2022 03:24

dralley force-pushed the buffer-decode2 branch from 72f6a41 to ea93eeb Compare July 23, 2022 03:43

dralley marked this pull request as ready for review July 23, 2022 03:43

dralley force-pushed the buffer-decode2 branch from ea93eeb to fe73aa1 Compare July 23, 2022 03:52

dralley commented Jul 23, 2022

View reviewed changes

dralley force-pushed the buffer-decode2 branch from fe73aa1 to dca3a71 Compare July 23, 2022 04:03

dralley marked this pull request as draft July 23, 2022 04:41

dralley added 2 commits July 23, 2022 10:28

Move everything related to actually decoding text to a new module

f4bb0db

dralley force-pushed the buffer-decode2 branch from dca3a71 to f4bb0db Compare July 23, 2022 14:28

dralley added the encoding Issues related to support of various encodings of the XML documents label Jul 23, 2022

Provide some utilities for decoding entire buffers

cc17a44

dralley force-pushed the buffer-decode2 branch from 48806e1 to cc17a44 Compare July 23, 2022 17:54

dralley closed this Jul 24, 2022

dralley deleted the buffer-decode2 branch July 24, 2022 17:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove the ability to use SliceReader with raw bytes #436

Remove the ability to use SliceReader with raw bytes #436

Uh oh!

dralley commented Jul 23, 2022 •

edited

Loading

Uh oh!

dralley commented Jul 23, 2022 •

edited

Loading

Uh oh!

dralley Jul 23, 2022

Uh oh!

codecov-commenter commented Jul 23, 2022 •

edited

Loading

Uh oh!

Uh oh!

Remove the ability to use SliceReader with raw bytes #436

Remove the ability to use SliceReader with raw bytes #436

Uh oh!

Conversation

dralley commented Jul 23, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dralley commented Jul 23, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dralley Jul 23, 2022

Choose a reason for hiding this comment

Uh oh!

codecov-commenter commented Jul 23, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

dralley commented Jul 23, 2022 •

edited

Loading

dralley commented Jul 23, 2022 •

edited

Loading

codecov-commenter commented Jul 23, 2022 •

edited

Loading