Skip to content

Commit c22f130

Browse files
committed
Improve FragmentFinder public API
1 parent e9eb5d7 commit c22f130

11 files changed

+876
-251
lines changed

CHANGELOG.md

+21
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,27 @@
22

33
All Notable changes to `Csv` will be documented in this file
44

5+
## [Next] - TBD
6+
7+
### Added
8+
9+
- `League\Csv\Fragment\Expression`
10+
- `League\Csv\Fragment\Selection` (internal class)
11+
12+
### Deprecated
13+
14+
- `League\Csv\FragmentFinder::findAll` use `League\Csv\Fragment\Expression::fragment` instead
15+
16+
### Fixed
17+
18+
- `Cast*` methods accept more input type.
19+
- `FragmentFinder` now removes duplicate selection.
20+
- `TabularaDataReader::matching` will return an empty `Iterable` instance when no selection is valid, previously and empty `TabularDataReader` instance was returned as unique item of the iterable returned.
21+
22+
### Removed
23+
24+
- None
25+
526
## [9.16.0](https://github.com/thephpleague/csv/compare/9.15.0...9.16.0) - 2024-05-24
627

728
### Added

docs/9.0/reader/statement.md

+75-6
Original file line numberDiff line numberDiff line change
@@ -434,16 +434,17 @@ Here are some selection example:
434434
- `cell=5,2-8,9` : will select the cells located between row `4` and column `1` and row `7` and column `8`;
435435

436436
Of note, the RFC allows for multiple selections, separated by a `;`. which are translated
437-
as `OR` expressions. To strictly cover The RFC the class exposes the `findAll` method
437+
as `OR` expressions. To strictly cover The RFC the class exposes the `find` method
438438
which returns an iterable containing the results of all found fragments as distinct `TabulatDataReader`
439439
instances.
440440

441-
<p class="message-warning">If some selections are invalid no error is returned; the invalid
442-
selection is skipped from the returned value.</p>
441+
<p class="message-info">This <code>find</code> method is introduced with version <code>9.17.0</code>.</p>
442+
<p class="message-notice">The <code>findAll</code> method is deprecated, you should use <code>find</code> instead.</p>
443+
<p class="message-warning">If some selections are invalid no error is returned; the invalid selection is skipped from the returned value.</p>
443444

444445
To restrict the returned values you may use the `findFirst` and `findFirstOrFail` methods.
445446
Both methods return on success a `TabularDataReader` instance. While the `first` method
446-
always return the first selection found or `null`; `firstOrFail` **MUST** return a
447+
always return the first selection found if it is not empty or `null`; `firstOrFail` **MUST** return a non-empty
447448
`TabularDataReader` instance or throw. It will also throw if the expression syntax is
448449
invalid while all the other methods just ignore the error.
449450

@@ -456,15 +457,83 @@ use League\Csv\FragmentFinder;
456457
$reader = Reader::createFromPath('/path/to/file.csv');
457458
$finder = FragmentFinder::create();
458459

459-
$finder->findAll('row=7-5;8-9', $reader); // return an Iterator<TabularDataReader>
460+
$finder->find('row=7-5;8-9', $reader); // return an Iterator<TabularDataReader>
460461
$finder->findFirst('row=7-5;8-9', $reader); // return an TabularDataReader
461462
$finder->findFirstOrFail('row=7-5;8-9', $reader); // will throw
462463
```
463464

464-
- `FragmentFinder::findAll` returns an Iterator containing a single `TabularDataReader` because the first selection
465+
- `FragmentFinder::find` returns an Iterator containing a single `TabularDataReader` because the first selection
465466
is invalid;
466467
- `FragmentFinder::findFirst` returns the single valid `TabularDataReader`
467468
- `FragmentFinder::findFirstOrFail` throws a `SyntaxError`.
468469

469470
Both classes, `FragmentFinder` and `Statement` returns an instance that implements the `TabularDataReader` interface
470471
which returns the found data in a consistent way.
472+
473+
### Fragment Expression builder
474+
475+
<p class="message-info">This mechanism is introduced with version <code>9.17.0</code>.</p>
476+
477+
The `Expression` class provides an immutable, fluent interface to create valid expressions.
478+
We can rewrite the previous example as followed:
479+
480+
```php
481+
use League\Csv\Reader;
482+
use League\Csv\Fragment\Expression;
483+
use League\Csv\FragmentFinder;
484+
485+
$reader = Reader::createFromPath('/path/to/file.csv');
486+
$finder = FragmentFinder::create();
487+
$expression = Expression::fromRow('7-5', '8-9');
488+
489+
$finder->find($expression, $reader); // return an Iterator<TabularDataReader>
490+
$finder->findFirst($expression, $reader); // return an TabularDataReader
491+
$finder->findFirstOrFail($expression, $reader); // will throw
492+
```
493+
494+
The `Expression` validates that your selections are valid according to the selection scheme chosen.
495+
The class exposes method to create fragment expression for:
496+
497+
- cell selection using `Expression::fromCell`;
498+
- row selection using `Expression::fromRow`;
499+
- column selection using `Expression::fromColumn`;
500+
501+
```php
502+
use League\Csv\Fragment\Expression;
503+
504+
$expression = Expression::fromRow('7-5', '8-9');
505+
echo $expression;
506+
// returns 'row=8-9' and removes `7-5` because it is an invalid selection
507+
```
508+
509+
You can even gradually create your expression using a fluent and immutable API
510+
using the `push`, `unshift` and `remove` methods. And there are convenient method to
511+
inspect the class to know how nmany selections are present and to select them according
512+
to their indices using the `get` a `has` methods. You are also able to tell if a specific
513+
selection in present via the `contains` method.
514+
515+
```php
516+
use League\Csv\Fragment\Expression;
517+
518+
$expression = Expression::fromRow()
519+
->push('5-8')
520+
->unshift('12-15')
521+
->replace('5-8', '12-*')
522+
->remove('12-15');
523+
524+
echo $expression->toString();
525+
// or
526+
echo $expression;
527+
// returns 'row=12-*'
528+
```
529+
530+
You can use que `Expression` to directly query a `TabularDataReader` using the `fragment` method.
531+
The result will be an iterable structure containing `TabularDataReader` instances whose index present the
532+
selection used to generate the data. The `firstFragment` method can be used to return the first
533+
`TabularDataReader` instance regardless if it contains data or not.
534+
535+
```php
536+
$document = Reader::createFromPath('/path/to/file.csv');
537+
Expression::fromRow('7-5', '8-9')->fragment($document); // returns an iterable<string, TabularDataReader>
538+
Expression::fromRow('7-5', '8-9')->firstFragment($document); // returns the first selection found regardless if it contains data or not
539+
```

docs/9.0/reader/tabular-data-reader.md

+1
Original file line numberDiff line numberDiff line change
@@ -506,6 +506,7 @@ $reader->matchingFirstOrFail('row=3-1;4-6'); // will throw
506506

507507
<p class="message-info"> Wraps the functionality of <code>FragmentFinder</code> class.</p>
508508
<p class="message-notice">Added in version <code>9.12.0</code> for <code>Reader</code> and <code>ResultSet</code>.</p>
509+
<p class="message-info">In addition to using a string expression, starting with version <code>9.17.0</code> you can alternatively use an `Expression` object.</p>
509510

510511
### chunkBy
511512

0 commit comments

Comments
 (0)