Commit 1fe25fb

fix: Correct typos vid_ds_scripts.md (#2669)
- Fixed the phrase 'In our we filtering we required all scores to be below the threshold…' by removing the extra 'we'. This correction clarifies the sentence and ensures proper grammar.
- Fixed the phrase 'for each filters as well as the captions from Florence-2' by changing 'filters' to 'filter'. 'Each' should be followed by a singular noun, which improves the accuracy of the text.
1 parent 1378d4e commit 1fe25fb

1 file changed: +2 -2 lines changed

vid_ds_scripts.md

@@ -105,13 +105,13 @@ The shard of glass scores low at 4.04
 |:---------:|:-----:|
 | 4.04 | ![19s8CRUVf3E-Scene-015_1.jpg](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/blog/vid_ds_scripts/19s8CRUVf3E-Scene-015_1.jpg) |
 
-In our we filtering we required all scores to be below the threshold, in this case using the aesthetic score from the first frame only would be a more effective strategy.
+In our filtering we required all scores to be below the threshold, in this case using the aesthetic score from the first frame only would be a more effective strategy.
 
 If we review [`finetrainers/crush-smol`](https://huggingface.co/datasets/finetrainers/crush-smol) we can notice that many of the objects being crushed are round or rectangular and colorful which is similar to our findings in the example frames. Aesthetic scores can be useful yet have a bias that will potentially filter out good data when used with extreme thresholds like > 5.5. It may be more effective as a filter for bad content than good with a minimum threshold of around 4.25 - 4.5.
 
 ### OCR/Caption
 
-Here we provide some visual examples for each filter**s** as well as the captions from Florence-2.
+Here we provide some visual examples for each filter as well as the captions from Florence-2.
 
 <table>
 <tr>
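
The context paragraphs in this hunk contrast two ways of applying an aesthetic score: checking every sampled frame, or checking only the first frame. The following is a minimal sketch of that contrast, not the script the document describes. It assumes the "minimum threshold" reading from the hunk (a clip is kept when its score is at or above the minimum); the function names and the per-frame scores are hypothetical, with 4.04 borrowed from the table row above only as an illustrative low value.

```python
from typing import Sequence

# Lower end of the 4.25 - 4.5 range mentioned in the text (assumed to be a minimum).
MIN_AESTHETIC = 4.25


def keep_all_frames(scores: Sequence[float], minimum: float = MIN_AESTHETIC) -> bool:
    """Keep a clip only if every sampled frame meets the minimum score."""
    return len(scores) > 0 and all(score >= minimum for score in scores)


def keep_first_frame(scores: Sequence[float], minimum: float = MIN_AESTHETIC) -> bool:
    """Keep a clip if its first sampled frame meets the minimum score."""
    return len(scores) > 0 and scores[0] >= minimum


# Hypothetical per-frame scores for one clip; only the last frame scores low.
clip_scores = [5.20, 4.90, 4.04]

all_frames_decision = keep_all_frames(clip_scores)
first_frame_decision = keep_first_frame(clip_scores)
```

With these hypothetical scores the all-frames rule drops the clip because of a single low frame, while the first-frame rule keeps it, which is the trade-off the paragraph points at.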
