Generalize parser tests and allow parsing of the old log files #1

kaklik · 2025-10-17T22:28:29Z

Summary

Introduce the parser test module test_parser.py.
Make the parser tests iterate over any non-hidden fixtures placed in data/ and skip cleanly when none exist.
Make the parser recognize the old log file formats.
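
The fixture discovery described in the summary could look roughly like the sketch below. This is only an illustration under stated assumptions: the helper name discover_fixtures is hypothetical and is not taken from the PR, and "non-hidden" is assumed to mean a file name that does not start with a dot.

```python
from pathlib import Path


def discover_fixtures(data_dir: Path) -> list[Path]:
    """Return the non-hidden regular files in data_dir, sorted by path.

    Hypothetical helper: a missing directory yields an empty list so that
    callers can skip instead of failing.
    """
    if not data_dir.is_dir():
        return []
    return sorted(
        p for p in data_dir.iterdir()
        if p.is_file() and not p.name.startswith(".")  # assumed meaning of "hidden"
    )
```

In a pytest module the returned list could feed pytest.mark.parametrize, and a test could call pytest.skip when the list is empty.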

Copilot

Pull Request Overview

This PR refactors parser logic into a dedicated module and adds comprehensive test coverage. The main goal is to decouple parsing functionality from the GUI layer (PyQt) to enable independent testing and improve code organization.

Parser classes moved from dosview/__init__.py to new dosview/parsers.py module
Added parametrized tests for parser detection and validation in tests/test_parser.py
Included test fixture data file and GitHub Actions CI workflow

Reviewed Changes

Copilot reviewed 6 out of 8 changed files in this pull request and generated 6 comments.

File	Description
dosview/parsers.py	New module containing extracted and refactored parser classes with type hints
dosview/__init__.py	Removed inline parser code and replaced with import from parsers module
tests/test_parser.py	Added comprehensive parser tests with fixture-based validation
data/DATALOG_AIRDOS_GEO.TXT	Test fixture data for legacy AIRDOS log format
.github/workflows/tests.yml	CI workflow configuration for automated testing
dosview/version.py	Version bump to 0.1.22

Copilot · 2025-10-29T21:35:28Z

dosview/parsers.py

+                                for idx, val in enumerate(parts[4:]):
+                                    try:
+                                        current_hist[idx] += int(val)
+                                    except ValueError:


[nitpick] Catching ValueError without reporting it masks potential parsing issues. The old code used a broad except Exception, which was also problematic. While this is an improvement, consider logging the error, or at least the line number and context, when a ValueError occurs to aid debugging of malformed log files.
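
One way to act on this suggestion is sketched below. It is a minimal illustration, not the PR's code: the function name accumulate_hist, the line_no parameter, and the skipped-field counter are assumptions introduced here.

```python
import logging

logger = logging.getLogger(__name__)


def accumulate_hist(current_hist: list[int], values: list[str], line_no: int) -> int:
    """Add integer fields to current_hist in place.

    Returns the number of fields that could not be parsed. Each failure is
    logged with the line number (hypothetical parameter) and the raw value.
    """
    skipped = 0
    for idx, val in enumerate(values):
        try:
            current_hist[idx] += int(val)
        except ValueError:
            skipped += 1
            logger.warning(
                "line %d: non-integer histogram field %d: %r", line_no, idx, val
            )
    return skipped
```

Returning the count lets the caller decide whether a file with many malformed fields should be rejected outright.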

Copilot · 2025-10-29T21:35:28Z

dosview/parsers.py

+                    case "$HIST":
+                        df_lines.append(parts[1:])
+                    case "$HITS":
+                        for i in range(2, len(parts) - 1, 2):


Off-by-one error: range(2, len(parts) - 1, 2) will skip the last value if len(parts) is even. For example, if parts has indices 0,1,2,3,4,5 (length 6), this range produces [2,4], missing the pair at indices 4,5. Should be range(2, len(parts), 2) or ensure pairs exist with range(2, len(parts) - 1, 2) only if len(parts) is odd.

Suggested change

-for i in range(2, len(parts) - 1, 2):
+for i in range(2, len(parts), 2):
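
The two ranges can be compared directly. The sketch below assumes, since the loop body is not shown in the hunk, that each iteration reads parts[i] together with parts[i + 1]. Under that assumption both ranges visit the same start indices when len(parts) is even. They differ only for an odd length, where the suggested range yields a final index that has no partner. The helper name pair_starts is hypothetical.

```python
def pair_starts(n_parts: int, stop_offset: int) -> list[int]:
    """Start indices produced by range(2, n_parts - stop_offset, 2)."""
    return list(range(2, n_parts - stop_offset, 2))


# Even length (indices 0..5): both forms cover the pairs (2, 3) and (4, 5).
original_even = pair_starts(6, 1)   # range(2, 5, 2)
suggested_even = pair_starts(6, 0)  # range(2, 6, 2)

# Odd length (indices 0..6): the suggested form also yields 6,
# and parts[7] does not exist.
original_odd = pair_starts(7, 1)    # range(2, 6, 2)
suggested_odd = pair_starts(7, 0)   # range(2, 7, 2)
```

If the body does index parts[i + 1], the suggested range would need a length check for odd-length lines.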

Copilot · 2025-10-29T21:35:29Z

dosview/parsers.py

+                    case _:
+                        continue
+        if not df_lines:
+            raise ValueError("Soubor neobsahuje žádné záznamy $HIST pro starší log.")


[nitpick] Error message is in Czech. Consider using English for consistency with the rest of the codebase (comments, docstrings, and other error messages are in English).

Suggested change

-raise ValueError("Soubor neobsahuje žádné záznamy $HIST pro starší log.")
+raise ValueError("The file does not contain any $HIST records for the older log format.")

Copilot · 2025-10-29T21:35:29Z

dosview/parsers.py

+    for parser_cls in LOG_PARSERS:
+        if parser_cls.detect(file_path):
+            return parser_cls(file_path)
+    raise ValueError("Neznámý typ logu nebo žádný vhodný parser.")


[nitpick] Error message is in Czech. Consider using English for consistency with the rest of the codebase (comments, docstrings, and other error messages are in English).

Suggested change

-raise ValueError("Neznámý typ logu nebo žádný vhodný parser.")
+raise ValueError("Unknown log type or no suitable parser found.")
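
For context, the dispatch loop in this hunk follows a simple registry pattern: each parser class exposes a detect method, and the first class that claims the file is instantiated. The sketch below illustrates that pattern only. The classes MarkerParser, NewFormatParser and OldFormatParser, and the "$NEW" marker, are hypothetical stand-ins; the real detection logic in dosview/parsers.py is not shown in the diff.

```python
from __future__ import annotations

from pathlib import Path


class MarkerParser:
    """Hypothetical base class: claims a file if any line starts with MARKER."""

    MARKER = ""

    def __init__(self, file_path: str | Path):
        self.file_path = Path(file_path)

    @classmethod
    def detect(cls, file_path: str | Path) -> bool:
        with open(file_path, encoding="utf-8", errors="replace") as fh:
            return any(line.startswith(cls.MARKER) for line in fh)


class NewFormatParser(MarkerParser):
    MARKER = "$NEW"   # made-up marker, for illustration only


class OldFormatParser(MarkerParser):
    MARKER = "$HIST"  # record type named in the review above


# Order matters: the first parser whose detect() returns True wins.
LOG_PARSERS = [NewFormatParser, OldFormatParser]


def get_parser_for_file(file_path: str | Path) -> MarkerParser:
    for parser_cls in LOG_PARSERS:
        if parser_cls.detect(file_path):
            return parser_cls(file_path)
    raise ValueError("Unknown log type or no suitable parser found.")
```

Keeping detection on the class means a new log format can be supported by appending one class to LOG_PARSERS.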

Copilot · 2025-10-29T21:35:29Z

tests/test_parser.py

+import importlib.util
+
+import numpy as np
+import pytest
+
+ROOT = Path(__file__).resolve().parent.parent
+PARSERS_PATH = ROOT / "dosview" / "parsers.py"
+
+spec = importlib.util.spec_from_file_location("dosview_parsers", PARSERS_PATH)
+parsers = importlib.util.module_from_spec(spec)
+assert spec.loader is not None
+spec.loader.exec_module(parsers)
+
+LOG_PARSERS = parsers.LOG_PARSERS
+get_parser_for_file = parsers.get_parser_for_file
+parse_file = parsers.parse_file
+


[nitpick] The test uses dynamic module loading via importlib.util instead of standard imports. Since dosview.parsers is now a proper module in the package, consider using from dosview import parsers or from dosview.parsers import ... directly. This approach is more maintainable and allows IDEs to provide proper code completion and type checking.

Suggested change

-import importlib.util
-
-import numpy as np
-import pytest
-
-ROOT = Path(__file__).resolve().parent.parent
-PARSERS_PATH = ROOT / "dosview" / "parsers.py"
-
-spec = importlib.util.spec_from_file_location("dosview_parsers", PARSERS_PATH)
-parsers = importlib.util.module_from_spec(spec)
-assert spec.loader is not None
-spec.loader.exec_module(parsers)
-
-LOG_PARSERS = parsers.LOG_PARSERS
-get_parser_for_file = parsers.get_parser_for_file
-parse_file = parsers.parse_file
+import numpy as np
+import pytest
+
+from dosview.parsers import LOG_PARSERS, get_parser_for_file, parse_file

Copilot · 2025-10-29T21:35:30Z

dosview/parsers.py

+    raise ValueError("Neznámý typ logu nebo žádný vhodný parser.")
+
+
+def parse_file(file_path: str | Path):


Missing return type annotation. Based on the implementation, this function returns a list containing [np.ndarray, np.ndarray, np.ndarray, dict]. Consider adding the return type annotation for better API documentation and type safety.

Suggested change

-def parse_file(file_path: str | Path):
+def parse_file(file_path: str | Path) -> list[np.ndarray | dict]:

Generalize parser data-driven tests

93513a9

kaklik added the codex label Oct 17, 2025 — with ChatGPT Codex Connector

kaklik added 2 commits October 18, 2025 00:32

Add logfile fragments.

3f26371

Update version.py

b7921ce

kaklik requested a review from roman-dvorak October 17, 2025 22:47

kaklik changed the title ~~Generalize parser tests and drop bundled fixtures~~ Generalize parser tests and allow parsing of the old log files Oct 17, 2025

roman-dvorak requested a review from Copilot October 29, 2025 21:33

Copilot AI reviewed Oct 29, 2025
