Skip to content

CLI Captures "COOKIE BANNERS" and/or does not work the same as the Web Extension #143

@QueenieArtemisia

Description

@QueenieArtemisia

I was trying to download this page with the CLI: https://additional-creatures.fandom.com/wiki/Additional_Creatures:_Aquaria

  • it captures the cookie banner, you can test it.

Secondly, I tried downloading this page the same way: https://game-maps.com/MEA/Mass-Effect-Andromeda-Walkthrough.asp

  • This one downloads a dimmed page that does not have even 20% of the information it should.

Both cases are 100% reproducible for me - even in VMs.


Both pages download just fine with the web extension.
I am trying to use the CLI for these because I want to crawl... but if I do, every single page downloads the same way as the aforementioned - cookie banners on the first site, dimmed and missing info on the second.


Here's my settings code (when crawling):

"C:\Program Files\Google\Chrome\Application\single-file.exe" https://additional-creatures.fandom.com/wiki/Additional_Creatures:_Aquaria --filename-template="{page-title}.{filename-extension}" --crawl-links=true --crawl-inner-links-only=false --crawl-max-depth=1 --crawl-replace-urls=true --crawl-rewrite-rule="^(.)\?.$ $1"


  • Running as admin
  • Antivirus exception

What am I doing wrong?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions