Consider changing to/giving option of couchDB? #212

PidgeyL · 2015-08-19T09:07:03Z

Should we consider merging to couchDB, or making CVE-Search compatible with both?
I have heard a lot of negative comments about Mongo, and it would be neat to give multiple database options.
Also, the database layer should be abstracted a lot more as well. (I can do this)
My idea would be to make a database abstraction layer, which implements functions for both mongo, couch, postgres,... (we could further this if we see fit), and then we might be able to change databases with the configuration files.
Your thoughts? @adulau @wimremes

adulau · 2015-08-19T12:01:04Z

Sure, it's a good idea. I think the best would be to abstract more the database access.

Some document-based database like Hyperdex has compatible MongoDB compatible database http://hackingdistributed.com/2015/01/12/hyperdex-1.6.0/ .

Maybe we should start together to abstract more the database access. Then we see if can split the Document-based access compared to the key-value store access.

replace posts with ajax requests

pombredanne · 2017-03-25T15:24:00Z

Have you ever considered using a more structured DB, e.g. a traditional relational DB?

PidgeyL · 2019-07-17T14:09:24Z

We will start migration to Postgres soon

github-actions · 2020-10-03T01:40:56Z

Stale issue message

iTosun · 2021-06-11T14:46:09Z

When is the postgres migration planned?

P-T-I · 2021-06-16T04:27:06Z

Not as far as I'm aware off.

baonq-me · 2023-12-13T04:01:15Z

Not as far as I'm aware off.

@P-T-I If you have any migration plan, I would like to spend some weeks executing it. I have experience in MongoDB, PostgreSQL, and ElasticSearch. As I see that the cve-search code base is a little bit messy, so a complete rework is needed, maybe a 4.3 version.

P-T-I · 2023-12-13T05:14:16Z

@baonq-me Well there where some thoughts (also briefly discussed with @oh2fih) stripping cve-search from all backend code and keep it solely as a front end; then let CveXplore handle all the backend code. So in that case moving towards as little overlap between the two as possible (probably taking care of the messy code base in the process) and let the users choose based on their requirements; if they prefer to work on the cli they would only need to use CveXplore and if they would like a GUI they could simply add cve-search to the mix. So any new database logic should be added to cvexplore. For me this split up in functionality makes sense, so you agree? So coming back on the topic; I believe a sql backend is a nice addition too. But I wouldn't narrow it towards postgres, I would opt for a SqlAlchemy ORM model approach so you could use a variety of sql databases (MySQL, mariadb, postgres etc).

baonq-me · 2023-12-13T06:25:46Z

@baonq-me Well there where some thoughts (also briefly discussed with @oh2fih) stripping cve-search from all backend code and keep it solely as a front end; then let CveXplore handle all the backend code. So in that case moving towards as little overlap between the two as possible (probably taking care of the messy code base in the process) and let the users choose based on their requirements; if they prefer to work on the cli they would only need to use CveXplore and if they would like a GUI they could simply add cve-search to the mix. So any new database logic should be added to cvexplore. For me this split up in functionality makes sense, so you agree? So coming back on the topic; I believe a sql backend is a nice addition too. But I wouldn't narrow it towards postgres, I would opt for a SqlAlchemy ORM model approach so you could use a variety of sql databases (MySQL, mariadb, postgres etc).

I agree that letting CveXplore handle all the backend code is a good choice in terms of maintainability. Let's do a quick analysis:

As I know, there are three kinds of users:

The first is a person who only needs to use CLI functionality and doesn't wish to run anything on their system continuously. He just needs something clean and quick.
The second is people who are in some kind of air-gap system (like SOC) and do not have an internet connection so they need to use tools like cve-search to search for CVE for collaboration in tasks like Incident response.
The third is people who want to integrate cve-search into the existing system. For example, I have a list of 3rd apps running in my system, and I want to know which one has CVE as soon as possible (ideally several hours after CVE announcement). So, cve-search can be an alert tool or an HTTP endpoint to provide data for other systems.

This analysis led me to the idea that we can use the SqlAlchemy ORM model as you said.

For the first group of people, a lightweight embedded database like SQLite is acceptable, portable, and almost requires no installation.
The second and third should be okay with MySQL, PostgreSQL, MariaDB, etc.
Another advantage of this approach is that we can utilize the Change-data-capture (CDC) capability to do more things like reindexing data to another database like ElasticSearch (full-text search), Redis (caching) or message queue (alert new CVEs) while requiring no coding at all.

Here are my additional ideas for this big refactor:

When we initialize the cve-search instance, we only need to use CveXplore to initialize its SQLite database first (can include it like the way CVE-Search-Docker do), then import that SQLite database to a higher-level database like MySQL, PostgreSQL, MariaDB, etc. This approach will decouple two projects. This process can be slow but just need to do one single time.
As I see that with the current workflow where cve-search is highly dependent on the CveXplore interface, it is very tricky for new developers or when I need to do some debugging. Pass mongodb connection string when initialize CveXplore cve-search#1030 can be an example as MongoDB connection strings is not passed from cve-search to CveXplore.

P-T-I · 2023-12-13T08:14:42Z

As I know, there are three kinds of users:

The first is a person who only needs to use CLI functionality and doesn't wish to run anything on their system continuously. He just needs something clean and quick.

The second is people who are in some kind of air-gap system (like SOC) and do not have an internet connection so they need to use tools like cve-search to search for CVE for collaboration in tasks like Incident response.

The third is people who want to integrate cve-search into the existing system. For example, I have a list of 3rd apps running in my system, and I want to know which one has CVE as soon as possible (ideally several hours after CVE announcement). So, cve-search can be an alert tool or an HTTP endpoint to provide data for other systems.

Agreed with those three groups; I believe for the first CveXplore alone should suffice; for the second both CveXplore and CveSearch should be needed and for the third either CveSearch or CveXplore could suffice depending on 'how' you would facilitate 3rd party integration (CveSearch HTTP API or via the the CveXplore package)

Another advantage of this approach is that we can utilize the Change-data-capture (CDC) capability to do more things like reindexing data to another database like ElasticSearch (full-text search), Redis (caching) or message queue (alert new CVEs) while requiring no coding at all.

I like this idea; especially the push towards a message queue (Kafka would be the defacto goto I guess). These functionalities should be added into the CveXplore functionality, right?

When we initialize the cve-search instance, we only need to use CveXplore to initialize its SQLite database first (can include it like the way CVE-Search-Docker do), then import that SQLite database to a higher-level database like MySQL, PostgreSQL, MariaDB, etc. This approach will decouple two projects. This process can be slow but just need to do one single time.

I'm reluctant to actually add a database dump into the code base (I know I've done this in the CVE-Search-Docker repo); I would opt in hosting database dumps externally, which might already could be provided by the vulnerability-lookup project.

As I see that with the current workflow where cve-search is highly dependent on the CveXplore interface, it is very tricky for new developers or when I need to do some debugging. Pass mongodb connection string when initialize CveXplore cve-search#1030 can be an example as MongoDB connection strings is not passed from cve-search to CveXplore.

Although I agree; I do not see a way around this; once this path of decoupling is taken, there is no way back. For the long term I would say the configuration effort, maintainability and de-duplication of code benefits outweighs the 'high dependency' downfall.

I would suggest we move this discussion into a new project / issue list in the cve-search/CveXplore repo, agreed?

If so, I'll transfer this issue into the cve-search/CveXplore repo

baonq-me · 2023-12-13T10:24:46Z

I like this idea; especially the push towards a message queue (Kafka would be the defacto goto I guess). These functionalities should be added into the CveXplore functionality, right?

I can update CVE-Search-Docker for demonstration as well as documents to clarify this.

I would suggest we move this discussion into a new project / issue list in the cve-search/CveXplore repo, agreed?

I agree

P-T-I · 2023-12-13T19:14:40Z

I can update CVE-Search-Docker for demonstration as well as documents to clarify this.

Which specific demonstration are you talking about?

pombredanne · 2023-12-14T00:28:12Z

FWIW, my concern with MongoDB was/is that this is no longer using an open source license.

baonq-me · 2023-12-14T03:28:37Z

I can update CVE-Search-Docker for demonstration as well as documents to clarify this.

Which specific demonstration are you talking about?

Here is my reference architecture.

In my use case, cve-search/CveXplore is not just a tool to search for CVEs but also a offline data source to provide the ability to detect vulnerable software timely as well as highly reliable and fully automated from gathering software versions to detection, even in very special cases like CVE-2023-22522 where vulnerable configurations are updated by NVD 5 days after vendor announcement (in this case human people must be involved to manually review).

Of course, I can add more code to cve-search/CveXplore if needed. The above is just a reference architecture to express my idea.

P-T-I · 2023-12-14T06:35:12Z

Looks very nice; I'll give a more detailed response in an hour or so, let me get to the office first ;-)

P-T-I · 2023-12-14T08:08:02Z

Right, the way I see it is that the bulk of the logic needs to be incorporated into the cvexplore repo (the green parts). The GUI (frontend as discussed earlier) should be the cvesearch repo (purple):

The dotted parts should be, in my opinion, made optional / configurable and cvesearch should be able to fully function with, but also without them present.
The blue boxes (alerting etc) are 3rd party options / integrations, but not part of both code bases and are out of scope for development, right?

baonq-me · 2023-12-14T08:39:09Z

The dotted parts should be, in my opinion, made optional / configurable and cvesearch should be able to fully function with, but also without them present.

The blue boxes (alerting etc) are 3rd party options / integrations, but not part of both code bases and are out of scope for development, right?

I agree both. I think we can narrow down those ideas to a more detailed task list that needs to be done.

P-T-I · 2023-12-14T09:41:24Z

Ageed; I'll start on that task list right away (in random order), please append as you see fit, I'll create a new issue: #213 as a master issue to track the work

P-T-I · 2023-12-14T10:10:23Z

FWIW, my concern with MongoDB was/is that this is no longer using an open source license.

@pombredanne any specific database whishes?

P-T-I · 2023-12-19T16:10:53Z

closing in favor of #213

pombredanne referenced this issue in pombredanne/cve-search Sep 18, 2015

#87 error handling on no/bad internet connection and invalid urls

37fbcbf

pombredanne referenced this issue in pombredanne/cve-search Sep 18, 2015

Merge pull request #87 from PidgeyL/master

cadd9ea

replace posts with ajax requests

PidgeyL self-assigned this Jul 17, 2019

PidgeyL added the In progress In progress label Jul 17, 2019

github-actions bot added the no-issue-activity label Oct 3, 2020

github-actions bot closed this as completed Oct 10, 2020

P-T-I removed the no-issue-activity label Dec 13, 2023

P-T-I reopened this Dec 13, 2023

P-T-I transferred this issue from cve-search/cve-search Dec 13, 2023

P-T-I unassigned PidgeyL Dec 13, 2023

P-T-I added the enhancement New feature or request label Dec 13, 2023

baonq-me mentioned this issue Dec 16, 2023

Retire HTTP API as datasource for cvexplore #222

Closed

P-T-I closed this as completed Dec 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider changing to/giving option of couchDB? #212

Consider changing to/giving option of couchDB? #212

PidgeyL commented Aug 19, 2015

adulau commented Aug 19, 2015

pombredanne commented Mar 25, 2017

PidgeyL commented Jul 17, 2019

github-actions bot commented Oct 3, 2020

iTosun commented Jun 11, 2021

P-T-I commented Jun 16, 2021

baonq-me commented Dec 13, 2023

P-T-I commented Dec 13, 2023

baonq-me commented Dec 13, 2023 •

edited

Loading

P-T-I commented Dec 13, 2023

baonq-me commented Dec 13, 2023

P-T-I commented Dec 13, 2023

pombredanne commented Dec 14, 2023

baonq-me commented Dec 14, 2023 •

edited

Loading

P-T-I commented Dec 14, 2023

P-T-I commented Dec 14, 2023

baonq-me commented Dec 14, 2023

P-T-I commented Dec 14, 2023

P-T-I commented Dec 14, 2023

P-T-I commented Dec 19, 2023

Consider changing to/giving option of couchDB? #212

Consider changing to/giving option of couchDB? #212

Comments

PidgeyL commented Aug 19, 2015

adulau commented Aug 19, 2015

pombredanne commented Mar 25, 2017

PidgeyL commented Jul 17, 2019

github-actions bot commented Oct 3, 2020

iTosun commented Jun 11, 2021

P-T-I commented Jun 16, 2021

baonq-me commented Dec 13, 2023

P-T-I commented Dec 13, 2023

baonq-me commented Dec 13, 2023 • edited Loading

P-T-I commented Dec 13, 2023

baonq-me commented Dec 13, 2023

P-T-I commented Dec 13, 2023

pombredanne commented Dec 14, 2023

baonq-me commented Dec 14, 2023 • edited Loading

P-T-I commented Dec 14, 2023

P-T-I commented Dec 14, 2023

baonq-me commented Dec 14, 2023

P-T-I commented Dec 14, 2023

P-T-I commented Dec 14, 2023

P-T-I commented Dec 19, 2023

baonq-me commented Dec 13, 2023 •

edited

Loading

baonq-me commented Dec 14, 2023 •

edited

Loading