Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions Custom-search CLI/output.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
#,Title,Link
41 changes: 41 additions & 0 deletions Custom-search CLI/readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
# Custom-search CLI
A simple Python script that uses the **Google Custom Search API** to fetch search results and export them into a CSV file.


## Requirements
- Python 3.8+
- A Google API key
- A Google Custom Search Engine (CX) ID
- Install dependencies:
```bash
pip install requests
pip install beautifulsoup4
pip install python-csv
pip install argparse
Comment on lines +12 to +14
Copy link

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The dependencies listed are incorrect. The script doesn't use beautifulsoup4 or python-csv (csv is built-in), and argparse is part of the standard library. Only 'requests' is needed as an external dependency.

Suggested change
pip install beautifulsoup4
pip install python-csv
pip install argparse

Copilot uses AI. Check for mistakes.
```

## Setup
1. Get a Google API key from [Google Cloud Console](https://console.cloud.google.com/)
2. Create a Custom Search Engine (CX) at [Google CSE](https://cse.google.com/cse/all)
3. Run the script with your API key to create setting.json:

python main.py -sq [SEARCH_QUERY] --add_api_key [YOUR_API_KEY]
Copy link

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The filename in the example is incorrect. It should be 'scraper.py' instead of 'main.py' to match the actual script filename.

Suggested change
python main.py -sq [SEARCH_QUERY] --add_api_key [YOUR_API_KEY]
python scraper.py -sq [SEARCH_QUERY] --add_api_key [YOUR_API_KEY]

Copilot uses AI. Check for mistakes.

## Usage
Search with query:
```bash
python scraper.py -sq "github"
```
Fetch multiple pages (10 results per page):
```bash
python scraper.py -sq "github" --pages 3
```
## Output
- Results are saved in output.csv in the following columns:

\# , Title , Link

> [!NOTE] </br>
> Free quota: 100 queries/day (10 results per query). </br>
> If `setting.json` is missing or doesn’t have an API key, use `--add_api_key`.
---
101 changes: 101 additions & 0 deletions Custom-search CLI/scraper.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,101 @@
import requests
import json
import os
import csv
import argparse
from typing import List, Dict, Tuple, Any

SETTING_ROUTE = 'setting.json'
DEFAULT_CX = 'b0264518c3d104eda'


def load_settings(api_key: str | None = None) -> Dict[str, str]:
"""
Load API settings from setting.json, or create it if missing.
"""
if os.path.exists(SETTING_ROUTE):
with open(SETTING_ROUTE, 'r', encoding="utf-8") as f:
settings = json.load(f)

if not settings.get("API_KEY"):
if api_key:
settings["API_KEY"] = api_key
with open(SETTING_ROUTE, 'w', encoding="utf-8") as f:
json.dump(settings, f, indent=4)
else:
raise ValueError("API_KEY is missing in setting.json. Use --add_api_key to add one.")
else:
if not api_key:
raise FileNotFoundError("No setting.json found. Please run with --add_api_key to create one.")
settings = {"API_KEY": api_key, "CX": DEFAULT_CX}
with open(SETTING_ROUTE, 'w', encoding="utf-8") as f:
json.dump(settings, f, indent=4)

return settings


def scrape(search_query: str, api_key: str, cx: str, pages: int = 1) -> Tuple[List[Dict[str, Any]], float]:
"""
Perform a Google Custom Search and return results.
"""
results = []
search_time = 0.0

for page in range(pages):
start = page * 10 + 1
url = (
f"https://www.googleapis.com/customsearch/v1"
f"?key={api_key}&q={search_query}&cx={cx}&start={start}"
)

response = requests.get(url)
if response.status_code != 200:
raise RuntimeError(f"API request failed: {response.status_code} {response.text}")

data = response.json()

if "items" not in data:
print("No results found or error:", data)
break

results.extend(data["items"])
search_time += float(data['searchInformation']['searchTime'])

return results, search_time


def export_to_csv(results: List[Dict[str, Any]], filename: str = "output.csv") -> None:
"""
Export search results to a CSV file.
"""
rows = [[i + 1, item.get("title", ""), item.get("link", "")] for i, item in enumerate(results)]

with open(filename, "w", encoding="utf-8", newline="") as f:
writer = csv.writer(f)
writer.writerow(["#", "Title", "Link"])
writer.writerows(rows)

print(f"Exported {len(results)} results to {filename}")


def main():
parser = argparse.ArgumentParser(description="Google Custom Search scraper")
parser.add_argument("-sq", "--search_query", required=True, help="Search query to search for")
parser.add_argument("--add_api_key", type=str, help="Your Google API key")
parser.add_argument("--pages", type=int, default=1, help="Number of pages of results to fetch")
args = parser.parse_args()

settings = load_settings(args.add_api_key)
api_key = settings["API_KEY"]
cx = settings["CX"]

print(f"Using API key: {api_key}")
Copy link

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Printing the API key to console poses a security risk as it could expose sensitive credentials in logs or terminal history. Consider printing only a masked version or removing this line entirely.

Suggested change
print(f"Using API key: {api_key}")
print(f"Using API key: {api_key[:4]}...{api_key[-4:]} (masked)")

Copilot uses AI. Check for mistakes.

results, elapsed_time = scrape(args.search_query, api_key, cx, args.pages)

export_to_csv(results)
print(f"Completed in {elapsed_time:.2f} seconds.")


if __name__ == "__main__":
main()
4 changes: 4 additions & 0 deletions Custom-search CLI/setting.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
{
"API_KEY": "",
"CX": ""
}
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -55,6 +55,7 @@ More information on contributing and the general code of conduct for discussion
| CSV to Excel | [CSV to Excel](https://github.com/DhanushNehru/Python-Scripts/tree/main/CSV%20to%20Excel) | A Python script to convert a CSV to an Excel file. |
| CSV_TO_NDJSON | [CSV to Excel](https://github.com/DhanushNehru/Python-Scripts/tree/main/CSV_TO_NDJSON) | A Python script to convert a CSV to an NDJSON files file. |
| Currency Script | [Currency Script](https://github.com/DhanushNehru/Python-Scripts/tree/main/Currency%20Script) | A Python script to convert the currency of one country to that of another. |
| Custom-search CLI | [Custom-search CLI](https://github.com/DhanushNehru/Python-Scripts/tree/main/Custom-search%20CLI) | Python script to search a query through internet and save the results in a .csv file. |
Copy link

Copilot AI Sep 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Title contains a spelling error: 'Cutom-search' should be 'Custom-search'. Also, there are excessive spaces between 'Custom-search' and 'CLI' that should be normalized.

Suggested change
| Custom-search CLI | [Custom-search CLI](https://github.com/DhanushNehru/Python-Scripts/tree/main/Custom-search%20CLI) | Python script to search a query through internet and save the results in a .csv file. |
| Custom-search CLI | [Custom-search CLI](https://github.com/DhanushNehru/Python-Scripts/tree/main/Custom-search%20CLI) | Python script to search a query through internet and save the results in a .csv file. |

Copilot uses AI. Check for mistakes.
| Digital Clock | [Digital Clock](https://github.com/DhanushNehru/Python-Scripts/tree/main/Digital%20Clock) | A Python script to preview a digital clock in the terminal. |
| Display Popup Window | [Display Popup Window](https://github.com/DhanushNehru/Python-Scripts/tree/main/Display%20Popup%20Window) | A Python script to preview a GUI interface to the user. |
| Distance Calculator | [Distance Calculator](https://github.com/Mathdallas-code/Python-Scripts/tree/main/Distance%20Calculator) | A Python script to calculate the distance between two points.
Expand Down