You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+25-4
Original file line number
Diff line number
Diff line change
@@ -23,26 +23,47 @@ This project works with the data from [TMDB 5000 Movie Dataset from Kaggle](http
23
23
The steps to get the project up and running are:
24
24
25
25
1. Clone the repository to your local machine
26
-
2. MongoDB Atlas setup
26
+
2. MongoDB Atlas Cluster setup
27
27
1. Create account on MongoDB Atlas (if you don't already have one) and log in
28
28
2. Create a new project and deploy a free cluster
29
29
3. Add database user and save credentials (username and password)
30
30
4. Get connection string, should look like this: `mongodb+srv://<username>:<password>@<host>/?retryWrites=true&w=majority`, part after host is optional
31
-
3. Create a .env file in project root and fill in with your user credentials and host from the MongoDB connection string, vector search index name can be chosen arbitrarily
31
+
3. MongoDB Atlas Vector Search setup
32
+
1. Find deployed cluster in the Database section and create a database called 'semantic_search' with 'movies' collection in it
33
+
2. Create a vector search index with name 'moviesVectorSearch' and link it to created collection. For Index definition use the following JSON Editor:
34
+
```json
35
+
{
36
+
"mappings": {
37
+
"dynamic": true,
38
+
"fields": {
39
+
"embedding": {
40
+
"dimensions": 384,
41
+
"similarity": "cosine",
42
+
"type": "knnVector"
43
+
}
44
+
}
45
+
}
46
+
}
47
+
```
48
+
4. Create a .env file in project root and fill in with your user credentials and host from the MongoDB connection string. Fill in the DB name, movies collection name and search index name as you named them in MongoDB Atlas
0 commit comments