Sujal Bhat
committed on
Commit
·
cd1f0ae
1
Parent(s):
323d65b
task1
deliverables/Task1.md +3 -4
deliverables/Task1.md
CHANGED
@@ -15,21 +15,20 @@ Hint: Create a list of potential questions that people are likely to ask!
 
 ✅ Deliverables:
 
-1. Describe the default chunking strategy that you will use
+**1. Describe the default chunking strategy that you will use.**
 <div style="color: green;">
 The default chunking strategy used is a combination of size-based splitting and thematic categorization.
 This strategy uses RecursiveCharacterTextSplitter with a chunk size of 1000 characters and an overlap of 200 characters. It then categorizes these chunks based on predefined themes.
 </div>
 
-2. Articulate a chunking strategy that you would also like to test out
+**2. Articulate a chunking strategy that you would also like to test out.**
 
 <div style="color: green;">
 A pure size-based chunking strategy without thematic categorization. This would involve splitting the text into fixed-size chunks without attempting to categorize them based on themes.
 </div>
 
 
-
-3. Describe how and why you made these decisions
+**3. Describe how and why you made these decisions**
 <div style="color: green;">
 The default strategy was chosen for its simplicity and efficiency:
 
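The two strategies described in this change (size-based splitting with overlap, with and without thematic categorization) can be illustrated with a small sketch. This is not the code from the repository: it is a plain-Python stand-in that assumes a simple sliding window, whereas `RecursiveCharacterTextSplitter` also tries to break on separators such as paragraphs and sentences. The function names, the `THEMES` table, and its keywords are hypothetical and exist only for illustration; the 1000/200 sizes come from the deliverable text.

```python
from typing import Dict, List, Tuple

# Hypothetical theme table: the deliverable mentions "predefined themes"
# but does not list them, so these names and keywords are placeholders.
THEMES: Dict[str, Tuple[str, ...]] = {
    "pricing": ("price", "cost"),
    "safety": ("risk", "safety"),
}


def split_fixed(text: str, chunk_size: int = 1000, overlap: int = 200) -> List[str]:
    """Split text into fixed-size chunks that share `overlap` characters.

    A simplified sliding window; it ignores paragraph and sentence boundaries.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks: List[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


def categorize(chunk: str, themes: Dict[str, Tuple[str, ...]]) -> List[str]:
    """Return every theme whose keywords occur in the chunk (case-insensitive)."""
    lowered = chunk.lower()
    matched = [name for name, keywords in themes.items()
               if any(keyword in lowered for keyword in keywords)]
    return matched or ["uncategorized"]


def chunk_with_themes(text: str) -> List[Tuple[str, List[str]]]:
    """Default strategy: size-based split, then tag each chunk with themes."""
    return [(chunk, categorize(chunk, THEMES)) for chunk in split_fixed(text)]
```

Under these assumptions, calling `split_fixed(text)` on its own corresponds to the pure size-based strategy proposed for testing, while `chunk_with_themes(text)` adds the categorization step of the default strategy, so the two can be compared on the same input.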