maslionok
commited on
Commit
·
9f9797d
1
Parent(s):
64f0f95
added explanation what is solr
Browse files
app.py
CHANGED
|
@@ -43,7 +43,7 @@ with gr.Blocks(title="Solr Normalization Demo") as demo:
|
|
| 43 |
"""
|
| 44 |
# 🧹 Solr Normalization Pipeline Demo
|
| 45 |
|
| 46 |
-
This demo showcases the **Solr Normalization Pipeline**, which replicates the text preprocessing steps applied by Solr during indexing to help you understand how raw input is transformed before becoming searchable.
|
| 47 |
|
| 48 |
The pipeline applies:
|
| 49 |
- **Tokenization** (splitting text into searchable units)
|
|
@@ -63,6 +63,14 @@ with gr.Blocks(title="Solr Normalization Demo") as demo:
|
|
| 63 |
"""
|
| 64 |
)
|
| 65 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 66 |
with gr.Row():
|
| 67 |
with gr.Column():
|
| 68 |
text_input = gr.Textbox(
|
|
|
|
| 43 |
"""
|
| 44 |
# 🧹 Solr Normalization Pipeline Demo
|
| 45 |
|
| 46 |
+
This demo showcases the **Solr Normalization Pipeline**, which replicates the text preprocessing steps applied by Solr <span title="Solr is the platform that provides search capabilities in Impresso. Several preprocessing steps must be undertaken to prepare data to be searchable in Solr. These steps are common in Natural Language Processing pipelines, as they help with normalising textual data by, for example, making the whole text lowercase. This makes possible non case-sensitive searches, where if you either write 'Dog' or 'dog', you can get the same results.">ℹ️</span> during indexing to help you understand how raw input is transformed before becoming searchable.
|
| 47 |
|
| 48 |
The pipeline applies:
|
| 49 |
- **Tokenization** (splitting text into searchable units)
|
|
|
|
| 63 |
"""
|
| 64 |
)
|
| 65 |
|
| 66 |
+
# Add Solr explanation accordion
|
| 67 |
+
with gr.Accordion("❓ What is Solr?", open=False) as solr_info:
|
| 68 |
+
gr.Markdown("""
|
| 69 |
+
**Solr** is the platform that provides search capabilities in Impresso. Several preprocessing steps must be undertaken to prepare data to be searchable in Solr.
|
| 70 |
+
|
| 71 |
+
These steps are common in Natural Language Processing pipelines, as they help with normalising textual data by, for example, making the whole text lowercase. This makes possible non case-sensitive searches, where if you either write 'Dog' or 'dog', you can get the same results.
|
| 72 |
+
""")
|
| 73 |
+
|
| 74 |
with gr.Row():
|
| 75 |
with gr.Column():
|
| 76 |
text_input = gr.Textbox(
|