Spaces:
Sleeping
Sleeping
added welcome message file
Browse files- spaces/welcome_message.md +22 -0
spaces/welcome_message.md
ADDED
@@ -0,0 +1,22 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
## Welcome!
|
2 |
+
|
3 |
+
*⚠️ This is a private demo using my own OpenAI API key. Please use responsibly. ⚠️*
|
4 |
+
|
5 |
+
This chatbot uses retrieval augmented generation (RAG) to answer questions about topics
|
6 |
+
that are discussed in my master thesis. My master thesis can be found on the [GitHub repo](https://github.com/leonardpasi/mythesis-chatbot) of the project.
|
7 |
+
|
8 |
+
Here you get to choose between three RAG techniques:
|
9 |
+
- **classic retrieval** (which includes a reranker model, so it's actually not the simplest RAG imaginable)
|
10 |
+
- **sentence window retrieval**
|
11 |
+
- **auto-merging retrieval**
|
12 |
+
|
13 |
+
Feel free to experiment with different modes! Note that a little extra delay is to be expected when switching to another mode.
|
14 |
+
Also, note that all your queries (as well as system responses) are automatically logged on a remote PostgreSQL database for continuous monitoring of the deployed systems.
|
15 |
+
|
16 |
+
Each of these systems has been optimized for performance by doing a grid search on the
|
17 |
+
relevant parameters. Performance is quantified with five metrics:
|
18 |
+
- **context relevance**: is the retrieved context relevant to the query?
|
19 |
+
- **groundedness**: is the response supported by the context?
|
20 |
+
- **answer relevance**: is the response relevant to the query?
|
21 |
+
- **cost**
|
22 |
+
- **latency**
|