templates: add RAG template for Intel Xeon Scalable Processors #18424

lvliang-intel · 2024-03-02T14:48:42Z

Description:
This template utilizes Chroma and TGI (Text Generation Inference) to execute RAG on the Intel Xeon Scalable Processors. It serves as a demonstration for users, illustrating the deployment of the RAG service on the Intel Xeon Scalable Processors and showcasing the resulting performance enhancements.

Issue:
None

Dependencies:
The template contains the poetry project requirements to run this template.
CPU TGI batching is WIP.

Twitter handle:
None

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

vercel · 2024-03-02T14:48:46Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
langchain	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Mar 28, 2024 11:25pm

…hain-ai#18424) **Description:** This template utilizes Chroma and TGI (Text Generation Inference) to execute RAG on the Intel Xeon Scalable Processors. It serves as a demonstration for users, illustrating the deployment of the RAG service on the Intel Xeon Scalable Processors and showcasing the resulting performance enhancements. **Issue:** None **Dependencies:** The template contains the poetry project requirements to run this template. CPU TGI batching is WIP. **Twitter handle:** None --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>

**Description:** This template utilizes Chroma and TGI (Text Generation Inference) to execute RAG on the Intel Xeon Scalable Processors. It serves as a demonstration for users, illustrating the deployment of the RAG service on the Intel Xeon Scalable Processors and showcasing the resulting performance enhancements. **Issue:** None **Dependencies:** The template contains the poetry project requirements to run this template. CPU TGI batching is WIP. **Twitter handle:** None --------- Signed-off-by: lvliang-intel <liang1.lv@intel.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>

templates: add RAG template for Intel Xeon Scalable Processors

2d40139

Signed-off-by: lvliang-intel <liang1.lv@intel.com>

efriis added the template label Mar 2, 2024

efriis self-assigned this Mar 2, 2024

lvliang-intel marked this pull request as ready for review March 12, 2024 02:01

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Mar 12, 2024

Merge branch 'master' into rag_template_for_intel_xeon

c2b40d5

dosubot bot added Ɑ: Runnables Related to Runnables 🔌: chroma Primarily related to ChromaDB integrations 🤖:enhancement A large net-new component, integration, or chain. Use sparingly. The largest features labels Mar 12, 2024

vercel bot deployed to Preview March 12, 2024 02:14 View deployment

Merge branch 'master' into rag_template_for_intel_xeon

006efe8

baskaryan approved these changes Mar 28, 2024

View reviewed changes

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Mar 28, 2024

vercel bot deployed to Preview March 28, 2024 22:51 View deployment

fmt

376cb3b

vercel bot deployed to Preview March 28, 2024 23:25 View deployment

baskaryan merged commit 0175906 into langchain-ai:master Mar 29, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

templates: add RAG template for Intel Xeon Scalable Processors #18424

templates: add RAG template for Intel Xeon Scalable Processors #18424

lvliang-intel commented Mar 2, 2024 •

edited

Loading

vercel bot commented Mar 2, 2024 •

edited

Loading

templates: add RAG template for Intel Xeon Scalable Processors #18424

templates: add RAG template for Intel Xeon Scalable Processors #18424

Conversation

lvliang-intel commented Mar 2, 2024 • edited Loading

vercel bot commented Mar 2, 2024 • edited Loading

lvliang-intel commented Mar 2, 2024 •

edited

Loading

vercel bot commented Mar 2, 2024 •

edited

Loading