Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon • May 9, 2024 • 12
Tradeoffs Between Alignment and Helpfulness in Language Models with Representation Engineering Paper • 2401.16332 • Published Jan 29, 2024
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection Paper • 2505.19103 • Published May 25, 2025 • 13
Getting it Right: Improving Spatial Consistency in Text-to-Image Models Paper • 2404.01197 • Published Apr 1, 2024 • 32
LVLM-Interpret: An Interpretability Tool for Large Vision-Language Models Paper • 2404.03118 • Published Apr 3, 2024 • 27
FastRM: An efficient and automatic explainability framework for multimodal generative models Paper • 2412.01487 • Published Dec 2, 2024 • 1
Inference Performance Optimization for Large Language Models on CPUs Paper • 2407.07304 • Published Jul 10, 2024 • 54
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 19
ABSApp: A Portable Weakly-Supervised Aspect-Based Sentiment Extraction System Paper • 1909.05608 • Published Sep 12, 2019
Term Set Expansion based NLP Architect by Intel AI Lab Paper • 1808.08953 • Published Aug 27, 2018 • 1