Nicolay Rusnachenko

nicolay-r

AI & ML interests

Information Retrieval・Medical Multimodal NLP (🖼+📝) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP

Posts 45

Post
🚨 If you want to quickly apply various reasoning techniques 🧠 to your dataset, I am happy to save you time and introduce 🌌 nlp-thirdgate 🌌

https://github.com/nicolay-r/nlp-thirdgate

This is a hub of third-party providers such as OpenAI, Replicate, OpenRouter, and Hugging Face 🤗 Transformers, to be used for various NLP tasks in a no-strings mode. That way, you decide which dependencies to install, which I personally find handy for:
📙 quick script deployment in notebooks like Google Colab;
📦 empowering existing apps with machine learning.

📷 The example below demonstrates how to get started quickly with reasoning over rows of CSV / JSONL data.

To get started, all you have to do is download one of the providers and pass it to the script, as shown in the image below.
🌟 Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain
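
For readers without the image, here is a minimal sketch of the idea of reasoning over rows of CSV / JSONL data with a pluggable provider. It is not the nlp-thirdgate or bulk-chain API: the names `iter_rows`, `reason_over_rows`, and `echo_provider`, as well as the schema layout, are assumptions made up for this illustration, and the stub provider only echoes the prompt instead of calling a real model.

```python
import csv
import io
import json


def iter_rows(text, fmt):
    """Yield one dict per record from CSV or JSONL text."""
    if fmt == "csv":
        yield from csv.DictReader(io.StringIO(text))
    elif fmt == "jsonl":
        for line in text.splitlines():
            if line.strip():
                yield json.loads(line)
    else:
        raise ValueError(f"unsupported format: {fmt}")


def reason_over_rows(rows, schema, ask):
    """Apply every prompt of the schema to each row, step by step.

    `schema` is a list of {"prompt": ..., "out": ...} steps (a hypothetical
    layout for this sketch). `ask` is any callable that maps a prompt string
    to a response string; it stands in for a downloaded provider.
    """
    for row in rows:
        record = dict(row)
        for step in schema:
            # Outputs of earlier steps are visible to later prompts.
            record[step["out"]] = ask(step["prompt"].format(**record))
        yield record


def echo_provider(prompt):
    """Stub provider (assumption): swap in a real LLM call here."""
    return f"<answer to: {prompt}>"


schema = [
    {"prompt": "Summarize: {text}", "out": "summary"},
    {"prompt": "What is the sentiment of: {summary}", "out": "sentiment"},
]
data = '{"text": "I love this library"}\n{"text": "Setup was painful"}\n'
results = list(reason_over_rows(iter_rows(data, "jsonl"), schema, echo_provider))
```

Because the provider is just a callable, the same loop works with any backend, and only the dependencies of the provider you pick need to be installed.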
Post
📢 For those who wish to launch the distilled DeepSeek R1 for reasoning with a schema, I am sharing a Google Colab notebook:
📙 https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_deep_seek_7b_distill_colab.ipynb
This is a wrapper of the Qwen2 transformers 🤗 provider via the bulk-chain framework.
Model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
GPU: T4 (15GB) is nearly enough in float32 mode.
🚀 To boost performance, you may enable bf16 mode (use_bf16=True).
🌟 Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain
