Nicolay Rusnachenko's picture

Nicolay Rusnachenko

nicolay-r

AI & ML interests

Information Retrieval・Medical Multimodal NLP (πŸ–Ό+πŸ“) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP

Recent Activity

Organizations

None yet

nicolay-r's activity

posted an update about 8 hours ago
view post
Post
288
πŸ“’ Replicate IO just started support πŸ‹ DeepSeek-R1 hosting!
https://replicate.com/deepseek-ai/deepseek-r1

If you wish to quick start with reasoning over your dataset data, I just added support at Replicate provider for bulk-chain:
https://github.com/nicolay-r/nlp-thirdgate/blob/ebcdec156eb43f9c32d0d70aadc2d26765d31b75/llm/replicate_104.py#L14-L21

πŸ”§ What I fixed (see my setups in the second screenshot)
- 🌑️default temperature is 0.6
- ❌ no system prompt

Here is a quick start for applying R1 for reasoning over your data (see first screenshot):
https://github.com/nicolay-r/bulk-chain?tab=readme-ov-file#shell

πŸš€ Perfomance: ~24 tokens / sec. In my experience the peformance is way more faster than at OpenRouter, similar to playground
πŸ’΅ Price: 10 USD / 10 USD per 1M tokens
🌟 bulk-chain: https://github.com/nicolay-r/bulk-chain
reacted to clem's post with πŸš€πŸ”₯ 1 day ago
posted an update 1 day ago
view post
Post
1045
🚨 If you want a quickly apply various reasoning techniques 🧠 for your dataset, then I am happy to save your time and introduce 🌌 nlp-thirdgate 🌌

https://github.com/nicolay-r/nlp-thirdgate

This is a hub of a third-party providers like OpenAI, Replicate, OpenRouter, Hugging Face πŸ€— Transformers to be used for varions NLP tasks in a no-string mode. So that, you decide which dependecies to install, which I personally see is handy for:
πŸ“™ quick scripts deployment in notebooks like Google Colab;
πŸ“¦ empowering existing apps with machnine learning;

πŸ“· The example below demonstrates on how to quick start with reasoning over rows of CSV / JSONL data.

To quick start, all you have to do is to download one of the provider and pass it to the script as shown in the image below.
🌟 Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain
posted an update 3 days ago
view post
Post
1397
πŸ“’ For those who wish to launch distilled DeepSeek R1 for reasoning with schema, sharing the Google Colab notebook:
πŸ“™ https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_deep_seek_7b_distill_colab.ipynb
This is a wrapper of the Qwen2 transformers πŸ€— provider via bulk-chain framework.
Model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
GPU: T4 (15GB) is nearly enough in float32 mode.
πŸš€ To boost the performance you may set bf16 mode (use_bf16=True)
🌟 Powered by bulk-chain: https://github.com/nicolay-r/bulk-chain
posted an update 4 days ago
view post
Post
1733
πŸ“’ For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
βœ… OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
βœ… Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
βœ… Json Schema for Chain-of-Though reasoning (see screenshot πŸ“· below)

πŸ“Ί below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

πŸ“ƒ To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

⏳ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1
  • 1 reply
Β·
posted an update 5 days ago
view post
Post
743
πŸ“’ For those who tracking adances in Sentiment Analysis, this post might be relevant for you. So far we arranged RuOpinionNE-2024 competition with final stage that has just been completed several days ago. Here the quick findings we got from the top submissions. πŸŽ‰

πŸ” First, RuOpinonNE-2024 competition πŸ’» on extraction of opinion tuples (opinion source, opinion object, tonality, linguistic expression) from low resource domain news texts written in Russian language. The competition is hosted by codalab platform:

https://codalab.lisn.upsaclay.fr/competitions/20244

To asses the advanes, we adopt F1 over sentiment classes which also involves evalution of the spans.
πŸ‘ Among 7 participants in total, the top three submissions showcase the following results:

πŸ₯‰msuai F1=0.33 🎊
πŸ₯ˆRefalMachine showcase +0.02 F1=0.35 🎊
πŸ†VatolinAlexey showcase +0.06 F1=0.41 🎊

πŸ“ At present, the competition organizers are working on:
1. 🟑 Collecting information about models utilized participants to contribute here with pre-trained models / concepts;
2. 🟑 Wrapping up findings from the submissions in a paper.

πŸ”” For more information and further updates, the most complete source that complements codalab is this github:
https://github.com/dialogue-evaluation/RuOpinionNE-2024

The RuOpinionNE-2024 are now in post-evaluation stage, so everyone interested in low resouce domain evaluation on opinon extraction are welcome πŸ™Œ
posted an update 7 days ago
view post
Post
363
πŸ“’ Deligted to share the new version of the bulk-ner which represent a tiny framework that would save you time for deploing NER with any model.

πŸ“¦: https://pypi.org/project/bulk-ner/0.25.1/
🌟: https://github.com/nicolay-r/bulk-ner

The direct adaptaion of the LM for NER would result in spending signifcant amount of time on formatting your texts according to the NER-model needs.
In particular:
1. Processing CONLL format with B-I-O tags from model outputs
2. Input trimming: long input content might not be completely fitted

The 0.25.1 I made a huge steps forward by providing:
βœ… Enhanced integration by providing function for casting extracted enties to your type (see picture below)
βœ… Enhanced integration with AREkit pipelines
βœ… Simpified API for using (Example using DeepPavlov NER models): https://github.com/nicolay-r/bulk-ner/wiki#api

πŸ‘ The code for pipeline deployment is taken from the AREkit project:
https://github.com/nicolay-r/AREkit
reacted to cutechicken's post with πŸš€ 10 days ago
view post
Post
2844
πŸ”¬ PaperImpact
: Scientific Impact Predictor Powered by Deep Learning 🎯

VIDraft/PaperImpact

πŸ“š Overview
A cutting-edge AI system that combines transformer architecture with citation pattern analysis to predict research impact. Our model, trained on 120,000+ CS papers, analyzes innovation potential, methodological robustness, and future impact, providing researchers with valuable insights before publication.
🧠 Scientific Foundation

BERT-based semantic analysis
Citation network pattern learning
NDCG optimization & MSE loss
Cross-validated prediction engine
GPU-accelerated inference

πŸ’« Why Researchers Need This

Pre-submission impact assessment
Research direction optimization
Time-saving paper evaluation
Competitive edge in academia
Trend identification advantage

🎯 Key Features

One-click arXiv paper analysis
Real-time impact scoring (0-1)
9-tier grading system (AAA-C)
Smart input validation
Instant visual feedback

🌟 Unique Benefits
"Don't wait years to know your paper's impact. Get instant, AI-powered insights to strengthen your research strategy and maximize your academic influence."
Perfect for:

Research authors
PhD students
Journal editors
Research institutions
Grant committees

#ResearchImpact #AcademicAI #ScienceMetrics #ResearchExcellence
  • 1 reply
Β·
posted an update 10 days ago
view post
Post
474
πŸ“’ I am happy to share the bulk-translate 0.25.1. 🎊
This is a framework that allows you adapt your LM or use default (googletranslate API) for a quick translation of your dataset data.

⭐ https://github.com/nicolay-r/bulk-translate

bulk-translate is a tiny Python 🐍 no-string framework that allows translate a massive datasets of pre-annotated fixed-spans with related metadata that are invariant for translator (see picture below). It supports πŸ‘¨β€πŸ’» API for quick data translation with (optionaly) annotated objects in texts (see figure below) in Python 🐍

In new release we enahnce:
1. Fixed: sync type checking for spans representation
2. Compatibility with AREkit pipelines

πŸ€– The quick tutorial for applying it towards list of textual data with optional spans:
https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/translate_texts_with_spans_via_googletrans.ipynb
reacted to mkurman's post with πŸ”₯πŸ‘ 11 days ago