Sugato Ray sugatoray

I am a Physicist turned Data Scientist + ML Practitioner. Research Interests: Data Science, ML, DL, Statistics, Math, Computing, LLMs.

sugatoray / synthetic_data_deepseekr1_qwen_distill.py

Created January 27, 2025 22:48 — forked from davidberenstein1957/synthetic_data_deepseekr1_qwen_distill.py

	# /// script
	# requires-python = ">=3.11,<3.12"
	# dependencies = [
	# "distilabel[hf-transformers, hf-inference-endpoints]",
	# ]
	# ///
	from distilabel.models import InferenceEndpointsLLM
	from distilabel.pipeline import InstructionResponsePipeline

	repo_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"

davidberenstein1957 / synthetic_data_deepseekr1_qwen_distill.py

Last active February 28, 2025 13:10

	# /// script
	# requires-python = ">=3.11,<3.12"
	# dependencies = [
	# "distilabel[hf-transformers, hf-inference-endpoints]",
	# ]
	# ///
	from distilabel.models import InferenceEndpointsLLM
	from distilabel.pipeline import InstructionResponsePipeline

	repo_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"

sugatoray / hf_models_license_top20_dist.md

Created January 3, 2025 11:02

HF Model License Top20 Dist

tomaarsen / export_locally.py

Created October 15, 2024 12:30

Export Sentence Transformer models to ONNX (+ optimization, quantization) & OpenVINO

	# requires sentence_transformers>=3.2.0
	from sentence_transformers import SentenceTransformer, export_optimized_onnx_model, export_dynamic_quantized_onnx_model

	# The model to export to ONNX (+ optimize, quantize), OpenVINO
	model_id = "mixedbread-ai/mxbai-embed-large-v1"
	# Where to save the exported models locally
	output_dir = model_id.replace("/", "-")

	onnx_model = SentenceTransformer(model_id, backend="onnx", model_kwargs={"export": True})
	onnx_model.save_pretrained(output_dir)

sugatoray / prompt.txt

Created October 6, 2024 14:53 — forked from philschmid/prompt.txt

	Begin by enclosing all thoughts within <thinking> tags, exploring multiple angles and approaches.
	Break down the solution into clear steps within <step> tags. Start with a 20-step budget, requesting more for complex problems if needed.
	Use <count> tags after each step to show the remaining budget. Stop when reaching 0.
	Continuously adjust your reasoning based on intermediate results and reflections, adapting your strategy as you progress.
	Regularly evaluate progress using <reflection> tags. Be critical and honest about your reasoning process.
	Assign a quality score between 0.0 and 1.0 using <reward> tags after each reflection. Use this to guide your approach:

	0.8+: Continue current approach
	0.5-0.7: Consider minor adjustments
	Below 0.5: Seriously consider backtracking and trying a different approach
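The reward thresholds in the prompt above can be read as a small decision rule. The sketch below is only an illustration of that rule, not part of the gist: the function name `next_action` and the action labels are hypothetical, and because the prompt does not say what to do for scores between 0.7 and 0.8, the sketch assumes they fall into the "minor adjustments" band.

```python
def next_action(reward: float) -> str:
    """Map a <reward> score to the action suggested by the prompt.

    Hypothetical helper: the labels "continue", "adjust", and "backtrack"
    are illustrative names, not strings defined by the prompt itself.
    """
    if not 0.0 <= reward <= 1.0:
        raise ValueError("reward must be between 0.0 and 1.0")
    if reward >= 0.8:
        # 0.8+: continue the current approach
        return "continue"
    if reward >= 0.5:
        # 0.5-0.7: consider minor adjustments.
        # Assumption: the unspecified 0.7-0.8 gap is treated the same way.
        return "adjust"
    # Below 0.5: seriously consider backtracking
    return "backtrack"


if __name__ == "__main__":
    for score in (0.9, 0.6, 0.2):
        print(score, next_action(score))
```

A caller could apply this after each reflection to decide whether to keep spending the step budget on the current line of reasoning.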

philschmid / prompt.txt

Last active April 28, 2025 22:55

	Begin by enclosing all thoughts within <thinking> tags, exploring multiple angles and approaches.
	Break down the solution into clear steps within <step> tags. Start with a 20-step budget, requesting more for complex problems if needed.
	Use <count> tags after each step to show the remaining budget. Stop when reaching 0.
	Continuously adjust your reasoning based on intermediate results and reflections, adapting your strategy as you progress.
	Regularly evaluate progress using <reflection> tags. Be critical and honest about your reasoning process.
	Assign a quality score between 0.0 and 1.0 using <reward> tags after each reflection. Use this to guide your approach:

	0.8+: Continue current approach
	0.5-0.7: Consider minor adjustments
	Below 0.5: Seriously consider backtracking and trying a different approach

sugatoray / pipeline_parallel.py

Created October 2, 2024 17:20 — forked from 3outeille/pipeline_parallel.py

Self contained example of how pipeline parallel works (AFAB and 1F1B) in 200 LOC

	#VERBOSE=0 torchrun --nproc_per_node 3 self_contained_pp_LOC.py
	import os, random, numpy as np, torch, torch.nn as nn, torch.distributed as dist, torch.nn.functional as F
	from torch.optim import AdamW
	from torch.utils.data import DataLoader, DistributedSampler
	from datasets import load_dataset
	from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

	STEP, local_rank, world_size, verbose = 0, int(os.environ["LOCAL_RANK"]), int(os.environ["WORLD_SIZE"]), os.environ.get("VERBOSE", "0") == "1"

	def set_all_seed(seed):
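The preview above is cut off before the scheduling logic, so the following is a separate, minimal sketch of how the two schedules named in the description are commonly ordered, not the gist's own code. It assumes AFAB means "all forward, all backward" and uses the usual non-interleaved 1F1B layout (warmup forwards, alternating forward/backward, then cooldown backwards). The function names and the `F{i}`/`B{i}` labels are hypothetical, and no communication or tensors are involved.

```python
def afab_schedule(num_microbatches: int) -> list[str]:
    """All forward passes first, then all backward passes."""
    forwards = [f"F{i}" for i in range(num_microbatches)]
    backwards = [f"B{i}" for i in range(num_microbatches)]
    return forwards + backwards


def one_f_one_b_schedule(rank: int, num_stages: int, num_microbatches: int) -> list[str]:
    """Per-stage operation order for a non-interleaved 1F1B schedule."""
    # Earlier stages run more warmup forwards before their first backward.
    warmup = min(num_stages - rank - 1, num_microbatches)
    steady = num_microbatches - warmup

    ops = [f"F{i}" for i in range(warmup)]
    # Steady state: one forward followed by one backward.
    for i in range(steady):
        ops.append(f"F{warmup + i}")
        ops.append(f"B{i}")
    # Cooldown: drain the backward passes that are still pending.
    ops.extend(f"B{i}" for i in range(steady, num_microbatches))
    return ops


if __name__ == "__main__":
    print("AFAB:", afab_schedule(4))
    for rank in range(3):
        print(f"1F1B rank {rank}:", one_f_one_b_schedule(rank, 3, 4))
```

With 3 stages and 4 microbatches, the first stage under this 1F1B ordering never has more than 3 forward passes awaiting their backward pass, whereas AFAB keeps all 4 pending before any backward pass starts.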

3outeille / pipeline_parallel.py

Last active April 28, 2025 00:54

Self contained example of how pipeline parallel works (AFAB and 1F1B) in 200 LOC

	#VERBOSE=0 torchrun --nproc_per_node 3 self_contained_pp_LOC.py
	import os, random, numpy as np, torch, torch.nn as nn, torch.distributed as dist, torch.nn.functional as F
	from torch.optim import AdamW
	from torch.utils.data import DataLoader, DistributedSampler
	from datasets import load_dataset
	from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

	STEP, local_rank, world_size, verbose = 0, int(os.environ["LOCAL_RANK"]), int(os.environ["WORLD_SIZE"]), os.environ.get("VERBOSE", "0") == "1"

	def set_all_seed(seed):

hanxiao / testRegex.js

Last active May 6, 2025 13:38

Regex for chunking by using all semantic cues

	// Updated: Aug. 20, 2024
	// Run: node testRegex.js whatever.txt
	// Live demo: https://jina.ai/tokenizer
	// LICENSE: Apache-2.0 (https://www.apache.org/licenses/LICENSE-2.0)
	// COPYRIGHT: Jina AI
	const fs = require('fs');
	const util = require('util');

	// Define variables for magic numbers
	const MAX_HEADING_LENGTH = 7;

awni / mflux_steps.md

Last active February 24, 2025 14:33

git clone git@github.com:filipstrand/mflux.git
cd mflux && pip install -r requirements.txt

Name this anything, maybe flux.py. Make sure to update the two paths marked below.