Goals: add links that give reasonable, solid explanations of how stuff works. No hype and no vendor content if possible. Practical first-hand accounts of models in prod are eagerly sought.
# [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)

```python
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader, Dataset
from torch.nn import functional as F
from einops import rearrange, repeat
from tqdm import tqdm
```
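The imports above set up a training script; the core of the paper is the selective state-space (S6) recurrence. Below is a minimal sketch of that recurrence run as a plain sequential loop rather than the hardware-aware parallel scan the paper actually uses; the function name `selective_scan`, the tensor shapes, and the toy sizes are illustrative assumptions, not the paper's reference code.

```python
import torch

def selective_scan(x, A, B, C, delta):
    """Sequential (unoptimized) selective SSM recurrence.

    x:     (batch, length, d_model)   input sequence
    A:     (d_model, d_state)         state transition, diagonal over d_state
    B, C:  (batch, length, d_state)   input-dependent projections
    delta: (batch, length, d_model)   input-dependent step sizes (> 0)
    """
    batch, length, d_model = x.shape
    h = torch.zeros(batch, d_model, A.shape[1], device=x.device)
    ys = []
    for t in range(length):
        # Discretize A (zero-order hold) and B with the per-token step size delta.
        dA = torch.exp(delta[:, t].unsqueeze(-1) * A)          # (batch, d_model, d_state)
        dB = delta[:, t].unsqueeze(-1) * B[:, t].unsqueeze(1)  # (batch, d_model, d_state)
        h = dA * h + dB * x[:, t].unsqueeze(-1)                # recurrent state update
        ys.append((h * C[:, t].unsqueeze(1)).sum(-1))          # project state back to d_model
    return torch.stack(ys, dim=1)                              # (batch, length, d_model)

# Toy usage with made-up sizes; negative A keeps the recurrence stable.
x = torch.randn(2, 5, 4)
A = -torch.rand(4, 8)
B, C = torch.randn(2, 5, 8), torch.randn(2, 5, 8)
delta = torch.rand(2, 5, 4) + 0.1
y = selective_scan(x, A, B, C, delta)  # shape (2, 5, 4)
```

"Selective" here just means B, C, and delta are computed from the input at each position, so the state update can retain or ignore individual tokens; the paper's additional contribution is computing this same recurrence with a parallel, memory-efficient scan.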
```python
import torch
from torch import LongTensor
from torch.nn import Embedding, LSTM
from torch.autograd import Variable  # legacy import; modern PyTorch tensors no longer need Variable wrapping
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

# We want to run an LSTM on a batch of 3 character sequences: ['long_str', 'tiny', 'medium']
#
# Step 1: Construct the vocabulary
# Step 2: Load indexed data (a list of instances, where each instance is a list of character indices)
# (a sketch of these two steps, plus the pack/unpack calls, follows this block)
```
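A minimal sketch of the two steps above, plus the usual pad, embed, pack, LSTM, unpack pattern; the vocabulary layout, embedding size 8, and hidden size 16 are arbitrary choices for illustration.

```python
import torch
from torch.nn import Embedding, LSTM
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

seqs = ['long_str', 'tiny', 'medium']

# Step 1: construct the vocabulary (index 0 reserved for padding)
vocab = ['<pad>'] + sorted({ch for s in seqs for ch in s})
char2idx = {ch: i for i, ch in enumerate(vocab)}

# Step 2: indexed data (one list of character indices per sequence)
indexed = [[char2idx[ch] for ch in s] for s in seqs]
lengths = torch.tensor([len(s) for s in indexed])  # [8, 4, 6]

# Pad to the longest sequence so the batch is rectangular
padded = torch.zeros(len(indexed), int(lengths.max()), dtype=torch.long)
for i, seq in enumerate(indexed):
    padded[i, :len(seq)] = torch.tensor(seq)

embed = Embedding(num_embeddings=len(vocab), embedding_dim=8, padding_idx=0)
lstm = LSTM(input_size=8, hidden_size=16, batch_first=True)

# Pack so the LSTM skips padding positions, then unpack the output
packed = pack_padded_sequence(embed(padded), lengths, batch_first=True, enforce_sorted=False)
packed_out, (h_n, c_n) = lstm(packed)
output, out_lengths = pad_packed_sequence(packed_out, batch_first=True)  # output: (3, 8, 16)
```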
```go
package server

import (
	"bufio"
	"fmt"
	"log"
	"net"
)

// Server ...
```
| """ | |
| Minimal character-level Vanilla RNN model. Written by Andrej Karpathy (@karpathy) | |
| BSD License | |
| """ | |
| import numpy as np | |
| # data I/O | |
| data = open('input.txt', 'r').read() # should be simple plain text file | |
| chars = list(set(data)) | |
| data_size, vocab_size = len(data), len(chars) |
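For context, a sketch of what this setup typically leads into: character/index lookup tables and one forward step of the vanilla RNN, h_t = tanh(Wxh x_t + Whh h_{t-1} + bh). It continues from the variables defined above (`np`, `chars`, `vocab_size`); the hidden size and initialization scale are illustrative, not quoted from the gist.

```python
char_to_ix = {ch: i for i, ch in enumerate(chars)}
ix_to_char = {i: ch for i, ch in enumerate(chars)}

hidden_size = 100                                        # illustrative choice
Wxh = np.random.randn(hidden_size, vocab_size) * 0.01    # input -> hidden
Whh = np.random.randn(hidden_size, hidden_size) * 0.01   # hidden -> hidden
Why = np.random.randn(vocab_size, hidden_size) * 0.01    # hidden -> output
bh, by = np.zeros((hidden_size, 1)), np.zeros((vocab_size, 1))

def rnn_step(ch_index, h_prev):
    """One step: one-hot input -> new hidden state -> distribution over next char."""
    x = np.zeros((vocab_size, 1))
    x[ch_index] = 1
    h = np.tanh(Wxh @ x + Whh @ h_prev + bh)
    y = Why @ h + by
    p = np.exp(y) / np.sum(np.exp(y))                    # softmax over the vocabulary
    return h, p
```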