Skip to content

Instantly share code, notes, and snippets.

View thewh1teagle's full-sized avatar
💭
coding

thewh1teagle

💭
coding
  • localhost
  • The martian
View GitHub Profile
@thewh1teagle
thewh1teagle / 204k_tokens.txt
Created September 3, 2025 00:34
204k tokens
This file has been truncated, but you can view the full file.
Lorem ipsum dolor sit amet consectetur adipiscing elit. Quisque faucibus ex sapien vitae pellentesque sem placerat. In id cursus mi pretium tellus duis convallis. Tempus leo eu aenean sed diam urna tempor. Pulvinar vivamus fringilla lacus nec metus bibendum egestas. Iaculis massa nisl malesuada lacinia integer nunc posuere. Ut hendrerit semper vel class aptent taciti sociosqu. Ad litora torquent per conubia nostra inceptos himenaeos.
Lorem ipsum dolor sit amet consectetur adipiscing elit. Quisque faucibus ex sapien vitae pellentesque sem placerat. In id cursus mi pretium tellus duis convallis. Tempus leo eu aenean sed diam urna tempor. Pulvinar vivamus fringilla lacus nec metus bibendum egestas. Iaculis massa nisl malesuada lacinia integer nunc posuere. Ut hendrerit semper vel class aptent taciti sociosqu. Ad litora torquent per conubia nostra inceptos himenaeos.
@thewh1teagle
thewh1teagle / mms.py
Last active July 3, 2025 02:41
How to run old saspeech Hebrew model (olddd)
"""
uv venv -p3.10
uv pip insatll numpy==1.26.4 soundfile transformers
uv pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
"""
from transformers import VitsModel, AutoTokenizer
import torch
import soundfile as sf
@thewh1teagle
thewh1teagle / hebrew_encoder.py
Last active March 24, 2025 01:09
Hebrew encoder decoder for diacritics model
"""
Hebrew diacritics encoding and decoding
"""
import unicodedata
import re
# Deduplicate duplicate phonetic diacritics
NIQQUD_DEDUPLICATE = {
"""
wget https://huggingface.co/datasets/thewh1teagle/hebright/resolve/main/knesset.txt.zip
unzip knesset.txt.zip
uv run main.py
"""
from pathlib import Path
import time
import requests
@thewh1teagle
thewh1teagle / README.md
Created February 1, 2025 15:08
espeak-ng languages

{'af': 'Afrikaans', 'am': 'Amharic', 'an': 'Aragonese', 'ar': 'Arabic', 'as': 'Assamese', 'az': 'Azerbaijani', 'ba': 'Bashkir', 'be': 'Belarusian', 'bg': 'Bulgarian', 'bn': 'Bengali', 'bpy': 'Bishnupriya Manipuri', 'bs': 'Bosnian', 'ca': 'Catalan', 'ca-ba': 'Catalan (Balearic)', 'ca-nw': 'Catalan (North-western)', 'ca-va': 'Catalan (Valencian)', 'chr-US-Qaaa-x-west': 'Cherokee ', 'cmn': 'Chinese (Mandarin, latin as English)', 'cmn-latn-pinyin': 'Chinese (Mandarin, latin as Pinyin)', 'cs': 'Czech', 'cv': 'Chuvash', 'cy': 'Welsh', 'da': 'Danish', 'de': 'German', 'el': 'Greek', 'en-029': 'English (Caribbean)', 'en-gb': 'English (Great Britain)', 'en-gb-scotland': 'English (Scotland)', 'en-gb-x-gbclan': 'English (Lancaster)', 'en-gb-x-gbcwmd': 'English (West Midlands)', 'en-gb-x-rp': 'English (Received Pronunciation)', 'en-shaw': 'English (Shavian alphabet)', 'en-us': 'English (America)', 'en-us-nyc': 'English (America, New York City)', 'eo': 'Esperanto', 'es': 'Spanish (Spain)', 'es-419': 'Spanish (Latin America

Cross compile espeak-ng for aarch64 Linux

# Cross compile for aarch64 Linux


# Alsa
git clone https://github.com/alsa-project/alsa-lib
cd alsa-lib
@thewh1teagle
thewh1teagle / remove_question_mark.py
Last active January 3, 2025 01:01
Fix subtitles in Hebrew and embed subtitles
import sys
with open(sys.argv[1], 'r', encoding='utf-8') as file:
content = file.read()
lines = []
for line in content.splitlines():
if line.startswith('?') or line.startswith('<i>?'):
line = line.replace('?', '')
line = line.replace('<i>?', '<i>')
@thewh1teagle
thewh1teagle / windows-keys.md
Created December 20, 2024 14:53 — forked from rvrsh3ll/windows-keys.md
Windows Product Keys

NOTE

These are NOT product / license keys that are valid for Windows activation.
These keys only select the edition of Windows to install during setup, but they do not activate or license the installation.

Index

@thewh1teagle
thewh1teagle / script.js
Last active December 19, 2024 18:27
Supress @everyone and @here mentions across all Discord servers.
/**
Supress @everyone and @here mentions across all Discord servers.
1. Open https://discord.com
2. Extract the token from dev tools -> Local Storage -> token
3. Run
*/
const token = "your token";
const headers = {
authorization: token,
@thewh1teagle
thewh1teagle / readme.md
Last active December 11, 2024 13:19
compile whisper.cpp x86-64 on macos m1
arch -x86_64 cmake -G Ninja -B build -DGGML_METAL=1 .
arch -x86_64 cmake --build build
wget https://github.com/thewh1teagle/vibe/raw/main/samples/short.wav
arch -x86_64 ./main --no-prints -m '/Users/user/Library/Application Support/github.com.thewh1teagle.vibe/ggml-medium-q8_0.bin' -f short.wav