Christopher Akiki cakiki

Thank you to tiny corp for pointing out some problems running BERT training with Tinygrad on AMD GPUs in this Tweet. We had a few engineers at AMD take a look at the problem and they were quickly able to reproduce it.

What they found was an issue related to CWSR (compute wave save restore), which is a mechanism that allows our driver and firmware to preempt and reschedule long-running compute waves on our GPUs. The GFXv11 GPU line requires a workaround to set COMPUTE_PGM_RSRC1.PRIV=1 when dispatching a compute kernel. Normally this is handled by the AQL DISPATCH packet. However, since the Tinygrad implementation leverages a custom runtime, it requires this workaround in its PM4-based dispatch. This patch is specific to GFXv11 GPUs. Other GPUs do not require it and should not use this workaround. The following KFDTest patch can be used as a reference: https://github.com/ROCm/ROCT-Thunk-Interface/commit/507637ed5b82197eecbf483cdc1234939766549a
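As a rough illustration of the workaround described above, the sketch below sets a PRIV bit in a COMPUTE_PGM_RSRC1 value only for GFXv11 targets and leaves every other target untouched. The bit position (bit 20) is an assumption taken from the publicly documented AMDGPU kernel descriptor layout and should be verified against the register headers for your ASIC. The helper name and the "gfx11" target-name check are hypothetical; they are not part of Tinygrad or of the KFDTest patch.

```python
# Assumption: PRIV is bit 20 of COMPUTE_PGM_RSRC1, as in the LLVM AMDGPU
# kernel descriptor documentation. Verify against your register headers.
COMPUTE_PGM_RSRC1_PRIV_SHIFT = 20
COMPUTE_PGM_RSRC1_PRIV_MASK = 1 << COMPUTE_PGM_RSRC1_PRIV_SHIFT


def apply_cwsr_workaround(rsrc1: int, gfx_target: str) -> int:
    """Return the RSRC1 value to program in a PM4-based dispatch.

    Hypothetical helper: sets PRIV=1 only for GFXv11 targets (names such as
    "gfx1100"); any other target gets its value back unchanged, since the
    workaround should not be applied there.
    """
    if gfx_target.startswith("gfx11"):
        return rsrc1 | COMPUTE_PGM_RSRC1_PRIV_MASK
    return rsrc1
```

A custom runtime would call something like this once per dispatch, just before writing the register value into its PM4 packet.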

While inv

TPU VM Cheatsheet

This TPU VM cheatsheet uses and was tested with the following library versions:

Library	Version
JAX	0.3.25
FLAX	0.6.4
Datasets	2.10.1
Transformers	4.27.1
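To check whether an environment matches the versions in the table, something like the following sketch could be used. It relies only on the standard library's importlib.metadata; the lowercase distribution names are an assumption about how the packages are published, and the helper names are made up for this example.

```python
from importlib import metadata

# Versions from the table above, keyed by distribution name
# (the lowercase names are an assumption).
EXPECTED = {
    "jax": "0.3.25",
    "flax": "0.6.4",
    "datasets": "2.10.1",
    "transformers": "4.27.1",
}


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def mismatches(expected, installed):
    """Return {name: (expected, installed)} for every entry that differs."""
    return {
        name: (want, installed.get(name))
        for name, want in expected.items()
        if installed.get(name) != want
    }


if __name__ == "__main__":
    report = mismatches(EXPECTED, installed_versions(EXPECTED))
    for name, (want, have) in report.items():
        print(f"{name}: expected {want}, found {have}")
```

Running the script prints one line per library whose installed version differs from the table, and nothing if everything matches.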

Resize Ghidra for High DPI screens

If you run Ghidra on a high DPI screen, you will probably find the GUI scaled down so small as to be almost unusable.

There is a setting that you can adjust to scale the Ghidra GUI:

In $GHIDRA_ROOT/support there is a file named launch.properties. It contains the following configuration key:

VMARGS_LINUX=-Dsun.java2d.uiScale=1
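If you would rather script the change than edit the file by hand, a minimal sketch could look like this. It only rewrites the value on the VMARGS_LINUX uiScale line shown above and leaves every other line alone. The function name is made up for this example, and the scale value to use (for example 2) depends on your display.

```python
import re


def set_ui_scale(properties_text: str, scale: float, key: str = "VMARGS_LINUX") -> str:
    """Return the properties text with the uiScale value for `key` replaced.

    Only lines of the form `<key>=-Dsun.java2d.uiScale=<value>` are changed.
    """
    pattern = re.compile(
        rf"^({re.escape(key)}=-Dsun\.java2d\.uiScale=)\S+$", re.MULTILINE
    )
    # A function replacement avoids the prefix group and the digits of
    # `scale` being misread together as a backreference.
    return pattern.sub(lambda m: f"{m.group(1)}{scale:g}", properties_text)


# Usage sketch (the path follows the $GHIDRA_ROOT/support location above):
#   from pathlib import Path
#   props = Path(ghidra_root) / "support" / "launch.properties"
#   props.write_text(set_ui_scale(props.read_text(), 2))
```

Keeping the rewrite as a pure function on text makes it easy to try on a copy of the file before touching the real one.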

What the BookCorpus?

So in the midst of this era of Sesame Street characters and robots-transforming-into-automobiles "contextualized" language models, there is this "Toronto Book Corpus" that points to this kinda recently influential paper:

Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. "Aligning books and movies: Towards story-like visual explanations by watching movies and reading books." In Proceedings of the IEEE international conference on computer vision, pp. 19-27.

Why do I even care, there's no translations there?

Some might know my personal pet peeve about collecting translation datasets, but this BookCorpus has no translations, so why do I even care about it?

	from huggingface_hub.hf_api import (  # type: ignore
	    REPO_TYPES,
	    REPO_TYPES_URL_PREFIXES,
	    HfApi,
	    _raise_for_status,
	)

	def update_repo_settings(
	    hf_api: HfApi,
	    repo_id: str,

	#
	# Author: Cody Buntain
	# Date: 19 March 2020
	#
	# Description:
	# This code is an example of using the agreement package
	#. in NLTK to calculate a number of agreement metrics on
	#. a set of annotations. Currently, this code will work
	#. with two annotators and multiple labels.
	#. You can use Fleiss's Kappa or Krippendorff's Alpha if you

	""" Implementation of Okapi BM25 with sklearn's TfidfVectorizer
	Distributed as CC-0 (https://creativecommons.org/publicdomain/zero/1.0/)
	"""

	import numpy as np
	from sklearn.feature_extraction.text import TfidfVectorizer
	from scipy import sparse


	class BM25(object):