Helge Munk Jacobsen elgehelge

peterroelants / mnist_estimator.py

Last active February 14, 2024 11:26

Example using TensorFlow Estimator, Experiment & Dataset on MNIST data.

	"""Script to illustrate usage of tf.estimator.Estimator in TF v1.3"""
	import tensorflow as tf

	from tensorflow.examples.tutorials.mnist import input_data as mnist_data
	from tensorflow.contrib import slim
	from tensorflow.contrib.learn import ModeKeys
	from tensorflow.contrib.learn import learn_runner


	# Show debugging output

karpathy / pg-pong.py

Created May 30, 2016 22:50

Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels

	""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
	import numpy as np
	import cPickle as pickle
	import gym

	# hyperparameters
	H = 200 # number of hidden layer neurons
	batch_size = 10 # every how many episodes to do a param update?
	learning_rate = 1e-4
	gamma = 0.99 # discount factor for reward