Mike MikeOuimet

Researcher interested in the algorithms for robotics and artificial intelligence

MikeOuimet / PG.py

Created September 25, 2016 20:09

Vanilla policy gradient with tensorflow

	import numpy as np
	import gym
	import tensorflow as tf
	import matplotlib.pyplot as plt

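	def weight_variable(shape):
	    # Initialize weights from a truncated normal (stddev 0.1) to break symmetry.
	    # TF 1.x API; in TensorFlow 2 the equivalent is tf.random.truncated_normal.
	    initial = tf.truncated_normal(shape, stddev=0.1)
	    return tf.Variable(initial)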

	def bias_variable(shape):