Neel Mishra NeelMishra

🎯

locked in

Applied Scientist II @ Microsoft · Building GNNs, knowledge graphs, GenAI systems & scalable ML pipelines · Research in optimization, GANs & AI safety

3 followers · 3 following

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

1 file
0 forks
0 comments
0 stars

NeelMishra / gist:9e47aebcdef11c4fed0920de3b89e170

Created January 15, 2025 15:09

Monte Carlo Cart Pole Balancing with Epsilon Greedy Policy Improvement

	import gymnasium as gym
	import numpy as np

	class MonteCarloCartPole:
	def __init__(self, n_bins=10, learning_rate=0.1, gamma=0.99, epsilon=0.1):
	self.env = gym.make('CartPole-v1')
	self.n_bins = n_bins
	self.learning_rate = learning_rate
	self.gamma = gamma
	self.epsilon = epsilon