Lynx Roundup, December 15th 2020

Lynx Roundup, December 15th 2020

Reinforcement learning with PyTorch! Training deep control policies for the real world! Considerations in building production ML!

Matthew Alhonte
Matthew Alhonte
A Python Packaging Carol
Limits, schlimits: It’s time to rethink how we teach calculus
Ars chats with math teacher Ben Orlin about his book Change Is the Only Constant.
Activating a Conda environment in your Dockerfile
The Conda packaging tool implements environments, that enable different applications to have different libraries installed. So when you’re building a Docker image for a Conda-based application, you’ll need to activate a Conda environment. Unfortunately, activating Conda environments is a bit complex…
Learning Reinforcement Learning: REINFORCE with PyTorch!
The REINFORCE algorithm is one of the first policy gradient algorithms in reinforcement learning and a great jumping off point to get into more advanced approaches. Policy gradients are different…
Considerations in Building Production ML - Toucan AI Blog
Training deep control policies for the real world - Microsoft Research
Humans subconsciously use perception-action loops to do just about everything, from walking down a crowded sidewalk to scoring a goal in a community soccer league. Perception-action loops—using sensory input to decide on appropriate action in a continuous real time loop —are at the heart of autonomo…
Roundup

Matthew Alhonte

Supervillain in somebody's action hero movie. Experienced a radioactive freak accident at a young age which rendered him part-snake and strangely adept at Python.