Applied MAB Algorithms For Online Live-Learning Systems

Learn how to build smart live-learning MAB agents to improve the click-through rate of ads on the web. Read more.

Edward

No ratings yet

Course Skill Level

Intermediate

Time Estimate

4h 28m

Access all courses in our library for only $9/month with All Access Pass

Get Started with All Access Pass Buy Only This Course

About This Course
Curriculum

About This Course

Who this course is for:

People who already know about multi-armed bandit algorithms and want to transition from simulations into building real applications
Anyone who wants to learn how to design and implement an architecture for live-learning systems.
Engineers who want to learn how reinforcement learning can be used to optimize click-through rates of adverts.
Students of my previous course “Create Multi-Armed Bandit Algorithms In Python” who want to apply their knowledge to real-life situations.

What you’ll learn:

Designing the architecture of live-learning systems that uses multi-armed bandit algorithms
Using Flask to implement MAB agents to optimize click-through rate of advertisements
Implementations of Epsilon-Greedy, Softmax Exploration, and UCB in a live-learning system
Transitioning from simulations of MAB problems into real applications
General best-practices in Python software development
General backend development with Flask
Automation of database migrations and seeding

Requirements:

Basic Object Oriented Programming in Python.
Basic mathematics (High school algebra is enough)
Have taken the course “Create Multi-Armed Bandit Algorithms In Python”

This course is a sequel to my previous course titled “Create Multi-Armed Bandit Algorithms In Python” and the goal is to teach how you can readily apply your knowledge on MAB algorithms to build and deploy smarts agents on the web that automatically learns how to improve the click-through rate of advertisements.

Every video in this course is hands-on, and collectively, they equip you with expert knowledge on how to build web applications using Flask, and also how to integrate MAB agents that adjust their operations to improve CTR of online ads. By the end of this course, you will know precisely how to implement live-learning agents into web applications to optimize key business goals.

It is one thing knowing how to use simulations to validate the performance of MAB agents. However, transitioning from simulations into their real-world applications require some key skills that are taught in this course. For example, you’ll need to know how to do the following:

store and retrieve information from a database which will be used by the agent to choose actions.
translate user interactions (such as clicks) into rewards which the agent can use as evaluative feedback information.
adjusting the agent’s knowledge to reflect the true user behaviors that have been observed through interaction.
implement various MAB algorithms with an API that makes it easier to switch one algorithm for the other.
design and implement a good software architecture for online live-learning systems.

I highly recommend that you complete my previous course titled “Create Multi-Armed Bandit Algorithms In Python” before taking this course since it’s a follow-up. However, if you already know how to implement various MAB algorithms, then you can jump right into this course and succeed without struggling.

This course is intentionally taught in a very simple way. It doesn’t include the use of advanced mathematics and all you need to know is OOP in Python and simple high school algebra.

Thanks for taking this course! I can’t wait to see what you will build with the knowledge shared here!

Our Promise to You

By the end of this course, you will have learned how to use multi-armed bandit algorithms.

10 Day Money Back Guarantee. If you are unsatisfied for any reason, simply contact us and we’ll give you a full refund. No questions asked.

Get started today!

Course Curriculum

Section 1 - Introduction
	Environment Setup	00:00:00
	Recreating Virtual Environments From Requirements	00:00:00
	Structuring The Codebase For The Project	00:00:00
Section 2 - Introduction To Flask
	Creating A Basic Server	00:00:00
	Running The Server	00:00:00
	Rendering HTML Templates	00:00:00
	Jinja – Variable Substitutions	00:00:00
	Jinja – Looping Over Collections	00:00:00
	Jinja – Template Inheritance 1	00:00:00
	Jinja – Template Inheritance 2	00:00:00
	Serving Static Files	00:00:00
	Jinja – Creating And Using Variables	00:00:00
	Defining Database Models	00:00:00
	Setting Up Database Migrations	00:00:00
	Running Database Migrations	00:00:00
	Interacting With The Database In The Shell	00:00:00
	Modifying The Structure Of Tables	00:00:00
Section 3 - Architecture Of A Live-Learning System
	Layout Of The UI	00:00:00
	Inserting Adverts Into The Database	00:00:00
	Filtering Adverts By Tag	00:00:00
	Sequence Diagram Of A Live-Learning System	00:00:00
	Creating The API Of The Agent Interface	00:00:00
	Automation Of Seeding And Clearing Of Database	00:00:00
	Seeding And Clearing The Database In The Terminal	00:00:00
	Rendering Product Listings	00:00:00
	Shuffling Product Listings	00:00:00
Section 4 - Building The Smart Agents
	Fetching Data On Adverts Before Each Request	00:00:00
	Installing Numpy	00:00:00
	Implementing The Epsilon-Greedy Agent	00:00:00
	Using The Epsilon-Greedy Agent	00:00:00
	Making The Agent Choose The Best Adverts	00:00:00
	Rewarding Agent Through User Interactions	00:00:00
	Adding Decay Rate To The Epsilon-Greedy Agent	00:00:00
	Implementing The Softmax Exploration Agent	00:00:00
	Implementing The Upper Confidence Bounds Agent	00:00:00