Lerner rl agents

原文链接

The Partner Device Ecosystem organization in Netflix is responsible for ensuring successful integration and testing of the Netflix application on all partner devices. 居然还保证testing

with the Netflix philosophy of using data to drive solutions for intriguing problems, we proceeded by leveraging machine learning.

Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous Integration

it allows us to start with very little input data and let the agent explore and exploit the patterns it learns in the process of SDK integration and regression testing

A process of getting recommendations and rewarding the agent using APIs consists of 4 steps:

Out of all available test cases for a particular job — form a request that can be interpreted by Lerner. This involves aggregation of historical results and additional features.
Lerner returns a recommendation identified with a unique episode id.
A CI system can execute the recommendation and submit the execution results to Lerner based on the episode id.
Call an API to assign a reward based on the agent id and episode id.

关于训练的并发性？