AWS DeepRacer Worksheet

Reward Function

The reward is calculated from a number of conditional points, all of which are configurable. The sum of all conditional points forms the final score. Each conditional point also has a sensitivity parameter that controls the shape of its reward curve.

The points are:

Reward the agent for being on, or close enough (configurable) to, the racing line.
Reward the agent for taking an action that brings the car closer to the next waypoint on the racing line.
Reward the agent for a high progress-to-steps ratio, in other words for making good progress per step.
Reward the agent for moving at high speed (regardless of track conditions).
Reward the agent for driving straight rather than turning (regardless of track conditions).

Sample reward weights

REWARD_WEIGHT_PROG_STEP = 30
REWARD_WEIGHT_MAX_SPEED = 25
REWARD_WEIGHT_MIN_STEER = 20
REWARD_WEIGHT_DIR_STEER = 15
REWARD_WEIGHT_ON_TRACK = 10
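
As a rough illustration of how such weights could combine into a final score, here is a minimal sketch. It assumes each criterion yields a raw score in the range 0 to 1 and that the sensitivity parameter acts as an exponent on that score. Both the power curve and the names `shaped` and `total_reward` are assumptions made for this example, not taken from the repository.

```python
REWARD_WEIGHT_PROG_STEP = 30
REWARD_WEIGHT_MAX_SPEED = 25
REWARD_WEIGHT_MIN_STEER = 20
REWARD_WEIGHT_DIR_STEER = 15
REWARD_WEIGHT_ON_TRACK = 10

# Hypothetical mapping from criterion name to its weight.
WEIGHTS = {
    "prog_step": REWARD_WEIGHT_PROG_STEP,
    "max_speed": REWARD_WEIGHT_MAX_SPEED,
    "min_steer": REWARD_WEIGHT_MIN_STEER,
    "dir_steer": REWARD_WEIGHT_DIR_STEER,
    "on_track": REWARD_WEIGHT_ON_TRACK,
}


def shaped(score, sensitivity=1.0):
    """Clamp a raw score to [0, 1] and bend it with a power curve (assumed form)."""
    clamped = max(0.0, min(1.0, score))
    return clamped ** sensitivity


def total_reward(scores, sensitivity=1.0):
    """Sum every criterion's shaped score multiplied by its weight."""
    return sum(
        weight * shaped(scores.get(name, 0.0), sensitivity)
        for name, weight in WEIGHTS.items()
    )
```

With these sample weights a perfect score on every criterion gives 100, so each weight can be read as that criterion's share of the maximum reward. A sensitivity above 1 makes the curve stricter: a raw score of 0.5 with sensitivity 2 contributes only a quarter of the weight.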

In addition to the contributing criteria points, there are four types of penalty indicators that reduce the total calculated score. The effect of each penalty indicator is configurable in percentage terms, except for the "Wheels Off Track" penalty, which has a fixed weight of 100% and decays over a configurable number of steps.

The penalties are:

Penalty, as a percentage of the overall score, for being off or far from the racing line.
Penalty for steering in a direction that takes the car further away from the next waypoint on the racing line.
Penalty for taking a sharp turn, i.e. a high steering angle (regardless of track conditions).
Penalty for having at least one wheel off the track.

Sample penalty weights

MAX_STEPS_TO_DECAY_PENALTY = 5      # Steps over which the wheels-off-track penalty decays; zero or below disables it

TOTAL_PENALTY_ON_OFF_TRACK = 0.999999  # Maximum penalty, as a fraction of total reward, for being off track
TOTAL_PENALTY_ON_OFF_DIR_STEER = 0.50  # Maximum penalty, as a fraction of total reward, for off-directional steering
TOTAL_PENALTY_ON_HIGH_STEERING = 0.25  # Maximum penalty, as a fraction of total reward, for high steering
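
One way these settings might be applied is sketched below. The example assumes that each penalty scales the reward multiplicatively by its maximum fraction times a severity in the range 0 to 1, and that the wheels-off-track penalty decays linearly. The description above only says that it decays over a configurable number of steps, so the linear shape, the multiplicative combination, and the function and argument names are all assumptions made for this example.

```python
MAX_STEPS_TO_DECAY_PENALTY = 5
TOTAL_PENALTY_ON_OFF_TRACK = 0.999999
TOTAL_PENALTY_ON_OFF_DIR_STEER = 0.50
TOTAL_PENALTY_ON_HIGH_STEERING = 0.25


def wheels_off_penalty(steps_since_off_track):
    """Penalty fraction for wheels off track: 1.0 on the offending step,
    falling linearly to 0.0 after MAX_STEPS_TO_DECAY_PENALTY steps (assumed shape)."""
    if MAX_STEPS_TO_DECAY_PENALTY <= 0 or steps_since_off_track is None:
        return 0.0  # penalty disabled, or the car has not left the track
    remaining = MAX_STEPS_TO_DECAY_PENALTY - steps_since_off_track
    return max(0.0, min(1.0, remaining / MAX_STEPS_TO_DECAY_PENALTY))


def apply_penalties(reward, off_line, off_dir_steer, high_steering,
                    steps_since_off_track=None):
    """Scale the reward down; each severity argument is expected in [0, 1]."""
    reward *= 1.0 - TOTAL_PENALTY_ON_OFF_TRACK * off_line
    reward *= 1.0 - TOTAL_PENALTY_ON_OFF_DIR_STEER * off_dir_steer
    reward *= 1.0 - TOTAL_PENALTY_ON_HIGH_STEERING * high_steering
    reward *= 1.0 - wheels_off_penalty(steps_since_off_track)
    return reward
```

Under these assumptions, a reward of 100 with the worst possible off-directional steering and no other penalties drops to 50, and a wheel leaving the track removes the whole reward on that step, then progressively less over the following five steps.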

Racing line calculation and visualisation

The objective is to smooth the centre line of a given track. This is achieved by minimising the distance between each pair of close waypoints. The selected waypoints are not always nearest neighbours: other waypoints may lie between a selected pair, but the pair is still close enough to be used. This spacing is controlled by the skipp_step parameter, which speeds up the algorithm. The optimisation takes the track's inner and outer borders into account so that the newly calculated waypoints do not touch the borders, cross them, or lie too close to them. This margin is controlled by the max_offset parameter.

This algorithm does not search for the optimal cornering arc that would allow racing at the maximum possible speed for a given steering angle.
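
The general idea of smoothing a closed centre line can be sketched as follows. This is a simplified stand-in, not the repository's implementation: each waypoint is repeatedly pulled toward the midpoint of the two waypoints `skipp_step` positions before and after it, and instead of checking the real inner and outer borders it limits each waypoint to a circle of radius `max_offset` around its original centre-line position. The function names and the `iterations` parameter are hypothetical.

```python
import math


def smooth_line(center, skipp_step=1, max_offset=0.3, iterations=100):
    """Smooth a closed loop of (x, y) waypoints.

    Simplification: the border check is replaced by a maximum distance
    of max_offset from the original centre-line waypoint.
    """
    line = [tuple(p) for p in center]
    n = len(line)
    for _ in range(iterations):
        for i in range(n):
            prev_pt = line[(i - skipp_step) % n]
            next_pt = line[(i + skipp_step) % n]
            # Midpoint of the two selected neighbours shortens the local path.
            tx = (prev_pt[0] + next_pt[0]) / 2
            ty = (prev_pt[1] + next_pt[1]) / 2
            dx, dy = tx - center[i][0], ty - center[i][1]
            dist = math.hypot(dx, dy)
            if dist > max_offset:
                # Pull the target back onto the allowed circle.
                scale = max_offset / dist
                tx = center[i][0] + dx * scale
                ty = center[i][1] + dy * scale
            line[i] = (tx, ty)
    return line


def loop_length(points):
    """Total length of the closed polyline through the points."""
    n = len(points)
    return sum(math.dist(points[i], points[(i + 1) % n]) for i in range(n))


# A square track sampled at its corners and edge midpoints.
square = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2), (1, 2), (0, 2), (0, 1)]
smoothed = smooth_line(square, skipp_step=1, max_offset=0.3)
```

On the square example the corners are cut inward by at most `max_offset`, so the smoothed loop is shorter than the original centre line while every waypoint stays within the allowed distance of its starting position.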

Sample Results

Track Name	Original Track	Racing Line v1	Racing Line v2
Vivalas Loop
Vivalas Speedway
Expedition Loop
Expedition Super Loop
Playa Super Raceway
Playa Raceway
Hot Rod Super Speedway
Hot Rod Speedway
Baja Highway
Baja Turnpike
Kuei Raceway
Kuei Super Raceway
Cosmic Circuit
Cosmic Loop
Lars Circuit
Lars Loop
Po-Chun Speedway
Po-Chun Super Speedway
Baadal Track

The full list of available track data is here: https://github.com/dp770/aws_deepracer_worksheet/tree/main/tracks

Other Useful Repositories to look at

License

MIT License

Copyright (c) 2021 AWS DeepRacer Worksheet contributors community

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
