Metal Game Performance Optimization
Metal Game Performance Optimization
WWDC 2018
Game Performance Template
data:image/s3,"s3://crabby-images/ec8e1/ec8e1e58f1c301f7d1663c6a5db415323bc093cf" alt=""
data:image/s3,"s3://crabby-images/a3e9e/a3e9ea83d5667b6de212af0e6c817c7a2dadcc8a" alt=""
data:image/s3,"s3://crabby-images/34166/34166cdf1a6824cc722b923738291f2562b8c66d" alt=""
Thread State View (new)
data:image/s3,"s3://crabby-images/9a979/9a979585e1b13c81952c368f1e9e990dba419262" alt=""
Frame Pacing
data:image/s3,"s3://crabby-images/6c1bc/6c1bc2e010d887f8b1c4fa02335c87de4facda46" alt=""
Micro Stuttering
Inconsistent frame rate
- Frame time higher than display refresh interval
- Game logic timing errors
data:image/s3,"s3://crabby-images/1ae2e/1ae2eaed5e72e2f75c17c7b3f0aa5a5461929fc8" alt=""
Problem
- Do not use sleep() to pace frames!
data:image/s3,"s3://crabby-images/599fd/599fdecd0408e085ecb5e9b3fea660189186237d" alt=""
Best Practice
- Target explicit frame rate
- Use the following APIs (iOS 10.3 +)
-MTLDrawable addPresentedHandler
-MTLCommandBuffer presentDrawable afterMinimumDuration
-MTLCommandBuffer presentDrawable atTime
data:image/s3,"s3://crabby-images/f4b9b/f4b9b3292da27e94c84e543b1a2e93cb3f02b7af" alt=""
data:image/s3,"s3://crabby-images/bbf0d/bbf0d124c0693f39dbefd78b6bdd219e583243df" alt=""
Thread Stalling
Render thread gets preempted due to low priority
- Priority decay
- Priority inversion
data:image/s3,"s3://crabby-images/ab911/ab9119668168a9df1d4d5bc8debf33fa39cf0769" alt=""
data:image/s3,"s3://crabby-images/41c07/41c07d573d8ab122b54e736ab4b4534e6b7a18aa" alt=""
Best Practice
Configure the render thread
- Priority 45
- Opt out of Quality of Service
data:image/s3,"s3://crabby-images/897de/897de5245a661e887bbc87cef30ad7d52c1e766f" alt=""
data:image/s3,"s3://crabby-images/6dd8a/6dd8ae03f75dec2d571155964980ab72c9e035cb" alt=""
Thermal Throttling
Impact on system performace
- High device temperature
- Low power mode enabled
Best Practice
- Adjust the workload to the system state
- Use the following APIs
- iOS 11.0+ :NSProcessInfo thermalState
- iOS 10.3+ :MTLCommandBuffer GPUStartTime/ GPUEndTime
- iOS 9.0+ :NSProcessInfo lowPowerModeEnabled
data:image/s3,"s3://crabby-images/f2494/f2494494a771e357d47880cc3c7bcf9cdc017011" alt=""
Adjust the Workload
- Target sustainable framerate
- Reduce the resolution
- Simplify the shadow maps
- Use smaller textures
- Decrease the level of detail (LOD) for geometry
- Simplify post-processing and effects
Wasted GPU Time
Waste of power and GPU budget
- Large resources
- Unused GPU work
Best Practice
Profile the GPU
- Understand the cost of every rendering feature
- Remove excessive work
Metal System Trace
- Accurate timing for Vertex, Fragment, and Compute work
- Ideal to measure GPU budget
data:image/s3,"s3://crabby-images/2de36/2de3623b389ea56f75bbfb1eed10a458c3cd2532" alt=""
Dependency Viewer
data:image/s3,"s3://crabby-images/4f0ba/4f0ba76c842e2ced674435013a723da76972c20f" alt=""
data:image/s3,"s3://crabby-images/69f04/69f04d928e8c22b05e80d10a8fb073653ceaa61e" alt=""
data:image/s3,"s3://crabby-images/48d26/48d26a25799eece184610a1c4f4340a406c40b02" alt=""
data:image/s3,"s3://crabby-images/8f37e/8f37ed2efccc7f49e18446ffdb32bd2df91e9f39" alt=""
Finding Hidden Complexity
data:image/s3,"s3://crabby-images/226b3/226b39b4daadfe48c53607938f1864a9e8e4c536" alt=""
data:image/s3,"s3://crabby-images/c5043/c504300b77e5e0e0d1f5e5f0e7db2fad7b22cf79" alt=""
Take-Away
- Profile early and often
- Target a consistent frame rate
- Set the correct thread prriorites
- Adapt to system load and thermals
- Don’t submit unnecessary work to the GPU