[feature][refactor] Optimum-Benchmark API #118

IlyasMoutawwakil · 2024-02-05T14:20:58Z

In order to support an API that's very simple to use and doesn't require passing by hydra CLI and subprocess.
This refactoring removes most of hydra magic, especially resolvers that made the usage of optimum-benchmark as an API impossible.

Example:

from pprint import pprint
from optimum_benchmark.logging_utils import setup_logging
from optimum_benchmark.experiment import launch, ExperimentConfig
from optimum_benchmark.backends.pytorch.config import PyTorchConfig
from optimum_benchmark.launchers.process.config import ProcessConfig
from optimum_benchmark.benchmarks.inference.config import InferenceConfig

if __name__ == "__main__":
    setup_logging(level="INFO")
    backend_config = PyTorchConfig(model="gpt2", no_weights=True, device="cuda")
    launcher_config = ProcessConfig(device_isolation=True)
    benchmark_config = InferenceConfig(memory=True)
    experiment_config = ExperimentConfig(
        experiment_name="python-api-launch-experiment",
        benchmark=benchmark_config,
        launcher=launcher_config,
        backend=backend_config,
    )
    benchmark_report = launch(experiment_config)
    pprint(benchmark_report)

[2024-02-06 12:58:42,933][backend][WARNING] - Multiple GPUs detected but CUDA_VISIBLE_DEVICES is not 
[...]
[2024-02-06 12:59:14,971][isolation][INFO] -    + Closing device(s) isolation process...
{'decode.latency(s)': 0.7354095666084248,
 'decode.throughput(tokens/s)': 269.2377268263452,
 'forward.latency(s)': 0.0073284958915750915,
 'forward.max_memory_allocated(MB)': 530,
 'forward.max_memory_reserved(MB)': 574,
 'forward.max_memory_used(MB)': 2025,
 'forward.peak_memory(MB)': 2025,
 'forward.throughput(samples/s)': 272.9072963388325,
 'generate.latency(s)': 0.7427380625,
 'generate.max_memory_allocated(MB)': 555,
 'generate.max_memory_reserved(MB)': 618,
 'generate.max_memory_used(MB)': 2069,
 'generate.peak_memory(MB)': 2069,
 'generate.throughput(tokens/s)': 269.2739339718436}

model, device, task and library are deprecated in experiment, in favor of backend.model and backend.device, backend.task and backend.library.
No breaking changes, only more flexibility and fine grained testing 😊.

IlyasMoutawwakil added 10 commits February 5, 2024 13:35

refactor trackers

7e589e7

refactor launchers

d0cc41a

refactor input generators

38243d5

refactor backends

a4d1d05

refactor benchmarks

32bf5e9

added api utilities

1188fa5

api tests

3f0c8c7

update workflows

2bf0a85

update examples

c26e3c0

style

349019b

IlyasMoutawwakil changed the title ~~[refactor] Optimum-Benchmark API~~ [feature][refactor] Optimum-Benchmark API Feb 5, 2024

IlyasMoutawwakil mentioned this pull request Feb 5, 2024

How to use optimum-benchmark for custom testing of my model #116

Closed

IlyasMoutawwakil added 18 commits February 6, 2024 02:49

logging

1712f3c

fix generators

d4e7cd6

deprecated experiment arguments

e135670

api testing

77598a5

fix workflows

560decc

fix torch-ort backend

685216f

fix backned names

372891a

fix torchrun launcher

2759ba0

fix ort backend

0ff79f4

style

cb943fe

consolidate training benchmark

c332902

more tests

d4487bc

fix

911e694

fix cli test

1e31b4a

readme

d00f0cb

consistent and controlabale logging level throughout processes

11d2573

deferred benchmark metrics logging

67800e3

fix torchrun failing on returning output

6f52748

allow api setting CUDA_VISIBLE_DEVICES multiple times

94d8761

IlyasMoutawwakil merged commit 6be304d into main Feb 6, 2024
21 checks passed

IlyasMoutawwakil mentioned this pull request Feb 12, 2024

[feature][refactor] Benchmark Reporting + Hub Mixin #122

Merged

IlyasMoutawwakil deleted the api-integration branch March 29, 2024 16:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[feature][refactor] Optimum-Benchmark API #118

[feature][refactor] Optimum-Benchmark API #118

IlyasMoutawwakil commented Feb 5, 2024 •

edited

Loading

[feature][refactor] Optimum-Benchmark API #118

[feature][refactor] Optimum-Benchmark API #118

Conversation

IlyasMoutawwakil commented Feb 5, 2024 • edited Loading

IlyasMoutawwakil commented Feb 5, 2024 •

edited

Loading