
[Docs] Benchmark docs #5360

Merged

Conversation

patrickvonplaten
Contributor

@patrickvonplaten commented Jun 29, 2020

This PR updates the docs for benchmarks and adds a README.md where the community can post their benchmark results.

Would be happy to get feedback from @sgugger and @LysandreJik.

@LysandreJik - I deleted the part about "This work was done by Timothy Liu." because the links were broken.


If you would like to list benchmark results for your favorite models from the [model hub](https://huggingface.co/models) here, please open a Pull Request and add them below.

| Benchmark description | Results | Environment info | Author |
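For illustration, a hypothetical entry might look like the row below (all names and links are placeholders, not real measurements):

| Benchmark description | Results | Environment info | Author |
|---|---|---|---|
| `bert-base-uncased` inference, PyTorch, single GPU | [results.csv](#) | [environment.csv](#) | @your-github-handle |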
Contributor

Is there one number we could put in the actual table, like tokens/second or MB/1000 tokens, so that this table is more than just a ton of links?

Contributor

I'd also have a Framework column instead of including it in the description. And maybe also a Date column.
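Combining these column suggestions, the header could look something like this (a purely hypothetical column set, just to make the proposal concrete):

| Benchmark description | Framework | Speed (tokens/s) | Memory (MB / 1000 tokens) | Environment info | Author | Date |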

Contributor

I love this idea though!

Member

How good is GitHub at diff'ing large CSV files? I was thinking of copying all results into a single, monster (multi-dimensional?) spreadsheet like @LysandreJik's spreadsheet.

>>> python examples/benchmarking/run_benchmark_tf.py --help


An instantiated benchmark object can then simply be run by calling ``benchmark.run()``.
Contributor

This does not seem to follow linearly from the command-line example above.

Contributor

Should be closer to L34.


Here, three arguments are given to the benchmark argument data classes, namely ``models``, ``batch_sizes``, and ``sequence_lengths``. The argument ``models`` is required and expects a :obj:`list` of model identifiers from the `model hub <https://huggingface.co/models>`__.
The :obj:`list` arguments ``batch_sizes`` and ``sequence_lengths`` define the size of the ``input_ids`` on which the model is benchmarked.
There are many more parameters that can be configured via the benchmark argument data classes. For more detail on these, one can either directly consult the files
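As a minimal sketch of how these arguments fit together (the PyTorch variant is assumed, and the model names and values are placeholders):

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Benchmark two hub checkpoints across a few input shapes (placeholder values).
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased", "distilbert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[8, 32, 128, 512],
)

benchmark = PyTorchBenchmark(args)
results = benchmark.run()  # runs the configured benchmarks and returns the results
```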
Contributor

I would have one example with all reasonable command-line args. It's easier to delete than add.

Contributor

@sshleifer left a comment

Great idea. I also tried to promote sharing seq2seq results through wandb (cc: @clmnt). I haven't gotten much usage yet, but maybe I will eventually!

How to benchmark 🤗 Transformer models
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The classes :obj:`PyTorchBenchmark` and :obj:`TensorflowBenchmark` allow for flexible benchmarking of 🤗 Transformer models.
Collaborator

Use :class:`~transformers.PyTorchBenchmark` and :class:`~transformers.TensorflowBenchmark` so that we get a link to the doc (once it's added ;-) ) here and in the rest of the document.

@patrickvonplaten
Contributor Author

Actually, I will make a separate PR for the examples README.md, to have the docs in v3.0.0.

Member

@LysandreJik left a comment

Very cool!

How to benchmark 🤗 Transformer models
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The classes :obj:`PyTorchBenchmark` and :obj:`TensorflowBenchmark` allow for flexible benchmarking of 🤗 Transformer models.
Member

🤠 what does this mean?

Contributor Author

Not sure where this comes from actually :D -> it was supposed to be an HF smiley.

Member

haha very random

docs/source/benchmarks.rst (outdated, resolved)
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Collaborator

@sgugger left a comment

@LysandreJik Are the doctests included in the tests now? I think this file should be ignored if that's the case, because:

  • is it fast enough?
  • the output seems very dependent on the system


Here, `inference` is defined as a single forward pass, and `training` is defined as a single forward pass and a backward pass.

The benchmark classes :obj:`PyTorchBenchmark` and :obj:`TensorflowBenchmark` expect an object of type :obj:`PyTorchBenchmarkArguments` and :obj:`TensorflowBenchmarkArguments`, respectively, for instantiation. :obj:`PyTorchBenchmarkArguments` and :obj:`TensorflowBenchmarkArguments` are data classes and contain all relevant configurations for their corresponding benchmark class.
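To make the inference/training distinction concrete, here is a hedged sketch (the `training` flag is an assumption about the argument data class, and all values are placeholders):

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Assumed flag: `training=True` is taken here to benchmark the forward + backward
# pass described above, rather than pure inference.
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128],
    training=True,
)
PyTorchBenchmark(args).run()
```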
Collaborator

Same for :obj:`PyTorchBenchmarkArguments` and :obj:`TensorflowBenchmarkArguments`.

(PS: if this is still experimental and can be renamed, Tensorflow is normally spelled TensorFlow; PyTorchBenchmark has the right spelling)

@LysandreJik
Member

@sgugger Right now it's not ignored, so the slow test will fail because the output isn't the same. I don't think it's too big of a deal, though; we can fix that after the release with only partial testing of the file.

@patrickvonplaten
Contributor Author

Thanks for the reviews. Addressed the comments and also renamed the classes for consistency.

Member

@LysandreJik left a comment

Very cool!
