Pull requests: bigscience-workshop/petals
Support loading weights from Safetensors on server (#473) by borzunov, merged Aug 22, 2023
Temporarily require peft<0.5.0, transformers<4.32.0 (#470) by justheuristic, merged Aug 22, 2023
Make client compatible with transformers' GenerationMixin (#464) by borzunov, merged Aug 20, 2023
Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) by borzunov, merged Aug 14, 2023
Prioritize short inference, unmerge pools for long inference (#458) by borzunov, merged Aug 11, 2023
Use torch.cuda.synchronize for compute throughput (#456) by justheuristic, merged Aug 9, 2023
benchmarks: Aggregate speed among workers, set default dtype torch32 (#454) by borzunov, merged Aug 9, 2023
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) by borzunov, merged Aug 8, 2023
Force using --new_swarm instead of empty --initial_peers (#451) by borzunov, merged Aug 8, 2023
Prefer longer servers for fine-tuning, exclude unreachable (#448) by borzunov, merged Aug 7, 2023
[Refactor] extract block forward, backward and inference into a separate file (#435) by justheuristic, merged Aug 7, 2023
Rewrite MemoryCache alloc_timeout logic (#434) by justheuristic, merged Aug 28, 2023