- Amazon
- San Francisco, CA
- https://www.linkedin.com/in/vdantu/
- @vdan123
Stars
vdantu / sagemaker-huggingface-inference-toolkit
Forked from aws/sagemaker-huggingface-inference-toolkit
HTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)
Serve, optimize and scale PyTorch models in production
A guideline for building practical production-level deep learning systems to be deployed in real world applications.
This project is inspired by the design and development of the AWS Serverless Application Repository - a production-grade AWS service. Learn how AWS built a production service using serverless techn…
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A cheatsheet of modern C++ language and library features.
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
Multi Model Server is a tool for serving neural net models for inference