-
Stevens Institute of Technology
-
13:14
(UTC -04:00) - https://jdibenes.github.io/
- in/jdibenes
Block or Report
Block or report jdibenes
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
UNIX-like reverse engineering framework and command-line toolset
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
DSPy: The framework for programming—not prompting—foundation models
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Tesseract Open Source OCR Engine (main repository)
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…
We retraine the YOLO-series detection framework on the ego-object dataset in order to obtain a more complete egocentric perspective visual tool chain.
Tools and samples for camera related APIs on Windows
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
A Robust and Versatile Monocular Visual-Inertial State Estimator
COLMAP - Structure-from-Motion and Multi-View Stereo
PyTorch Lightning Optical Flow models, scripts, and pretrained weights.
Zdepth :: Streaming Depth Compressor in C++ for Azure Kinect DK
a reimplementation of PWC-Net in PyTorch that matches the official Caffe version
[CVPR 2023] Rethinking Optical Flow from Geometric Matching Consistent Perspective
GTSAM is a library of C++ classes that implement smoothing and mapping (SAM) in robotics and vision, using factor graphs and Bayes networks as the underlying computing paradigm rather than sparse m…
VOLDOR-SLAM is a real-time dense-indirect SLAM system takes dense optical flows as input that supports monocular, stereo and RGB-D video sequence.
The primary source code repository for Macaulay2, a system for computing in commutative algebra, algebraic geometry and related fields.
A curated list of awesome streaming video tools, frameworks, libraries, and learning resources.
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction