ContextForge

ContextForge is a powerful and flexible command-line tool designed to compile the contents of a development project into a single, well-structured file. This compiled output is ideal for use as input to large language models (LLMs) like GPT, making it easier to provide comprehensive project context in a single prompt.

As LLMs continue to evolve, we're seeing a significant increase in their context window sizes. This expansion allows these models to process and understand larger amounts of information at once, opening up new possibilities for developers and AI practitioners. ContextForge is at the forefront of this revolution, enabling users to leverage these expanded context windows to their full potential.

With ContextForge, you can now compile your entire project—including code, documentation, and configuration files—into a single, coherent document. This comprehensive compilation allows you to provide LLMs with a complete picture of your project, leading to more accurate and contextually relevant responses. Whether you're seeking code suggestions, architectural advice, or deep project analysis, ContextForge ensures that the LLM has access to all the necessary information.

Key benefits of using ContextForge with large context window LLMs include:

Holistic Understanding: LLMs can grasp the full scope of your project, including intricate relationships between different components.
Improved Accuracy: With access to more context, LLMs can provide more precise and project-specific suggestions and analyses.
Time Efficiency: Instead of manually selecting and pasting relevant parts of your project, ContextForge automates the process of creating a comprehensive context.
Consistency: Ensure that every interaction with the LLM is based on the same, complete project context, leading to more consistent and coherent assistance.
Scalability: As your project grows, ContextForge scales with it, always providing the most up-to-date and complete context to the LLM.

By bridging the gap between expansive codebases and the growing capabilities of LLMs, ContextForge empowers developers to harness the full potential of AI assistance in their development workflows. Whether you're working on a small script or a large-scale application, ContextForge is an essential tool for maximizing the benefits of large context window LLMs in your development process.

Features

Project Compilation: Recursively scans and compiles the contents of a project directory into a single file.
Multiple Output Formats: Supports Markdown, HTML, JSON, and XML output formats.
Syntax Highlighting: Automatically detects and applies appropriate language syntax highlighting for code files.
Improved Path Handling: Better support for files not directly below the project root in the tree structure.
Flexible Ignore Patterns: Supports both .cfignore and .gitignore files to exclude specific files or directories from compilation.
Automatic .git Exclusion: When using .gitignore, the .git directory is automatically excluded.
File Size Limit: Option to set a maximum file size for inclusion in the compilation.
File Extension Filtering: Ability to specify which file extensions to include in the compilation.
Metadata Inclusion: Adds useful metadata about the compilation process to the output.
Parallel Processing: Uses multi-threading to speed up the compilation process for large projects.
Progress Tracking: Displays a progress bar during compilation.
Smart File Naming: Automatically uses the project folder name as the default output file name.
Consistent File Extensions: Ensures the output file extension matches the chosen format.
Watch Mode: Automatically recompiles the project when file changes are detected.

Installation

Ensure you have Python 3.6 or later installed on your system.

Clone the ContextForge repository:

git clone https://github.com/seeschweiler/contextforge.git
cd contextforge

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

The basic usage of ContextForge is as follows:

python contextforge.py [project_path] [output_file] [-f FORMAT] [-m MAX_FILE_SIZE] [--extensions EXTENSIONS] [--watch]

project_path: Path to the project folder (default: current directory)
output_file: Path to the output file (default: project_name.{format})
-f, --format: Output format (markdown, html, json, or xml; default: markdown)
-m, --max-file-size: Maximum file size in bytes to include (default: 1000000)
--extensions: Comma-separated list of file extensions to include (e.g., 'py,js,md')
--watch: Run in watch mode, recompiling on file changes

For more information and options, use the help command:

python contextforge.py -h

Configuration

.cfignore and .gitignore Files

ContextForge now supports both .cfignore and .gitignore files in the root of your project directory. These files allow you to specify patterns for files and directories that should be excluded from the compilation.

If both .cfignore and .gitignore exist, their contents are merged.
When using .gitignore, the .git directory is automatically excluded.

Example .cfignore or .gitignore file:

# Ignore all .log files
*.log

# Ignore the entire 'node_modules' directory
node_modules/

# Ignore a specific file
secrets.txt

# Ignore all files in a specific directory
build/*

Output Formats

ContextForge supports four output formats:

Markdown (default): A well-structured Markdown file with appropriate code blocks and syntax highlighting.
HTML: An HTML file with syntax-highlighted code blocks, suitable for viewing in a web browser.
JSON: A JSON file containing the project structure and file contents, useful for programmatic processing.
XML: An XML file with a structured representation of the project, ideal for parsing and processing with XML tools.

Examples

Compile the current directory to the default output file (project_name.md):
```
python contextforge.py
```

Compile a specific project to a custom output file:

python contextforge.py /path/to/project custom_output.md

Compile to HTML format:
```
python contextforge.py -f html
```
Compile with a 500KB max file size:
```
python contextforge.py -m 500000
```

Compile to JSON format with a 2MB max file size:

python contextforge.py -f json -m 2000000 /path/to/project

Compile to XML format:

python contextforge.py -f xml /path/to/project

Compile only Python and JavaScript files:

python contextforge.py --extensions py,js

Compile to XML format with a 2MB max file size, only including Python files:

python contextforge.py -f xml -m 2000000 --extensions py /path/to/project output.xml

Run in watch mode, recompiling on file changes:
```
python contextforge.py --watch
```

Run in watch mode with specific format and extensions:

python contextforge.py --watch -f html --extensions py,js /path/to/project

Contributing

Contributions to ContextForge are welcome! Please feel free to submit a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.gitignore		.gitignore
README.md		README.md
cf_howitworks.png		cf_howitworks.png
cflogo.png		cflogo.png
contextforge.png		contextforge.png
contextforge.py		contextforge.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ContextForge

Table of Contents

Features

Installation

Usage

Configuration

.cfignore and .gitignore Files

Output Formats

Examples

Contributing

About

Contributors 2

Languages

seeschweiler/contextforge

Folders and files

Latest commit

History

Repository files navigation

ContextForge

Table of Contents

Features

Installation

Usage

Configuration

.cfignore and .gitignore Files

Output Formats

Examples

Contributing

About

Topics

Resources

Stars

Watchers

Forks

Contributors 2

Languages