Publish the anatomy of a coding assistant blog post #7002

ykdojo · 2024-06-12T18:49:08Z

No description provided.

netlify · 2024-06-12T18:49:26Z

✅ Deploy Preview for sourcegraph ready!

Name	Link
🔨 Latest commit	`140c807`
🔍 Latest deploy log	https://app.netlify.com/sites/sourcegraph/deploys/6670bf67e940f20008f5cb41
😎 Deploy Preview	https://deploy-preview-7002--sourcegraph.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

content/blogposts/2024/anatomy-of-a-coding-assistant.md

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

…o the Rust fine tuning article

kukicado

Great work @ykdojo!

content/blogposts/2024/anatomy-of-a-coding-assistant.md

kukicado · 2024-06-13T23:17:20Z

content/blogposts/2024/anatomy-of-a-coding-assistant.md

+
+1. **Conversation history:** In a given chat session, we record previous messages as they may contain relevant context for the user's next request.
+2. **Code search:** We fetch the most relevant code snippets related to the user's query from the codebase using search, similar to how a human dev might search for these code snippets.
+3. **User control:** Users should have the ability to mention specific files and provide those directly to the model, and also include the option to reference external sources like Slack threads or Notion documents to enrich the context further.


Suggested change

3. **User control:** Users should have the ability to mention specific files and provide those directly to the model, and also include the option to reference external sources like Slack threads or Notion documents to enrich the context further.

3. **User choice:** Users should have the ability to mention specific files and provide those directly to the model, and also include the option to reference external sources like Slack threads or Notion documents to enrich the context further.

maybe "choice" instead of "control"?

Hmm here, I think "control" fits better. "Choice" makes me think of choosing an LLM model - so maybe more like choosing one thing out of several options.

Makes sense. :)

content/blogposts/2024/anatomy-of-a-coding-assistant.md

kukicado · 2024-06-13T23:30:17Z

content/blogposts/2024/anatomy-of-a-coding-assistant.md

+- Diagnostic information like warnings and errors
+- User-specified context from @-mentions
+
+By including diagnostic information, we're able to provide more appropriate code edits to the selected range of code. In the future, we plan to incorporate code graph context here as well.


Do we also want to say something along the lines of "With the code editing feature, we put greater emphasis on the users prompt and rely on the above context sources to generate high quality code output."

This section feels a little incomplete and could use a few more details imo.

Added:

With the code editing feature, we currently put greater emphasis on the users prompt and rely on the above context sources to generate high quality code output.

This section feels a little incomplete and could use a few more details imo.

Definitely. Writing this part helped me realized that there's more we can do here from the product development perspective, too.

Awesome! Thank you. Yeah I think the editing/inserting code feature is a sleeping giant that if we invest more into, could be huge.

content/blogposts/2024/anatomy-of-a-coding-assistant.md

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

content/blogposts/2024/anatomy-of-a-coding-assistant.md

jtibshirani · 2024-06-14T22:55:16Z

content/blogposts/2024/anatomy-of-a-coding-assistant.md

+
+![13_autocomplete](https://storage.googleapis.com/sourcegraph-assets/blog/anatomy/13_autocomplete.png)
+
+For this, we look at a few sources of information: the cursor position, the surrounding code, and the code graph (a code graph is a representation of the relationships and structures within a codebase, mapping entities such as classes and methods to show how they are interconnected). We use the current cursor position within the code graph to determine if the user wants a single-line suggestion or a multi-line suggestion. Once we determine that, we add more context by looking through recent files and open tabs. Within those, we find code snippets related to the code you're currently writing.


It'd be good to explain how we use the code graph for autocomplete context. I know this is unclear to many people (both inside and out of Sourcegraph).

Added a link to this to answer it - hope it clarifies it: https://sourcegraph.notion.site/How-do-we-use-the-code-graph-for-autocomplete-context-11145ad3e07049e8ab1add7ac3012f81

That indeed helps. However It's still unclear to me when and if we actually use this. For example, is LSP context actually enabled by default for our customers?

I looked into it, and it looks like it's determined by a feature flag: https://github.com/sourcegraph/cody/blob/main/vscode/src/completions/completion-provider-config.ts#L77

As well as a config variable: https://github.com/sourcegraph/cody/blob/54b25dc7d17bac425c2b59b04ca08d600ef184ed/vscode/package.json#L1137

content/blogposts/2024/anatomy-of-a-coding-assistant.md

jtibshirani

I like the "product of products" framing, and how it ties into the different context retrieval strategies. I had a couple high-level thoughts:

I left some suggestions for expanding the "Keyword Search" description, to help emphasize our expertise in information retrieval/ search and how we have a nuanced strategy based on code. This applies to both the "query understanding" and ranking steps.
I wonder if we should mention our internal evals briefly? Maybe just a short note about how we are "data driven" in making changes, using both offline and online metrics. I almost never hear other companies mention evals in blog posts, and this gives me a bad impression ... like "why do you have such a complicated pipeline? Did you even measure it??"

jtibshirani

Thanks for resolving those comments! Did you see my comment about evals?

I wonder if we should mention our internal evals briefly? Maybe just a short note about how we are "data driven" in making changes, using both offline and online metrics. I almost never hear other companies mention evals in blog posts, and this gives me a bad impression ... like "why do you have such a complicated pipeline? Did you even measure it??"

ykdojo · 2024-06-17T21:14:51Z

@jtibshirani Sorry I missed it, but mentioning our internal evals sounds good.

What can we say about it specifically - or where can I find more about it?

jtibshirani

Looks good to me!

content/blogposts/2024/anatomy-of-a-coding-assistant.md

Co-authored-by: Julie Tibshirani <julietibs@apache.org>

Create anatomy-of-a-coding-assistant.md

144033b

Link the June release blog post

ee2c92d

beyang reviewed Jun 13, 2024

View reviewed changes

ykdojo and others added 25 commits June 13, 2024 15:51

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

eaf9413

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

370626e

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

3ec6ad2

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

1aafbc9

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

d893018

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

71cf4f4

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

4c410ee

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

4387ef0

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

15d7a99

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

1dd5113

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

2a3c69c

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

f050fe0

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

2edf196

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

78bfe16

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

c8d1aa0

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

b0af593

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

36c10da

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

9cfa37b

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

71a994d

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

5e2f6f1

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

0dfe77e

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

ef28683

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

917d81b

Co-authored-by: Beyang Liu <beyang@sourcegraph.com>

Anatomy of a coding assistant: remove the conclusion and add a link t…

8fa4eab

…o the Rust fine tuning article

Update anatomy-of-a-coding-assistant.md

4f51021

kukicado reviewed Jun 13, 2024

View reviewed changes

ykdojo and others added 9 commits June 13, 2024 16:32

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

31dfbfd

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

7d2f3d2

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update anatomy-of-a-coding-assistant.md

8bbe96f

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

a949a47

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

5d75dc4

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

a18f34b

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

dcd3ca2

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

Update anatomy-of-a-coding-assistant.md

c389179

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

1eeef6e

Co-authored-by: Ado Kukic <kukicadnan@gmail.com>

jtibshirani reviewed Jun 14, 2024

View reviewed changes

Update anatomy-of-a-coding-assistant.md

e69d675

jtibshirani reviewed Jun 17, 2024

View reviewed changes

Add a note on evals to the "anatomy" post

f1c220f

jtibshirani previously approved these changes Jun 17, 2024

View reviewed changes

content/blogposts/2024/anatomy-of-a-coding-assistant.md Outdated Show resolved Hide resolved

ykdojo dismissed jtibshirani’s stale review via 140c807 June 17, 2024 22:57

Update content/blogposts/2024/anatomy-of-a-coding-assistant.md

140c807

Co-authored-by: Julie Tibshirani <julietibs@apache.org>

jtibshirani approved these changes Jun 17, 2024

View reviewed changes

ykdojo merged commit 156c834 into main Jun 18, 2024
6 checks passed

ykdojo deleted the blog/anatomy-of-a-coding-assistant branch June 18, 2024 18:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Publish the anatomy of a coding assistant blog post #7002

Publish the anatomy of a coding assistant blog post #7002

ykdojo commented Jun 12, 2024

netlify bot commented Jun 12, 2024 •

edited

Loading

kukicado left a comment

kukicado Jun 13, 2024

ykdojo Jun 13, 2024

kukicado Jun 13, 2024

kukicado Jun 13, 2024

ykdojo Jun 13, 2024

kukicado Jun 13, 2024

jtibshirani Jun 14, 2024

ykdojo Jun 17, 2024

jtibshirani Jun 17, 2024

ykdojo Jun 17, 2024

jtibshirani left a comment

jtibshirani left a comment

ykdojo commented Jun 17, 2024

jtibshirani left a comment

	3. User control: Users should have the ability to mention specific files and provide those directly to the model, and also include the option to reference external sources like Slack threads or Notion documents to enrich the context further.
	3. User choice: Users should have the ability to mention specific files and provide those directly to the model, and also include the option to reference external sources like Slack threads or Notion documents to enrich the context further.


		![13_autocomplete](https://storage.googleapis.com/sourcegraph-assets/blog/anatomy/13_autocomplete.png)

		For this, we look at a few sources of information: the cursor position, the surrounding code, and the code graph (a code graph is a representation of the relationships and structures within a codebase, mapping entities such as classes and methods to show how they are interconnected). We use the current cursor position within the code graph to determine if the user wants a single-line suggestion or a multi-line suggestion. Once we determine that, we add more context by looking through recent files and open tabs. Within those, we find code snippets related to the code you're currently writing.

Publish the anatomy of a coding assistant blog post #7002

Publish the anatomy of a coding assistant blog post #7002

Conversation

ykdojo commented Jun 12, 2024

netlify bot commented Jun 12, 2024 • edited Loading

✅ Deploy Preview for sourcegraph ready!

kukicado left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jtibshirani left a comment

Choose a reason for hiding this comment

jtibshirani left a comment

Choose a reason for hiding this comment

ykdojo commented Jun 17, 2024

jtibshirani left a comment

Choose a reason for hiding this comment

netlify bot commented Jun 12, 2024 •

edited

Loading