Hacker News | LeonidBugaev's comments

It does not implement the Auth :)

(mcp auth is terrible btw)


I couldn’t find any great examples of MCP auth, so I recently made this to demonstrate an OAuth flow - https://github.com/OBannon37/chatgpt-deep-research-connector...


For my app I'm bypassing MCP auth and doing the regular oauth2 flow to connect users to external apps.

Then I pass the stored oauth token directly to my (private) MCP servers alongside a bearer token.
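A minimal sketch of that pattern, using only the standard library. The URL and the `X-User-OAuth-Token` header name are hypothetical illustrations (only `Authorization` is a standard header); the point is just the two-token shape:

```python
import json
import urllib.request

def call_mcp(url, service_token, user_oauth_token, payload):
    """Build a request to a private MCP server, carrying both tokens."""
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            # Bearer token proves the caller is our own backend.
            "Authorization": f"Bearer {service_token}",
            # The user's stored OAuth token for the external app,
            # passed out-of-band under a made-up header name.
            "X-User-OAuth-Token": user_oauth_token,
        },
        method="POST",
    )

req = call_mcp("https://mcp.example.internal/tools", "svc-123", "usr-abc", {"tool": "ping"})
# urllib.request.urlopen(req) would actually send it
```

The MCP server then authorizes the caller via the bearer token and uses the forwarded OAuth token when it talks to the external service.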


You should check https://probeai.dev/ too. That's one of those building blocks that makes AI truly understand the code.


To put it simply:

A2A is for communication between agents. MCP is how an agent communicates with its tools.

An important aspect of A2A is that it has a notion of tasks, task readiness, etc. E.g. you can give it a task, expect completion in a few days, and get notified via a webhook or by polling it.

For end users A2A will surely cause a lot of confusion, and it can replace a lot of current MCP usage.
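The task lifecycle described above can be sketched as a polling loop. The `client.get_task` method and the state names here are hypothetical stand-ins, not the actual A2A API:

```python
import time

def wait_for_task(client, task_id, poll_interval=1.0, timeout=60.0):
    """Poll a long-running agent task until it reaches a terminal state.

    A real A2A client could register a webhook instead of polling.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        task = client.get_task(task_id)  # assumed client method
        if task["state"] in ("completed", "failed", "canceled"):
            return task
        time.sleep(poll_interval)
    raise TimeoutError(f"task {task_id} did not finish in {timeout}s")
```

For day-scale tasks, the webhook variant is clearly the better fit; polling is shown only because it is the simplest to sketch.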


If an agent could wrap itself in an MCP server, would that make A2A redundant?


The same question came to my mind.

What if I wrap the agent as a tool in MCP?

Since the agents I got from the 'A2A' protocol are passed as tools to another agent...

https://github.com/google/A2A/blob/72a70c2f98ffdb9bd543a57c8...


you mean wrap an MCP server in itself?


Hello HN!

I've been building Probe https://probeai.dev/ for a while now, and this docs-mcp project is a showcase of what it's capable of: giving you local semantic search over any codebase or docs without indexing.

Feel free to ask any questions!


Nope, it simply shows fresh issues with "help wanted" and "good first issue" labels.


I do maintain big OSS projects and try to contribute as well.

However, the contribution experience can be very bad if you follow the path of picking the most famous projects. Good luck contributing to Node, Rust, shadcn, etc. - they do not need your contribution; their PR queues are overloaded and they can't handle them. Plus you need to get into their inner circles first, through quite a complex process.

The world is much bigger. There is so much help needed by smaller but still active projects.

Just recently I raised 3 small PRs, and they were reviewed the same day!

As my tribute to the whole OSS community, I have built https://helpwanted.dev/, a website which in a nutshell shows the latest "help wanted" and "good first issue" issues from all over GitHub in the last 24 hours.

You would be amazed how many cool projects out there are looking for help!


One of the cases where AI is not needed. There is a very well-working algorithm for extracting content from pages; one of the implementations: https://github.com/buriy/python-readability


Some years ago I compared those boilerplate removal tools, and I remember that jusText gave me the best results out of the box (I tried readability and a few other libraries too). I wonder what the state of the art is today?


This is worth having a look at: https://mixmark-io.github.io/turndown/

With some configuration you can get most of the way there.


oh, AI is optional here. I do use readability to clean the HTML before converting to .md.


Last time I tried readability it worked well with articles but struggled with other kinds of pages. Took away far more content than I wanted it to.


How do you achieve the same things without AI here using that tool?


"How do you do it without AI" is a question I (sadly) expect to see more often.


Feel free to answer then: how do you do the same things this does with GPT (3/4), without AI?

Edit -

This is an excellent use of it: a free-text human input capable of doing things like extracting summaries. It does not seem to be used at all for the basic task of extracting content, but for post-filtering.


I think “copy from a PDF” could be improved with AI. It’s been 30 years and I still get new lines in the middle of sentences when I try to copy from one.
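For the common failure mode (hard line breaks in the middle of sentences, plus words hyphenated across lines), a couple of regexes get surprisingly far without any AI; a rough sketch:

```python
import re

def unwrap_pdf_text(text):
    """Re-flow text copied from a PDF into normal paragraphs."""
    # Re-join words hyphenated across line breaks: "implemen-\ntation" -> "implementation"
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Keep paragraph breaks (blank lines), but fold single newlines into spaces.
    text = re.sub(r"(?<!\n)\n(?!\n)", " ", text)
    return text
```

The hard cases (multi-column layouts, captions interleaved with body text) are where an LLM-based cleanup would genuinely add value over this.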


That's a great use case; you might be able to do this if you've got copy and paste available on the command line, with

https://github.com/simonw/llm

in between. An alias like pdfwtf translating to "paste | llm command | copy".


i've long assumed that is a "feature" of PDF akin to DRM. Making it hard to copy text from a PDF makes sense from a publisher's standpoint.


Meh, it’s just the “how does it work?” question. How content extractors work is interesting and not obvious nor trivial.

And even when you see how the readability parser works, AI handles most of the edge cases that content extractors fail on, so they are genuinely superseded by LLMs.


I was honestly expecting it to be mostly black magic, but it looks like the meat of the project is a bunch of (surely hard won) regexes. Nifty.


> I was … expecting it to be mostly black magic, but … the meat of the project is a bunch of … regexes

Wait, regexes are the epitome of black magic. What do you consider black magic?


Macros? Any situation where code edits other code?

Sure, I could not write a regex engine, but the language itself can be fine if you keep it to straightforward stuff. Unlike the famous e-mail parsing regex.


how does it compare to mozilla/readability?


it uses readability but does some additional stuff like relinking images to local paths etc., which I needed


I have had challenges with readability. The output is good for blogs, but when we try it on other types of content, it misses important details even when the page is quite text-heavy, just like a blog.


yeah, that’s correct. i put a checkbox to disable the readability filter if needed…


Plus in production, under high load, a Redis cluster is way more common, which kind of solves the single-threaded concern.


I've always found redis cluster to just bring problems with it.


What I really appreciate about Rails is the strong vision, and that it hasn't become another bloated framework for building "enterprise grade" applications. This is a 20-year-old framework which is not afraid to change radically with time, and it is still seen as punk compared to the rest.

It was so heartwarming to remember my Rails story; it's been 20 years since I started using it! https://twitter.com/buger/status/1723040883325460818


I love to see more activity in this area!

I'm the maintainer of GoReplay https://github.com/buger/goreplay and have worked in this area for the last 10 years.

It is quite a hard problem to solve, because you have to deal with state differences between test and production environments. I love your approach of mocking dependencies and leveraging OpenTelemetry. It can potentially solve some of the state issues, but it still requires modifying user code. I wonder if it can be done purely using OpenTelemetry (e.g. you depend on a typical OTel setup), and then read the data directly from the OTel DB.

Cheers!


Thanks Leonid! Your vote of confidence means a lot.

OTel for Go requires user code changes, unlike languages that allow monkey-patching (Java, JS, Python, etc.).

> I wonder if it can be done purely using OpenTelemetry (e.g. you depend on a typical OTel setup), and then read the data directly from the OTel DB.

OTel doesn't work out of the box: it usually doesn't collect the request or response for any network or DB call. 90% of my time is spent extending the individual agents' code so that they can collect the additional required information and perform "replay".


GoReplay has been one of the inspirations, Leonid, so glad you checked out CodeParrot :)

Typical OTel implementations don't capture some request data, esp. parameters, and the replay part is missing, among a few other issues, so we need to extend it.


thank you Leonid for GoReplay. A great ecosystem of products will be built on top of it.


I hope so! But I also hope that I will be able to monetise some of this movement. GoReplay is dual-licensed under AGPL and a commercial license. I also sell special appliance licenses.

If anyone in this thread wants to build a product based on GoReplay technology (capturing network traffic directly, via AWS Traffic Mirroring, or in k8s), send me a message :)

