Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Great question!

1. I've observed multiple products across customers.

1.1 Correcting or filling missing information in structured data. For example a system to suggest corrections to products in a company catalogue (each product category has different schema). Unstructured data is pulled from various websites and optionally from categories retrieved from images. It is then compared against the data and most probable fixes are reported.

Most of the work is done by a few polite prompts to GPT3.5/4 (~5 English sentences in total)

1.2 Better search company data. E.g. a chat bot for internal documentation that can also access internal services in order to answer a question. Same ~5 English sentences to do bulk of the work.

1.3 (non commercial) Endangered language preservation. Building a smart agent that is accessible via chat/hardware (like Alexa/Homepod), that talks in native language can understand and helps to preserve the culture. This is a complex one.

2. Tech stack itself is rather simple. Mostly - GPT, LangChain/LlamaIndex, Vector database with embeddings for memory, plugins for external services and potentially agents to drive workflows.

Output quality, testing, monitoring, scalability etc also don't differ much from operating normal "old-school" ML models. If anything, it feels simpler.

The tricky part is that the entire notion of LLM-driven micro-services is new. Quality of the resulting product largely depends on knowing prompting tricks and following the latest news in an area.

Plus the biggest challenge that customers want to be solved: "How can I ran it on my hardware?"



Can you sell me on langchain? To me it seemed useful for building gimmicky examples, but didn’t seem very easy to customize


I came to the same conclusion.


How do Embeddings work for you? I'm yet to play with them, but I imagine: - with large corpus of text, they can be expensive to compute

- they may be limited in some cases - when you want to extract a deeper meaning from text - that is - gpt3/4 would figure out that an answer to a query is in a given document, but embeddings are way more shallow and will miss it

I know they are far better than previous search approaches, but do they have the limitations above?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: