Discussion
Loading...

#Tag

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

How do you improve on this?

- Find more data sources
- Filter out bad data, both via humans and regexes
- RAG!

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Now we're seeing a demo of https://fighthealthinsurance.com/, what this talk has been all about.

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

They're hosting their fine-tuned model on their own hardware in a server rack, and fronted by a #Django app as an interface.

Going from lab to prod is still rough!

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

How do you improve on this?

- Find more data sources
- Filter out bad data, both via humans and regexes
- RAG!

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

How do we do the fine-tuning?

- Correct data structure
- Pile o' shell scripts
- Axolotl! this was clutch. https://axolotl.ai/

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

They're hosting their fine-tuned model on their own hardware in a server rack, and fronted by a #Django app as an interface.

Going from lab to prod is still rough!

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Once we have the data, how do we pick a model to fine tune?

- Make sure the model will fit in memory
- Check the license. Then check it again.
- Then consider good base performance, that's got good scores on best-related tasks

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

How do we do the fine-tuning?

- Correct data structure
- Pile o' shell scripts
- Axolotl! this was clutch. https://axolotl.ai/

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

First problem: training data!

- Insurance companies won't give the data
- Doctor's offices have the data, but it's hard for them to give it out
- Reddit has some data! But there's sample bias
- Insurance commissioners!

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Once we have the data, how do we pick a model to fine tune?

- Make sure the model will fit in memory
- Check the license. Then check it again.
- Then consider good base performance, that's got good scores on best-related tasks

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

This includes:
- An ML model to take denials and produce appeals
- Frontend interface for people to access the model
- Library to remove and put the personal data back in

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

First problem: training data!

- Insurance companies won't give the data
- Doctor's offices have the data, but it's hard for them to give it out
- Reddit has some data! But there's sample bias
- Insurance commissioners!

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

We have a social problem, and we're going to try and duct tape it with computers!

"Making healthcare leak a little less fast"

How? AI to take denials and produce appeals.

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

This includes:
- An ML model to take denials and produce appeals
- Frontend interface for people to access the model
- Library to remove and put the personal data back in

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

What's the problem? Health Insurance in America frequently denies medical care, Insurance _might_ be using AI to deny claims, and appealing claim denials is hard

And they have a lot more money than us!

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

We have a social problem, and we're going to try and duct tape it with computers!

"Making healthcare leak a little less fast"

How? AI to take denials and produce appeals.

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Holden is @holden! And has written a ton of books on Spark that people should buy.

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

What's the problem? Health Insurance in America frequently denies medical care, Insurance _might_ be using AI to deny claims, and appealing claim denials is hard

And they have a lot more money than us!

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Net talk at AI #bythebay: "AI For Good: Fighting Health Insurance with AI" by Holden Karau.

Starting with CWs around surgery, hospitals, trans references, and broken bones.

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Holden is @holden! And has written a ton of books on Spark that people should buy.

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Net talk at AI #bythebay: "AI For Good: Fighting Health Insurance with AI" by Holden Karau.

Starting with CWs around surgery, hospitals, trans references, and broken bones.

  • Copy link
  • Flag this post
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Uber has a combined interface for doing debugging, visualizing, and evals on an agent.

(This would be amazing to have at work)

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

This tooling supports local servers, online servers, and versioning for rollbacks and testing.

Evals can even be run offline, or in CI/CD

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Why use the Agent Protocol? Standardized:

- Chat history management
- Low Level APIs
- Observability (would love to know more about this)

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Uber has a combined interface for doing debugging, visualizing, and evals on an agent.

(This would be amazing to have at work)

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

AgentFX is a standard interface to agents at Uber.

the runtime uses the OSS Agent Protocol (with langchain.ai)

Provides Threads, Runs, Messages, and Agents under one protocol.

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Why use the Agent Protocol? Standardized:

- Chat history management
- Low Level APIs
- Observability (would love to know more about this)

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Team is trying to provide the best tools for working in AI.

Their verticals:
- Agent Builder for rapid prototyping
- Write production-ready AI applications with AgentFX
- Monitor and deploy with Agent Studio

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

AgentFX is a standard interface to agents at Uber.

the runtime uses the OSS Agent Protocol (with langchain.ai)

Provides Threads, Runs, Messages, and Agents under one protocol.

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Uber has a sense of levels for AI agents.

L1 - Responders, simple chat bot
L2 - Assistants, ChatGPT without reasoning
L3 - Collaborators, ChatGPT with deep research, Claude Code
L4 - Experts, AI agents SWE teams that build new features
L5 - Autonomous Teams - AI manager builds team of agents to get things done

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Team is trying to provide the best tools for working in AI.

Their verticals:
- Agent Builder for rapid prototyping
- Write production-ready AI applications with AgentFX
- Monitor and deploy with Agent Studio

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Uber has a sense of levels for AI agents.

L1 - Responders, simple chat bot
L2 - Assistants, ChatGPT without reasoning
L3 - Collaborators, ChatGPT with deep research, Claude Code
L4 - Experts, AI agents SWE teams that build new features
L5 - Autonomous Teams - AI manager builds team of agents to get things done

#bythebay

  • Copy link
  • Flag this post
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Next talk: "Uber's Multi-Agent Platform" by Jamieson Leibovitch

This is one of the talks I've been most excited for!

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Jamieson has been at Uber for 4 years, starting inside embedded hardware and moving into the Agent Platform team.

#bythebay

  • Copy link
  • Flag this comment
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Next talk: "Uber's Multi-Agent Platform" by Jamieson Leibovitch

This is one of the talks I've been most excited for!

#bythebay

  • Copy link
  • Flag this post
  • Block
phildini
@phildini@wandering.shop  ·  activity timestamp 2 days ago

Example 3: Voice transcription project

Lesson 4: Voice is faster than typing (not sure I agree with this, but fascinating take!)

using superwhisper, MacWhisper

#bythebay

phildini
@phildini@wandering.shop replied  ·  activity timestamp 2 days ago

Don't be afraid of starting over if the context is getting unwieldy!

#bythebay

  • Copy link
  • Flag this comment
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login