[extension/llm] IOManager interface

### 🚀 The feature, motivation and pitch

Certain backends like qnn or coreML utilize static attention which requires very special handling in the runner code. At a higher level different algorithms in general can cause minor differences in the IO of the model that make it difficult for the runner class to handle causing folks to have to fork the runner for every new model. 

An IOManager interface would simplify the runner code abstracting away minutia that really boils down to state management away letting a lot of the other runner boilerplate to be shared

### Alternatives

_No response_

### Additional context

_No response_

### RFC (Optional)

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[extension/llm] IOManager interface #12002

🚀 The feature, motivation and pitch

Alternatives

Additional context

RFC (Optional)

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[extension/llm] IOManager interface #12002

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

RFC (Optional)

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions