Merged
27 commits
c78374f
Add memory-first storage architecture (pre-TurboQuant)
michalharakal Apr 6, 2026
1b4bf05
Use borrowed arrays in SafeTensors loader, add wrap methods to Execut…
michalharakal Apr 6, 2026
d33a4e1
Add loadTensorStorage() to streaming GGUF and SafeTensors readers
michalharakal Apr 6, 2026
0e72c50
Add file-backed tensor storage loading to streaming readers
michalharakal Apr 6, 2026
8d306d5
Expose MemoryPlanner and MemoryTracker through ExecutionContext
michalharakal Apr 6, 2026
749c134
Migrate Ternary2BitTensorData to PackedBlockStorage interface
michalharakal Apr 6, 2026
cf6b59e
Add FallbackMappedMemoryChunk for non-JVM platforms
michalharakal Apr 6, 2026
693b6af
Add explicit transfer operations to TensorStorage
michalharakal Apr 6, 2026
68f2d3f
Auto-instrument copy paths with ActiveMemoryTracker
michalharakal Apr 6, 2026
710252f
Add @Place and @Weights DSL annotations for placement intent
michalharakal Apr 6, 2026
3d9535f
Update README and docs to recommend StreamingGGUFReader
michalharakal Apr 6, 2026
0c3eaf5
Add BufferAccessor and JvmFileBackedResolver for mmap end-to-end
michalharakal Apr 6, 2026
85c7fe1
Add end-to-end storage integration tests with synthetic GGUF
michalharakal Apr 6, 2026
b3eb1ff
Add TensorStorage → TensorData bridge for backend compatibility
michalharakal Apr 6, 2026
9886cd9
Add sin, cos, tanh and convTranspose1d tensor ops
michalharakal Apr 6, 2026
f274085
Add TransposedConv1d, Snake activation and LayerScale modules
michalharakal Apr 6, 2026
552a19f
Add unit tests for transfer ops, Q4_K and Ternary dequantization
michalharakal Apr 6, 2026
ec1276d
Add tests for ActiveMemoryTracker, FallbackMappedMemoryChunk, non-con…
michalharakal Apr 6, 2026
cbd069f
Add storage benchmarks
michalharakal Apr 6, 2026
63776a9
Add Tekken tokenizer for Mistral models
michalharakal Apr 6, 2026
3deeb06
Merge pull request #457 from SKaiNET-developers/feature/mistral
michalharakal Apr 6, 2026
4b9872c
Remove unrelated file
michalharakal Apr 7, 2026
b650236
Complete Step 1: KV-cache subsystem, SDPA bridge, Quants.kt, SafeTens…
michalharakal Apr 8, 2026
585e1b3
Implement Step 2: TurboQuant runtime with CPU reference path
michalharakal Apr 8, 2026
4bd346f
Add DSL annotations, CPU SIMD kernels, and JMH benchmarks for TurboQuant
michalharakal Apr 8, 2026
3df4f28
Add Metal backend implementation task for TurboQuant (TQ-023, TQ-024)
michalharakal Apr 8, 2026
4316565
Add TurboQuant consumer API: factories, annotation resolver, usage guide
michalharakal Apr 8, 2026
16 changes: 11 additions & 5 deletions README.md
@@ -50,11 +50,17 @@ val d = c.relu()
### GGUF Model Loading

```kotlin
-val source = SystemFileSystem.source(Path("model.gguf")).buffered()
-val reader = GGUFReader(source)
-
-val tensor = reader.tensors.first { it.name == "token_embd.weight" }
-val weights = reader.materialize(tensor)
+// Recommended: streaming reader — memory-efficient, supports quantized types
+val source = JvmRandomAccessSource.open("model.gguf")
+StreamingGGUFReader.open(source).use { reader ->
+    println("Tensors: ${reader.tensorCount}")
+
+    // Load specific tensor on demand (no whole-file loading)
+    val bytes = reader.loadTensor("token_embd.weight")
+
+    // Or get a TensorStorage descriptor with encoding/placement metadata
+    val storage = reader.loadTensorStorage("token_embd.weight")
+}
```

> **More examples:** [SKaiNET-examples](https://github.com/SKaiNET-developers/SKaiNET-examples) | [SKaiNET-notebook](https://github.com/SKaiNET-developers/SKaiNET-notebook)
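
For readers unfamiliar with the pattern, the "load a specific tensor on demand" idea in the new README snippet can be illustrated with a toy reader that seeks to a known byte range instead of reading the whole file. This is a minimal sketch under stated assumptions, not SKaiNET's implementation and not the GGUF format: `TensorSpan`, `OnDemandReader`, and the hand-built name-to-offset index are hypothetical names introduced only for illustration, and the file is a synthetic byte blob.

```kotlin
import java.io.File
import java.io.RandomAccessFile

// Hypothetical index entry: where one tensor's bytes live inside the file.
data class TensorSpan(val offset: Long, val length: Int)

// Hypothetical reader (not a SKaiNET class): keeps the file open and
// reads only the requested tensor's byte range.
class OnDemandReader(file: File, private val index: Map<String, TensorSpan>) : AutoCloseable {
    private val raf = RandomAccessFile(file, "r")

    val tensorCount: Int get() = index.size

    fun loadTensor(name: String): ByteArray {
        val span = index[name] ?: throw NoSuchElementException("Unknown tensor: $name")
        val bytes = ByteArray(span.length)
        raf.seek(span.offset)   // jump straight to the tensor data
        raf.readFully(bytes)    // read exactly span.length bytes
        return bytes
    }

    override fun close() = raf.close()
}

fun main() {
    // Synthetic file: 4 filler bytes, then two toy "tensors".
    val file = File.createTempFile("toy", ".bin")
    file.deleteOnExit()
    file.writeBytes(byteArrayOf(0, 0, 0, 0, 1, 2, 3, 10, 20))

    val index = mapOf(
        "a.weight" to TensorSpan(offset = 4, length = 3),
        "b.weight" to TensorSpan(offset = 7, length = 2),
    )

    OnDemandReader(file, index).use { reader ->
        println("Tensors: ${reader.tensorCount}")
        println(reader.loadTensor("b.weight").toList())
    }
}
```

In this sketch the index is built by hand; a real streaming reader would presumably build it from the file's header metadata. The point is only that memory use scales with the requested tensor rather than with the file size.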