Commit a7654b8
authored
File tree
628 files changed (+18640, -9807 lines changed)
- .buildkite
- scripts
- hardware_ci
- .github
- workflows
- benchmarks
- auto_tune
- kernels
- cmake
- csrc
- cpu
- moe
- marlin_moe_wna16
- quantization
- fp4
- gptq_allspark
- gptq_marlin
- docker
- docs
- benchmarking
- cli/bench/sweep
- community
- contributing
- model
- design
- features
- getting_started/installation
- mkdocs/hooks
- models
- hardware_supported_models
- serving
- examples
- offline_inference
- online_serving
- requirements
- tests
- basic_correctness
- compile
- distributed
- fullgraph
- engine
- entrypoints
- offline_mode
- openai
- tool_parsers
- pooling
- classify
- embed
- pooling
- score
- sagemaker
- kernels
- attention
- moe
- quantization
- lora
- model_executor
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- multimodal
- plugins_tests
- plugins
- lora_resolvers
- prithvi_io_processor_plugin/prithvi_io_processor
- quantization
- reasoning
- tokenization
- tokenizers_
- tool_use
- transformers_utils
- utils_
- v1
- attention
- core
- cudagraph
- determinism
- distributed
- e2e
- ec_connector/integration
- engine
- entrypoints
- llm
- openai
- kv_connector/unit
- kv_offload
- spec_decode
- tpu
- worker
- tools
- ep_kernels
- pre_commit
- vllm
- attention
- backends
- layers
- ops
- utils
- benchmarks
- sweep
- compilation
- config
- distributed
- device_communicators
- ec_transfer/ec_connector
- kv_transfer/kv_connector/v1
- lmcache_integration
- p2p
- engine
- entrypoints
- openai
- tool_parsers
- pooling
- classify
- embed
- pooling
- score
- sagemaker
- inputs
- lora
- layers
- punica_wrapper
- model_executor
- layers
- fused_moe
- mamba
- quantization
- compressed_tensors
- schemes
- kernels/mixed_precision
- quark
- schemes
- utils
- model_loader
- models
- transformers
- multimodal
- platforms
- plugins/io_processors
- profiler
- reasoning
- tokenizers
- transformers_utils
- configs
- processors
- tokenizers
- triton_utils
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- kv_offload
- worker
- metrics
- pool
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
- gpu
- sample
- spec_decode