Simon Mo
|
9a76ef07b9
|
Add pandas and datasets for benchmarks
|
2025-06-04 06:51:59 -07:00 |
Cyrus Leung
|
01dc9a76db
|
[CI/Build][Bugfix] Ensure compatibility with transformers 4.52 (#18678)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-06-04 04:49:20 -07:00 |
Li, Jiang
|
4555143ea7
|
[CPU] V1 support for the CPU backend (#16441)
|
2025-06-03 18:43:01 -07:00 |
Lu Fang
|
d32aa2e670
|
[Bugfix] Use cmake 3.26.1 instead of 3.26 to avoid build failure (#19019)
Signed-off-by: Lu Fang <lufang@fb.com>
|
2025-06-03 00:16:17 -07:00 |
Rui Qiao
|
bdce64f236
|
[V1] Support DP with Ray (#18779)
|
2025-06-02 21:15:13 -07:00 |
Gregory Shtrasberg
|
9e6f61e8c3
|
[ROCm][Build] Clean up the ROCm build (#19040)
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
|
2025-06-02 20:47:47 -07:00 |
Luka Govedič
|
c55d804672
|
[BugFix] Pydantic part 2 (#18911)
Signed-off-by: luka <luka@neuralmagic.com>
|
2025-05-31 03:39:28 -07:00 |
Chengji Yao
|
a1cc9f33a3
|
[TPU] remove transpose ops in moe kernel (#18923)
Signed-off-by: Chengji Yao <chengjiyao@google.com>
|
2025-05-29 23:00:11 +00:00 |
Luka Govedič
|
f274581f44
|
[BugFix] Update pydantic to fix error on python 3.10 (#18852)
Signed-off-by: luka <luka@neuralmagic.com>
|
2025-05-29 03:05:46 -07:00 |
Cyrus Leung
|
4318c0559d
|
[CI/Build] Remove imports of built-in `re` (#18750)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-05-27 09:19:18 +00:00 |
Kebe
|
e0f0ff87b8
|
[Build] fix cpu build missing libtbbmalloc.so (#18744)
Signed-off-by: Kebe <mail@kebe7jun.com>
|
2025-05-27 01:03:56 -07:00 |
Cyrus Leung
|
82e2339b06
|
[Doc] Move examples and further reorganize user guide (#18666)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-05-26 07:38:04 -07:00 |
dylan
|
243eb9199f
|
[Bugfix]: handle hf-xet CAS error when loading Qwen3 weights in vLLM (#18701)
|
2025-05-26 07:10:56 -07:00 |
Feng XiaoLong
|
4fc1bf813a
|
[Bugfix] Migrate to REGEX Library to prevent catastrophic backtracking (#18454)
Signed-off-by: Crucifixion-Fxl <xmufxl@gmail.com>
Co-authored-by: Crucifixion-Fxl <xmufxl@gmail.com>
|
2025-05-23 16:16:26 -07:00 |
Kay Yan
|
7ab056c273
|
[Hardware][CPU] Update intel_extension_for_pytorch 2.7.0 and move to `requirements/cpu.txt` (#18542)
Signed-off-by: Kay Yan <kay.yan@daocloud.io>
|
2025-05-23 04:38:42 -07:00 |
Harry Mellor
|
a1fe24d961
|
Migrate docs from Sphinx to MkDocs (#18145)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-05-23 02:09:53 -07:00 |
Michael Goin
|
3b17ea26e4
|
[TPU] Re-enable the Pallas MoE kernel (#18025)
Signed-off-by: Michael Goin <mgoin64@gmail.com>
|
2025-05-20 19:52:27 -07:00 |
Dilip Gowda Bhagavan
|
23baa2180b
|
fix:Build torch wheel inline rather than picking from nightly (#18351)
Signed-off-by: Dilip Gowda Bhagavan <dilip.bhagavan@ibm.com>
|
2025-05-20 22:22:24 +00:00 |
wang.yuqi
|
86847700d7
|
[CI] Add mteb testing to test the accuracy of the embedding model (#17175)
|
2025-05-20 06:51:12 -07:00 |
汪志鹏
|
d6c86d09ae
|
Update cpu.txt (#18398)
Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com>
|
2025-05-20 10:53:23 +00:00 |
Alexei-V-Ivanov-AMD
|
566ec04c3d
|
Adding "Basic Models Test" and "Multi-Modal Models Test (Extended) 3" in AMD Pipeline (#18106)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
|
2025-05-15 08:49:23 -07:00 |
Chauncey
|
dc1a821768
|
[Feature][V1] Support `tool_choice: required` when using Xgrammar as the `StructuredOutputBackend`. (#17845)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
|
2025-05-12 23:01:31 -07:00 |
Alexei-V-Ivanov-AMD
|
3b602cdea7
|
AMD conditional all test execution // new test groups (#17556)
Signed-off-by: Alexei V. Ivanov <alexei.ivanov@amd.com>
Signed-off-by: Yida Wu <yidawu@alumni.cmu.edu>
|
2025-05-09 15:35:58 -07:00 |
Shanshan Shen
|
760e3ecc8f
|
[V1][Structured Output] Update llguidance (`>= 0.7.11`) to avoid AttributeError (no `StructTag`) (#17839)
Signed-off-by: shen-shanshan <467638484@qq.com>
|
2025-05-08 20:14:18 -07:00 |
Harry Mellor
|
e4ca6e3a99
|
Fix transient dependency error in docs build (#17848)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-05-08 03:42:03 -07:00 |
Mikhail Podvitskii
|
c747d84576
|
[Installation] OpenTelemetry version update (#17771)
Signed-off-by: Mikhail Podvitskii <podvitskiymichael@gmail.com>
|
2025-05-07 22:32:49 -07:00 |
Christian Heimes
|
1a6af1453d
|
Only depend on importlib-metadata for Python < 3.10 (#17776)
Signed-off-by: Christian Heimes <christian@python.org>
|
2025-05-07 07:51:06 -07:00 |
Satyajith Chilappagari
|
043e4c4955
|
Add NeuronxDistributedInference support, Speculative Decoding, Dynamic on-device sampling (#16357)
Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
Co-authored-by: Aaron Dou <yzdou@amazon.com>
Co-authored-by: Shashwat Srijan <sssrijan@amazon.com>
Co-authored-by: Chongming Ni <chongmni@amazon.com>
Co-authored-by: Amulya Ballakur <amulyaab@amazon.com>
Co-authored-by: Patrick Lange <patlange@amazon.com>
Co-authored-by: Elaine Zhao <elaineyz@amazon.com>
Co-authored-by: Lin Lin Pan <tailinpa@amazon.com>
Co-authored-by: Navyadhara Gogineni <navyadha@amazon.com>
Co-authored-by: Yishan McNabb <yishanm@amazon.com>
Co-authored-by: Mrinal Shukla <181322398+mrinalks@users.noreply.github.com>
|
2025-05-07 00:07:30 -07:00 |
Yang Wang
|
6de3e13413
|
Add logging for torch nightly version (#17669)
Signed-off-by: Yang Wang <elainewy@meta.com>
|
2025-05-07 00:45:51 +00:00 |
Harry Mellor
|
022afbeb4e
|
Fix doc build performance (#17748)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-05-07 00:36:41 +00:00 |
Jevin Jiang
|
621ca2c0ab
|
[TPU] Increase block size and reset block shapes (#16458)
|
2025-05-06 13:55:04 -04:00 |
Isotr0py
|
cc05b90d86
|
[Doc] Fix broken cuda installation doc rendering (#17654)
Signed-off-by: Isotr0py <2037008807@qq.com>
|
2025-05-05 17:52:40 +00:00 |
Harry Mellor
|
d6484ef3c3
|
Add full API docs and improve the UX of navigating them (#17485)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-05-03 19:42:43 -07:00 |
22quinn
|
d47b605eca
|
Update test requirements to CUDA 12.8 (#17576)
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
|
2025-05-02 21:40:15 -07:00 |
Liangfu Chen
|
22c6f6397f
|
[Neuron][Build] Require setuptools >= 77.0.3 for PEP 639 (#17603)
Signed-off-by: Liangfu Chen <liangfc@amazon.com>
|
2025-05-03 02:41:59 +00:00 |
Yang Wang
|
b8b0859b5c
|
add more pytorch related tests for torch nightly (#17422)
Signed-off-by: Yang Wang <elainewy@meta.com>
|
2025-05-02 03:29:59 -07:00 |
Cyrus Leung
|
f2e7af9b86
|
[CI/Build] Remove `awscli` dependency (#17532)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-05-01 09:20:54 -07:00 |
Russell Bryant
|
7a0a146c54
|
[Build] Require setuptools >= 77.0.3 for PEP 639 (#17389)
Signed-off-by: Russell Bryant <rbryant@redhat.com>
|
2025-04-30 23:25:36 -07:00 |
Rahul Tuli
|
200bbf92e8
|
Bump Compressed Tensors version to 0.9.4 (#17478)
Signed-off-by: Rahul Tuli <rtuli@redhat.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
|
2025-04-30 15:24:45 -07:00 |
Gregory Shtrasberg
|
584f5fb4c6
|
[Bugfix][ROCm] Restrict ray version due to a breaking release (#17480)
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
|
2025-04-30 09:59:06 -07:00 |
Kunshang Ji
|
ed6cfb90c8
|
[Hardware][Intel GPU] Upgrade to torch 2.7 (#17444)
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
Co-authored-by: Qiming Zhang <qiming1.zhang@intel.com>
|
2025-04-30 00:03:58 -07:00 |
Kunshang Ji
|
6ed9f6047e
|
[Intel GPU] [CI]Fix XPU ci, setuptools >=80.0 have build issue (#17298)
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
|
2025-04-29 22:54:10 -07:00 |
Huy Do
|
2c4f59afc3
|
Update PyTorch to 2.7.0 (#16859)
|
2025-04-29 19:08:04 -07:00 |
Aaron Pham
|
b37685afbb
|
[CI] Uses Python 3.11 for TPU (#17359)
Signed-off-by: Aaron Pham <contact@aarnphm.xyz>
|
2025-04-29 17:39:16 +00:00 |
Harry Mellor
|
4a5e13149a
|
Update docs requirements (#17379)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-04-29 11:35:47 +00:00 |
Gregory Shtrasberg
|
4464109219
|
[Build][Bugfix] Restrict setuptools version to <80 (#17320)
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
|
2025-04-29 00:17:23 -07:00 |
Agata Dobrzyniewicz
|
c48334d405
|
[Hardware][Intel-Gaudi] Update hpu-extension and update bucketing system for HPU device (#17186)
Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
|
2025-04-26 05:55:14 -07:00 |
Cyrus Leung
|
9d98ab5ec6
|
[Misc] Inline Molmo requirements (#17190)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
2025-04-25 16:41:44 +00:00 |
Harry Mellor
|
0bd7f8fca5
|
Bump Transformers to 4.51.3 (#17116)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-04-25 08:34:34 -07:00 |
Harry Mellor
|
0422ce109f
|
Add `:markdownhelp:` to `EngineArgs` docs so markdown docstrings render properly (#17124)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
2025-04-24 10:28:45 -07:00 |