SwiftCode

Code Reproduction

Posted by Treaseven on January 10, 2026

TuneContext

FittingGenerator → SpaceGeneratorFittingGenerator
EvolutionarySearch → SearchStrategyEvolutionarySearch
TuneContext → TuneContext → TuneContextInitialize

JSONDatabase → DatabaseJSONDatabase
XGBModel → FeatureExtractor → PerStoreFeature → FeatureExtractorPerStoreFeature
MeasureCallback → MeasureCallbackDefault
TaskScheduler → GradientBased → TaskSchedulerGradientBased

/src/tir/schedule/trace.cc
Array<ObjectRef> TranslateInputRVs(
    const Array<ObjectRef>& inputs,
    const std::unordered_map<const Object*, const Object*>& rv_map)
Translates the input objects of a schedule instruction according to the substitution map (rv_map).
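
As a rough illustration of what `TranslateInputRVs` does, here is a minimal Python sketch. It is not TVM's implementation: the `RV` class and `translate_input_rvs` function are hypothetical stand-ins, and the sketch models only two cases, constants that pass through unchanged and random variables that are looked up in `rv_map`. The C++ map is keyed by object pointer; this sketch keys by value instead.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class RV:
    """Hypothetical stand-in for a schedule random variable (block or loop handle)."""
    name: str


def translate_input_rvs(inputs: List[Any], rv_map: Dict[RV, RV]) -> List[Any]:
    """Return a copy of `inputs` with every RV replaced through `rv_map`.

    Constants (None, str, int, float) are copied unchanged. An RV missing
    from the map is treated as an error in this sketch.
    """
    result = []
    for item in inputs:
        if item is None or isinstance(item, (str, int, float)):
            result.append(item)  # constant: keep as-is
        elif isinstance(item, RV):
            if item not in rv_map:
                raise KeyError(f"Random variable doesn't exist: {item}")
            result.append(rv_map[item])  # RV: substitute
        else:
            raise TypeError(f"Unsupported input type: {type(item).__name__}")
    return result


old_block, new_block = RV("b0"), RV("b0_replayed")
translated = translate_input_rvs([old_block, "root", 4], {old_block: new_block})
print(translated)
```

Only the random variable is rewritten; the string and the integer are carried over untouched.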

ssh -p 25378 root@connect.bjb1.seetacloud.com 1.43512 1.36580

G1 rtx_3080 rtx4090
heron 0.03438 ms 5316s 0.015757 ms 3641
pibot 0.03384 ms 2235s 0.013777 ms 782
pytorch 0.510898 ms 1.953404 ms

G2 rtx_3080 rtx4090
heron 0.0014123 ms 2667 0.537718 ms 5100
pibot 0.0014334 ms 701 0.543329 ms 3565
pytorch 4.481762 ms 0.837625 ms

G3 rtx_3080 rtx4090
heron 0.010030 ms 1217 0.008226 ms 1236
pibot 0.009695 ms 1497 0.008218 ms 1026
pytorch 0.031238 ms 0.094816 ms

B1 rtx4090 rtx3080
heron 0.005173 ms 943 0.01365 ms 2699
pibot 0.005134 ms 881 0.01367 ms 1989
pytorch 0.042630 ms 0.083946 ms

B2 rtx4090
heron 0.005090 ms 2962 0.01402 ms 3388
pibo 0.005088 ms 1168 0.01365 ms 747
pytorch 0.024005 ms 0.050248 ms

B7 rtx4090
heron 0.053552 ms 5512 0.15041 ms 4917
pibo 0.053502 ms 3701 0.12068 ms 1323
pytorch 0.191892 ms 0.093483 ms

B8 rtx4090
heron 0.13958 ms 587
pibo 0.13949 ms 639
pytorch 0.203576 ms 0.159782 ms

B9 rtx4090
heron 0.13874 ms 1356 0.18474 ms 991
pibo 0.13735 ms 936 0.17858 ms 814
pytorch 0.19095 ms 0.166690 ms

C2 rtx3080
heron 0.040300 ms 341 0.011743 ms 5844
pibot 0.040653 ms 255 0.011532 ms 2727
pytorch 0.474093 ms 1.299625 ms

C3 rtx3080
heron 0.073979 ms 3845 0.036221 ms 2180
pibot 0.070862 ms 1400 0.039619 ms 2748
pytorch 0.120934 ms 0.121908 ms

C4
heron 0.013020 ms 735 0.007596 ms 4349
pibot 0.013155 ms 760 0.007115 ms 1260
pytorch 0.054416 ms 0.090722 ms

C5
heron 0.085828 ms 897 0.043186 ms 4888
pibot 0.088222 ms 1101 0.041430 ms 2576
pytorch 0.119524 ms 0.086970 ms

C7
heron 0.468694 ms 3939 0.336159 ms 1084
pibot 0.469949 ms 2121 0.336310 ms 580
pytorch 0.246989 ms 0.115868 ms

C8
heron 1.62734 ms 6345 0.765019 ms 14926
pibot 1.61799 ms 12566 0.722327 ms 10866
pytorch 1.438680 ms 0.760748 ms

C10
heron 1.273648 ms 5523 4.878418 ms 14446
pibot 1.232779 ms 4146 4.901200 ms 8578
pytorch 1.568065 ms 0.720168 ms

C11
heron 0.241642 ms 6217
pibot 0.202731 ms 3768
pytorch 0.930382 ms 0.441475 ms

1.309535 ms

G4 rtx_3080 rtx4090
heron 0.049233 ms 224 0.019519 ms 1912
pibot 0.049520 ms 244 0.019339 ms 1016
pytorch 0.104069 ms

G5 rtx_3080
heron 0.020117 ms 2255 0.013658 ms 926
pibot 0.019932 ms 1791 0.013728 ms 2193
pytorch 0.024846 ms

B3 rtx4090
heron 0.007050 ms 4019 0.01710 ms 664
pibo 0.006269 ms 1381 0.01710 ms 686
pytorch 0.02924 ms

B4 rtx4090
heron 0.005636 ms 1574 0.01729 ms 747
pibo 0.005790 ms 3248 0.01748 ms 932
pytorch 0.02714 ms

C9
heron
pibot
pytorch 0.804250 ms 0.355089 ms

N H W CI CO KH KW stride padding dilation
C7 16,224,224,3,64,3,3,1,1,1
C8 16,224,224,64,64,3,3,1,1,1
C10 16,112,112,128,128,3,3,1,1,1
C2 16,56,56,64,64,1,1,1,0,1
C3 16,28,28,128,128,3,3,1,1,1
C4 16,28,28,128,256,1,1,2,0,1

AMOS

_, C, H, W, K, _, R, S, _, stride, padding, dilation, _
1, 3, 224, 224, 63, 1, 3, 3, 1, 1, 1, 1, 1
1, 64, 224, 224, 64, 1, 3, 3, 1, 1, 1, 1, 1
1, 128, 112, 112, 128, 1, 3, 3, 1, 1, 1, 1, 1
1, 64, 56, 56, 64, 1, 1, 1, 1, 1, 0, 1, 1
1, 128, 28, 28, 128, 1, 3, 3, 1, 1, 1, 1, 1
1, 128, 28, 28, 256, 1, 1, 1, 1, 2, 0, 1, 1
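
The two conv2d shape layouts above carry the same information in a different order. The following Python helper is a small sketch of the reordering; `to_amos_row` is a hypothetical name, and filling every unnamed `_` field with 1 is an assumption taken from the rows listed here, not from AMOS documentation. The batch size N is dropped because it does not appear in the 13-field row.

```python
from typing import Tuple

ConvRow = Tuple[int, int, int, int, int, int, int, int, int, int]


def to_amos_row(row: ConvRow) -> Tuple[int, ...]:
    """Reorder (N, H, W, CI, CO, KH, KW, stride, padding, dilation)
    into (_, C, H, W, K, _, R, S, _, stride, padding, dilation, _)."""
    _n, h, w, ci, co, kh, kw, stride, padding, dilation = row
    # Assumption: the unnamed `_` slots are always 1, as in the listed rows.
    return (1, ci, h, w, co, 1, kh, kw, 1, stride, padding, dilation, 1)


c2 = (16, 56, 56, 64, 64, 1, 1, 1, 0, 1)
c4 = (16, 28, 28, 128, 256, 1, 1, 2, 0, 1)
print(to_amos_row(c2))
print(to_amos_row(c4))
```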

C2D AMOS Batch= 16 0.0823072016615084 0.2856333884812913 0.022220556887290806 1.5902389659442726 9.67448223 5.39098462

B1 12,512,64,512
B2 12,512,512,64
B7 16,512,768,768
B8 192,512,64,512
B9 192,512,512,64

BMM AMOS 0.07377967765776008 0.07971049660219552 0.20670990629139072 0.3435732154867257 0.6294092247992863

G1 1024,1024,1024
G2 4096,4096,4096
G3 32,2048,1000

GEMM AMOS
Cost of gemm-float16-float16-layer-(32, 2048, 1000) is 0.037194 ms
0.07731810187079802 1.9382828080495358 0.03719387727609179
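
To compare latencies across GEMM shapes it can help to convert them to throughput. The sketch below assumes the usual count of 2 floating-point operations per multiply-add and that the logged cost is in milliseconds; `gemm_tflops` is a hypothetical helper, not part of AMOS. The order of the three dimensions does not matter because only their product is used.

```python
from typing import Tuple


def gemm_tflops(shape: Tuple[int, int, int], latency_ms: float) -> float:
    """Achieved TFLOP/s for a GEMM with the given three dimensions."""
    a, b, c = shape
    flops = 2 * a * b * c          # one multiply and one add per inner-product term
    seconds = latency_ms * 1e-3    # milliseconds to seconds
    return flops / seconds / 1e12


# Shape and cost taken from the logged line above.
tflops = gemm_tflops((32, 2048, 1000), 0.037194)
print(f"{tflops:.2f} TFLOP/s")
```

For the (32, 2048, 1000) case this comes out to roughly 3.5 TFLOP/s.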

16, 224, 224, 3, 64, 3, 3, 1, 1, 1
16, 224, 224, 64, 64, 3, 3, 1, 1, 1
16, 112, 112, 64, 128, 3, 3, 1, 1, 1
16, 112, 112, 128, 128, 3, 3, 1, 1, 1
16, 56, 56, 128, 256, 3, 3, 1, 1, 1
16, 56, 56, 256, 256, 3, 3, 1, 1, 1
16, 28, 28, 256, 512, 3, 3, 1, 1, 1
16, 28, 28, 512, 512, 3, 3, 1, 1, 1
16, 14, 14, 512, 512, 3, 3, 1, 1, 1
16, 7, 7, 512, 512, 3, 3, 1, 1, 1

\[\text{EI}(\mathbf{x}) = (\mu(\mathbf{x}) - f^* - \xi) \cdot \Phi(z) + \sigma(\mathbf{x}) \cdot \phi(z), \qquad z = \frac{\mu(\mathbf{x}) - f^* - \xi}{\sigma(\mathbf{x})}\]
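
A minimal Python sketch of this acquisition function, using only the standard library. It assumes the standard definition z = (μ(x) − f* − ξ) / σ(x), with Φ and φ the standard normal CDF and PDF, and it treats the problem as maximization, so a latency-minimizing tuner would pass negated latencies. The function name, the default ξ = 0.01, and the σ = 0 fallback are illustrative choices, not taken from any particular tuner.

```python
import math


def expected_improvement(mu: float, sigma: float, f_best: float, xi: float = 0.01) -> float:
    """Expected improvement of a Gaussian prediction N(mu, sigma^2) over f_best."""
    improvement = mu - f_best - xi
    if sigma <= 0.0:
        # No predictive uncertainty: the improvement is deterministic.
        return max(improvement, 0.0)
    z = improvement / sigma
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))          # Phi(z)
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)   # phi(z)
    return improvement * cdf + sigma * pdf


# A candidate predicted exactly at the incumbent still has positive EI
# because of its uncertainty.
ei = expected_improvement(mu=1.0, sigma=1.0, f_best=1.0, xi=0.0)
print(ei)
```

A larger ξ shrinks the first term and pushes the search toward points with high σ, which favors exploration.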