From: kernel test robot <oliver.sang@intel.com>
To: Eric Dumazet <edumazet@google.com>
Cc: <oe-lkp@lists.linux.dev>, <lkp@intel.com>,
Linux Memory Management List <linux-mm@kvack.org>,
Jakub Kicinski <kuba@kernel.org>,
Antoine Tenart <atenart@kernel.org>, <netdev@vger.kernel.org>,
<ying.huang@intel.com>, <feng.tang@intel.com>,
<fengwei.yin@intel.com>, <oliver.sang@intel.com>
Subject: [linux-next:master] [net] fd4f101edb: stress-ng.msg.ops_per_sec 29.2% improvement
Date: Thu, 22 Feb 2024 10:00:26 +0800 [thread overview]
Message-ID: <202402220952.7141823b-oliver.sang@intel.com> (raw)
Hello,
kernel test robot noticed a 29.2% improvement of stress-ng.msg.ops_per_sec on:
commit: fd4f101edbd9f99567ab2adb1f2169579ede7c13 ("net: add exit_batch_rtnl() method")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
testcase: stress-ng
test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory
parameters:
nr_threads: 100%
testtime: 60s
test: msg
cpufreq_governor: performance
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240222/202402220952.7141823b-oliver.sang@intel.com
=========================================================================================
compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime:
gcc-12/performance/x86_64-rhel-8.3/100%/debian-11.1-x86_64-20220510.cgz/lkp-icl-2sp7/msg/stress-ng/60s
commit:
a1e55f5103 ("Merge branch 'mt7530-dsa-subdriver-improvements-act-ii'")
fd4f101edb ("net: add exit_batch_rtnl() method")
a1e55f51035e6aa6 fd4f101edbd9f99567ab2adb1f2
---------------- ---------------------------
%stddev %change %stddev
\ | \
8563778 -13.4% 7417130 ± 2% cpuidle..usage
5.49 +1.7 7.21 mpstat.cpu.all.usr%
120292 -10.2% 107984 sched_debug.cpu.nr_switches.avg
240634 -10.2% 216144 vmstat.system.cs
98456 ± 2% +5.6% 103994 vmstat.system.in
0.02 ± 2% +13.2% 0.02 ± 2% perf-sched.sch_delay.avg.ms.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.msgctl_down
0.34 ± 39% +930.0% 3.47 ±140% perf-sched.sch_delay.max.ms.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
188902 -11.5% 167109 ± 2% perf-sched.wait_and_delay.count.schedule_preempt_disabled.rwsem_down_read_slowpath.down_read.msgctl_info.constprop
681475 ± 5% +42.1% 968696 ± 4% numa-numastat.node0.local_node
717355 ± 3% +41.1% 1012165 ± 4% numa-numastat.node0.numa_hit
734494 ± 4% +37.8% 1012229 ± 7% numa-numastat.node1.local_node
764874 ± 2% +35.3% 1035011 ± 7% numa-numastat.node1.numa_hit
717320 ± 3% +41.1% 1012083 ± 4% numa-vmstat.node0.numa_hit
681441 ± 5% +42.1% 968614 ± 4% numa-vmstat.node0.numa_local
765007 ± 2% +35.3% 1035046 ± 7% numa-vmstat.node1.numa_hit
734627 ± 4% +37.8% 1012264 ± 7% numa-vmstat.node1.numa_local
49.33 ± 16% -28.0% 35.50 ± 7% perf-c2c.DRAM.local
10066 ± 5% +21.5% 12228 ± 2% perf-c2c.DRAM.remote
19015 ± 5% +14.0% 21668 ± 3% perf-c2c.HITM.local
7769 ± 5% +24.4% 9664 ± 2% perf-c2c.HITM.remote
26784 ± 5% +17.0% 31333 ± 2% perf-c2c.HITM.total
38307 +2.5% 39280 proc-vmstat.nr_slab_unreclaimable
1483925 +38.0% 2048531 ± 4% proc-vmstat.numa_hit
1417665 +39.8% 1982280 ± 5% proc-vmstat.numa_local
1528928 +37.1% 2096049 ± 4% proc-vmstat.pgalloc_normal
1319628 +43.1% 1888225 ± 4% proc-vmstat.pgfree
7.289e+08 +29.2% 9.419e+08 stress-ng.msg.ops
12147934 +29.2% 15698485 stress-ng.msg.ops_per_sec
6837 ± 4% +28.9% 8816 ± 7% stress-ng.time.involuntary_context_switches
3798 +5.5% 4007 stress-ng.time.percent_of_cpu_this_job_got
2105 +2.9% 2166 stress-ng.time.system_time
179.40 +36.0% 243.99 stress-ng.time.user_time
7981626 -9.8% 7198840 stress-ng.time.voluntary_context_switches
0.03 ± 11% +0.0 0.05 ± 14% turbostat.C1%
8400990 -13.6% 7255025 ± 2% turbostat.C1E
35.76 -3.3 32.44 ± 2% turbostat.C1E%
0.10 +20.0% 0.12 turbostat.IPC
7496 -14.0% 6449 ± 2% turbostat.POLL
209.64 +3.0% 215.93 turbostat.PkgWatt
60.82 +2.4% 62.30 turbostat.RAMWatt
1.42 +6.5% 1.51 ± 2% perf-stat.i.MPKI
6.984e+09 +25.4% 8.757e+09 perf-stat.i.branch-instructions
33123635 ± 3% +19.3% 39505381 ± 3% perf-stat.i.branch-misses
25.73 +2.1 27.86 perf-stat.i.cache-miss-rate%
53181495 +33.9% 71212733 perf-stat.i.cache-misses
2.069e+08 +23.7% 2.559e+08 perf-stat.i.cache-references
253077 -10.6% 226239 perf-stat.i.context-switches
3.32 -16.5% 2.78 perf-stat.i.cpi
1.243e+11 +5.1% 1.307e+11 perf-stat.i.cpu-cycles
64512 -10.9% 57474 perf-stat.i.cpu-migrations
2344 -21.6% 1838 perf-stat.i.cycles-between-cache-misses
9.334e+09 +25.6% 1.172e+10 perf-stat.i.dTLB-loads
0.00 ± 3% -0.0 0.00 ± 4% perf-stat.i.dTLB-store-miss-rate%
5.75e+09 +26.7% 7.283e+09 perf-stat.i.dTLB-stores
3.748e+10 +25.7% 4.711e+10 perf-stat.i.instructions
0.30 +19.6% 0.36 perf-stat.i.ipc
1.94 +5.1% 2.04 perf-stat.i.metric.GHz
838.40 +33.4% 1118 perf-stat.i.metric.K/sec
347.81 +25.7% 437.34 perf-stat.i.metric.M/sec
34306768 +36.5% 46843727 perf-stat.i.node-load-misses
68.86 +1.2 70.02 perf-stat.i.node-store-miss-rate%
12534831 +32.0% 16551293 perf-stat.i.node-store-misses
5731642 +24.3% 7124000 perf-stat.i.node-stores
1.42 +6.6% 1.51 ± 2% perf-stat.overall.MPKI
25.67 +2.1 27.80 perf-stat.overall.cache-miss-rate%
3.32 -16.4% 2.78 perf-stat.overall.cpi
2338 -21.5% 1835 perf-stat.overall.cycles-between-cache-misses
0.00 ± 3% -0.0 0.00 ± 3% perf-stat.overall.dTLB-store-miss-rate%
0.30 +19.6% 0.36 perf-stat.overall.ipc
68.65 +1.3 69.93 perf-stat.overall.node-store-miss-rate%
6.862e+09 +25.4% 8.603e+09 perf-stat.ps.branch-instructions
32461333 ± 2% +19.3% 38712892 ± 3% perf-stat.ps.branch-misses
52246451 +33.9% 69968172 perf-stat.ps.cache-misses
2.035e+08 +23.7% 2.517e+08 perf-stat.ps.cache-references
248804 -10.6% 222334 perf-stat.ps.context-switches
1.222e+11 +5.1% 1.284e+11 perf-stat.ps.cpu-cycles
63409 -10.9% 56477 perf-stat.ps.cpu-migrations
9.174e+09 +25.5% 1.152e+10 perf-stat.ps.dTLB-loads
5.651e+09 +26.7% 7.158e+09 perf-stat.ps.dTLB-stores
3.683e+10 +25.7% 4.628e+10 perf-stat.ps.instructions
33708514 +36.6% 46037634 perf-stat.ps.node-load-misses
12318130 +32.1% 16266427 perf-stat.ps.node-store-misses
5625564 +24.3% 6992913 perf-stat.ps.node-stores
2.225e+12 +25.9% 2.801e+12 perf-stat.total.instructions
11.56 -10.6 0.92 ± 5% perf-profile.calltrace.cycles-pp.idr_find.ipc_obtain_object_check.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
15.36 -9.1 6.26 perf-profile.calltrace.cycles-pp.ipc_obtain_object_check.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
10.06 -9.0 1.02 ± 10% perf-profile.calltrace.cycles-pp.idr_find.ipc_obtain_object_check.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
14.53 -7.7 6.84 ± 5% perf-profile.calltrace.cycles-pp.ipc_obtain_object_check.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
7.23 -4.0 3.18 ± 56% perf-profile.calltrace.cycles-pp.msgctl.stress_msg
7.15 -4.0 3.12 ± 56% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl.stress_msg
7.16 -4.0 3.12 ± 56% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.msgctl.stress_msg
7.06 -4.0 3.06 ± 56% perf-profile.calltrace.cycles-pp.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl.stress_msg
4.18 -2.0 2.21 ± 9% perf-profile.calltrace.cycles-pp.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl
1.52 -1.2 0.35 ± 70% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__percpu_counter_sum.msgctl_info.ksys_msgctl
1.66 -1.0 0.63 ± 11% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__percpu_counter_sum.msgctl_info.ksys_msgctl.do_syscall_64
2.53 -1.0 1.54 ± 9% perf-profile.calltrace.cycles-pp.msgctl_down.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe.msgctl
2.38 -0.9 1.50 ± 10% perf-profile.calltrace.cycles-pp.__percpu_counter_sum.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.38 ± 2% -0.8 0.60 ± 9% perf-profile.calltrace.cycles-pp.down_read.msgctl_info.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.61 -0.7 0.90 ± 10% perf-profile.calltrace.cycles-pp.down_write.msgctl_down.ksys_msgctl.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.16 -0.7 0.49 ± 45% perf-profile.calltrace.cycles-pp.rwsem_down_read_slowpath.down_read.msgctl_info.ksys_msgctl.do_syscall_64
1.50 -0.6 0.86 ± 10% perf-profile.calltrace.cycles-pp.rwsem_down_write_slowpath.down_write.msgctl_down.ksys_msgctl.do_syscall_64
2.79 -0.5 2.27 ± 2% perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
2.74 -0.5 2.23 ± 2% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
2.74 -0.5 2.23 ± 3% perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify
2.74 -0.5 2.23 ± 2% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
1.82 -0.4 1.47 ± 3% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
1.67 -0.3 1.34 ± 3% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
1.69 -0.3 1.36 ± 3% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
1.28 -0.3 1.01 ± 3% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
0.68 -0.3 0.41 ± 71% perf-profile.calltrace.cycles-pp.schedule.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.msgctl_down
0.68 -0.3 0.40 ± 71% perf-profile.calltrace.cycles-pp.__schedule.schedule.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write
0.68 -0.3 0.41 ± 71% perf-profile.calltrace.cycles-pp.schedule_preempt_disabled.rwsem_down_write_slowpath.down_write.msgctl_down.ksys_msgctl
0.73 +0.2 0.88 ± 9% perf-profile.calltrace.cycles-pp.__check_object_size.load_msg.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.32 +0.2 2.54 ± 3% perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.__kmalloc.alloc_msg.load_msg.do_msgsnd
1.10 ± 8% +0.4 1.48 ± 5% perf-profile.calltrace.cycles-pp.wake_up_q.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
0.79 ± 2% +0.5 1.30 ± 8% perf-profile.calltrace.cycles-pp.___slab_alloc.__kmalloc.alloc_msg.load_msg.do_msgsnd
0.09 ±223% +0.5 0.60 ± 4% perf-profile.calltrace.cycles-pp.__put_user_8.do_msg_fill.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.08 ±223% +0.5 0.60 ± 3% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.__libc_msgrcv
1.94 ± 10% +0.5 2.48 ± 7% perf-profile.calltrace.cycles-pp.check_heap_object.__check_object_size.store_msg.do_msg_fill.do_msgrcv
3.22 +0.7 3.93 ± 5% perf-profile.calltrace.cycles-pp.percpu_counter_add_batch.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
2.14 ± 6% +0.7 2.86 ± 5% perf-profile.calltrace.cycles-pp.stress_msg_receiver
3.09 ± 4% +0.7 3.82 ± 3% perf-profile.calltrace.cycles-pp.__slab_free.kfree.free_msg.do_msgrcv.do_syscall_64
3.06 ± 5% +0.8 3.87 ± 2% perf-profile.calltrace.cycles-pp.__check_object_size.store_msg.do_msg_fill.do_msgrcv.do_syscall_64
0.78 +0.8 1.62 ± 54% perf-profile.calltrace.cycles-pp._copy_from_user.load_msg.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.96 ± 5% +0.8 3.81 ± 2% perf-profile.calltrace.cycles-pp.__memcg_slab_free_hook.kfree.free_msg.do_msgrcv.do_syscall_64
3.11 ± 5% +0.9 4.04 perf-profile.calltrace.cycles-pp.percpu_counter_add_batch.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
3.88 ± 5% +1.0 4.87 perf-profile.calltrace.cycles-pp._copy_to_user.store_msg.do_msg_fill.do_msgrcv.do_syscall_64
4.68 +1.3 5.95 perf-profile.calltrace.cycles-pp.__kmalloc.alloc_msg.load_msg.do_msgsnd.do_syscall_64
5.33 +1.4 6.77 perf-profile.calltrace.cycles-pp.alloc_msg.load_msg.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
6.67 +1.7 8.33 ± 2% perf-profile.calltrace.cycles-pp.kfree.free_msg.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.41 +1.7 9.08 perf-profile.calltrace.cycles-pp.store_msg.do_msg_fill.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
7.27 +1.8 9.10 ± 2% perf-profile.calltrace.cycles-pp.free_msg.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
8.02 +1.8 9.86 perf-profile.calltrace.cycles-pp.do_msg_fill.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
4.66 +1.9 6.59 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.38 ± 9% +1.9 4.31 ± 3% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe
3.24 ± 5% +2.0 5.22 ± 2% perf-profile.calltrace.cycles-pp._raw_spin_lock.do_msgrcv.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgrcv
5.57 +2.2 7.77 ± 2% perf-profile.calltrace.cycles-pp._raw_spin_lock.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
7.03 +2.5 9.56 ± 10% perf-profile.calltrace.cycles-pp.load_msg.do_msgsnd.do_syscall_64.entry_SYSCALL_64_after_hwframe.__libc_msgsnd
21.82 -19.6 2.26 ± 3% perf-profile.children.cycles-pp.idr_find
30.06 -16.9 13.21 ± 3% perf-profile.children.cycles-pp.ipc_obtain_object_check
7.28 -2.8 4.44 ± 2% perf-profile.children.cycles-pp.msgctl
7.07 -2.8 4.24 ± 2% perf-profile.children.cycles-pp.ksys_msgctl
87.96 -2.5 85.49 perf-profile.children.cycles-pp.do_syscall_64
88.53 -2.2 86.32 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
4.19 -1.9 2.33 ± 2% perf-profile.children.cycles-pp.msgctl_info
2.19 -1.3 0.87 ± 4% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
2.53 -0.9 1.62 ± 3% perf-profile.children.cycles-pp.msgctl_down
2.39 -0.8 1.59 ± 3% perf-profile.children.cycles-pp.__percpu_counter_sum
43.44 -0.8 42.65 perf-profile.children.cycles-pp.do_msgrcv
1.47 ± 2% -0.7 0.73 ± 4% perf-profile.children.cycles-pp.down_read
1.61 -0.7 0.94 ± 5% perf-profile.children.cycles-pp.down_write
1.50 -0.6 0.90 ± 5% perf-profile.children.cycles-pp.rwsem_down_write_slowpath
0.67 -0.5 0.12 ± 7% perf-profile.children.cycles-pp._raw_spin_lock_irq
1.23 -0.5 0.70 ± 5% perf-profile.children.cycles-pp.rwsem_down_read_slowpath
2.79 -0.5 2.26 ± 2% perf-profile.children.cycles-pp.do_idle
2.79 -0.5 2.27 ± 2% perf-profile.children.cycles-pp.cpu_startup_entry
2.79 -0.5 2.27 ± 2% perf-profile.children.cycles-pp.secondary_startup_64_no_verify
2.74 -0.5 2.23 ± 3% perf-profile.children.cycles-pp.start_secondary
0.88 -0.4 0.51 perf-profile.children.cycles-pp.rwsem_wake
1.85 -0.4 1.50 ± 3% perf-profile.children.cycles-pp.cpuidle_idle_call
1.72 -0.3 1.38 ± 3% perf-profile.children.cycles-pp.cpuidle_enter
1.71 -0.3 1.38 ± 3% perf-profile.children.cycles-pp.cpuidle_enter_state
0.42 ± 2% -0.3 0.09 ± 4% perf-profile.children.cycles-pp.up_read
45.44 -0.3 45.12 perf-profile.children.cycles-pp.__libc_msgrcv
1.30 -0.3 1.03 ± 4% perf-profile.children.cycles-pp.intel_idle
0.89 -0.3 0.63 perf-profile.children.cycles-pp.try_to_wake_up
0.69 -0.2 0.46 perf-profile.children.cycles-pp.up_write
2.50 -0.2 2.28 ± 6% perf-profile.children.cycles-pp.__schedule
0.42 -0.2 0.20 ± 4% perf-profile.children.cycles-pp.rwsem_optimistic_spin
2.23 -0.2 2.05 ± 7% perf-profile.children.cycles-pp.schedule
1.34 ± 2% -0.2 1.16 ± 7% perf-profile.children.cycles-pp.schedule_preempt_disabled
0.58 -0.1 0.48 ± 3% perf-profile.children.cycles-pp.flush_smp_call_function_queue
0.57 -0.1 0.48 ± 3% perf-profile.children.cycles-pp.__flush_smp_call_function_queue
0.26 ± 2% -0.1 0.17 ± 4% perf-profile.children.cycles-pp.msgctl_stat
0.14 ± 2% -0.1 0.06 ± 6% perf-profile.children.cycles-pp.rwsem_spin_on_owner
0.15 ± 5% -0.1 0.07 ± 5% perf-profile.children.cycles-pp.osq_lock
0.48 -0.1 0.41 ± 3% perf-profile.children.cycles-pp.sched_ttwu_pending
0.44 -0.1 0.37 ± 3% perf-profile.children.cycles-pp.activate_task
0.42 -0.1 0.36 ± 4% perf-profile.children.cycles-pp.ttwu_do_activate
0.36 -0.1 0.31 ± 3% perf-profile.children.cycles-pp.enqueue_task_fair
0.32 ± 2% -0.0 0.27 ± 2% perf-profile.children.cycles-pp.select_task_rq
0.29 ± 2% -0.0 0.24 ± 3% perf-profile.children.cycles-pp.schedule_idle
0.31 ± 3% -0.0 0.26 ± 2% perf-profile.children.cycles-pp.select_task_rq_fair
0.28 -0.0 0.24 ± 5% perf-profile.children.cycles-pp.enqueue_entity
0.37 ± 2% -0.0 0.32 ± 2% perf-profile.children.cycles-pp.dequeue_task_fair
0.30 ± 3% -0.0 0.26 ± 3% perf-profile.children.cycles-pp.dequeue_entity
0.41 -0.0 0.38 ± 2% perf-profile.children.cycles-pp.update_load_avg
0.13 ± 5% -0.0 0.10 ± 4% perf-profile.children.cycles-pp.select_idle_cpu
0.16 ± 3% -0.0 0.13 ± 4% perf-profile.children.cycles-pp.select_idle_sibling
0.14 -0.0 0.12 ± 4% perf-profile.children.cycles-pp.available_idle_cpu
0.14 ± 3% -0.0 0.12 ± 8% perf-profile.children.cycles-pp.__smp_call_single_queue
0.09 ± 4% -0.0 0.07 ± 5% perf-profile.children.cycles-pp.switch_fpu_return
0.14 ± 3% -0.0 0.12 ± 4% perf-profile.children.cycles-pp.wake_affine
0.08 ± 5% -0.0 0.07 ± 7% perf-profile.children.cycles-pp.select_idle_core
0.10 ± 5% -0.0 0.08 ± 5% perf-profile.children.cycles-pp.task_h_load
0.08 -0.0 0.07 ± 7% perf-profile.children.cycles-pp.switch_mm_irqs_off
0.07 ± 7% -0.0 0.05 ± 8% perf-profile.children.cycles-pp.sched_mm_cid_migrate_to
0.10 ± 4% -0.0 0.09 perf-profile.children.cycles-pp.update_curr
0.08 ± 6% +0.0 0.09 ± 4% perf-profile.children.cycles-pp.amd_clear_divider
0.08 ± 7% +0.0 0.10 ± 5% perf-profile.children.cycles-pp.security_msg_queue_msgrcv
0.06 +0.0 0.08 ± 7% perf-profile.children.cycles-pp.security_msg_queue_msgsnd
0.11 ± 3% +0.0 0.13 ± 3% perf-profile.children.cycles-pp.format_decode
0.15 ± 2% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.check_stack_object
0.06 ± 7% +0.0 0.09 ± 7% perf-profile.children.cycles-pp.allocate_slab
0.10 ± 4% +0.0 0.13 ± 3% perf-profile.children.cycles-pp.number
0.04 ± 44% +0.0 0.07 ± 9% perf-profile.children.cycles-pp.shuffle_freelist
0.11 ± 6% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.__cond_resched
0.10 ± 4% +0.0 0.14 ± 3% perf-profile.children.cycles-pp.is_vmalloc_addr
0.18 ± 4% +0.0 0.22 ± 4% perf-profile.children.cycles-pp.task_tick_fair
0.14 ± 5% +0.0 0.18 ± 5% perf-profile.children.cycles-pp.security_ipc_permission
0.18 ± 2% +0.1 0.23 ± 4% perf-profile.children.cycles-pp.refill_obj_stock
0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.sysvec_call_function_single
0.21 ± 8% +0.1 0.26 ± 3% perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.get_partial_node
0.68 +0.1 0.76 perf-profile.children.cycles-pp.seq_read
0.68 +0.1 0.76 perf-profile.children.cycles-pp.seq_read_iter
0.69 +0.1 0.76 perf-profile.children.cycles-pp.ksys_read
0.68 +0.1 0.76 perf-profile.children.cycles-pp.vfs_read
0.70 +0.1 0.77 perf-profile.children.cycles-pp.read
0.31 ± 2% +0.1 0.40 perf-profile.children.cycles-pp.vsnprintf
0.33 ± 2% +0.1 0.42 ± 2% perf-profile.children.cycles-pp.sysvipc_msg_proc_show
0.31 ± 2% +0.1 0.40 ± 2% perf-profile.children.cycles-pp.seq_printf
0.36 ± 2% +0.1 0.45 ± 2% perf-profile.children.cycles-pp.ipcperms
0.40 ± 4% +0.1 0.49 ± 5% perf-profile.children.cycles-pp.__check_heap_object
0.38 ± 3% +0.1 0.49 ± 3% perf-profile.children.cycles-pp.__virt_addr_valid
0.41 ± 2% +0.1 0.52 ± 2% perf-profile.children.cycles-pp.syscall_return_via_sysret
0.67 ± 2% +0.1 0.80 perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.45 +0.1 0.58 ± 2% perf-profile.children.cycles-pp.__get_user_8
0.36 +0.1 0.49 ± 3% perf-profile.children.cycles-pp.ss_wakeup
0.61 +0.1 0.75 ± 4% perf-profile.children.cycles-pp.obj_cgroup_charge
0.52 +0.1 0.67 ± 2% perf-profile.children.cycles-pp.security_msg_msg_free
0.51 +0.1 0.66 perf-profile.children.cycles-pp.__x64_sys_msgsnd
0.45 ± 2% +0.1 0.60 ± 2% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
0.65 +0.2 0.82 perf-profile.children.cycles-pp.__put_user_8
0.77 ± 2% +0.2 0.98 perf-profile.children.cycles-pp.mod_objcg_state
2.40 +0.2 2.61 perf-profile.children.cycles-pp.wake_up_q
0.92 +0.2 1.14 ± 2% perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
1.24 +0.3 1.56 perf-profile.children.cycles-pp.entry_SYSCALL_64
2.35 +0.3 2.70 ± 4% perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
2.68 +0.6 3.25 ± 3% perf-profile.children.cycles-pp.check_heap_object
0.79 ± 2% +0.6 1.38 perf-profile.children.cycles-pp.___slab_alloc
3.18 +0.7 3.84 ± 3% perf-profile.children.cycles-pp.__slab_free
34.94 +0.7 35.63 perf-profile.children.cycles-pp.do_msgsnd
3.07 +0.8 3.87 perf-profile.children.cycles-pp.__memcg_slab_free_hook
0.85 +0.9 1.75 ± 54% perf-profile.children.cycles-pp._copy_from_user
2.68 +0.9 3.61 perf-profile.children.cycles-pp.stress_msg_receiver
4.08 +0.9 5.02 perf-profile.children.cycles-pp._copy_to_user
4.13 +0.9 5.08 perf-profile.children.cycles-pp.__check_object_size
2.74 +1.0 3.73 perf-profile.children.cycles-pp.stress_msg_sender
5.13 +1.4 6.50 perf-profile.children.cycles-pp.__kmalloc
5.35 +1.4 6.80 perf-profile.children.cycles-pp.alloc_msg
6.45 +1.7 8.11 perf-profile.children.cycles-pp.percpu_counter_add_batch
7.46 +1.7 9.13 perf-profile.children.cycles-pp.store_msg
38.25 +1.7 39.94 perf-profile.children.cycles-pp.__libc_msgsnd
7.15 +1.8 8.91 ± 2% perf-profile.children.cycles-pp.kfree
7.37 +1.9 9.25 ± 2% perf-profile.children.cycles-pp.free_msg
8.28 +1.9 10.18 perf-profile.children.cycles-pp.do_msg_fill
9.52 +2.1 11.65 ± 2% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
7.23 +2.6 9.84 ± 9% perf-profile.children.cycles-pp.load_msg
9.51 +4.1 13.62 ± 2% perf-profile.children.cycles-pp._raw_spin_lock
21.66 -19.4 2.21 ± 3% perf-profile.self.cycles-pp.idr_find
1.30 -0.3 1.03 ± 4% perf-profile.self.cycles-pp.intel_idle
0.50 -0.2 0.29 ± 4% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.16 ± 3% -0.1 0.04 ± 45% perf-profile.self.cycles-pp._raw_spin_lock_irq
0.17 ± 4% -0.1 0.07 perf-profile.self.cycles-pp.rwsem_down_read_slowpath
0.15 ± 5% -0.1 0.07 ± 5% perf-profile.self.cycles-pp.osq_lock
0.06 ± 7% -0.0 0.02 ± 99% perf-profile.self.cycles-pp.rwsem_mark_wake
0.08 ± 6% -0.0 0.04 ± 44% perf-profile.self.cycles-pp.rwsem_down_write_slowpath
0.12 ± 4% -0.0 0.09 ± 5% perf-profile.self.cycles-pp.rwsem_optimistic_spin
0.14 ± 3% -0.0 0.12 ± 4% perf-profile.self.cycles-pp.available_idle_cpu
0.14 ± 3% -0.0 0.12 ± 4% perf-profile.self.cycles-pp.__schedule
0.10 ± 5% -0.0 0.08 ± 5% perf-profile.self.cycles-pp.task_h_load
0.08 -0.0 0.07 ± 7% perf-profile.self.cycles-pp.switch_mm_irqs_off
0.07 ± 7% -0.0 0.05 ± 8% perf-profile.self.cycles-pp.sched_mm_cid_migrate_to
0.06 ± 7% -0.0 0.05 ± 8% perf-profile.self.cycles-pp.enqueue_entity
0.06 ± 7% +0.0 0.08 ± 7% perf-profile.self.cycles-pp.security_msg_queue_msgrcv
0.07 ± 5% +0.0 0.09 ± 4% perf-profile.self.cycles-pp.__cond_resched
0.07 ± 6% +0.0 0.09 ± 5% perf-profile.self.cycles-pp.__x64_sys_msgsnd
0.08 ± 5% +0.0 0.10 ± 4% perf-profile.self.cycles-pp.format_decode
0.08 +0.0 0.10 ± 5% perf-profile.self.cycles-pp.is_vmalloc_addr
0.06 +0.0 0.08 ± 10% perf-profile.self.cycles-pp.security_msg_msg_free
0.10 ± 3% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.alloc_msg
0.04 ± 44% +0.0 0.06 ± 7% perf-profile.self.cycles-pp.security_msg_queue_msgsnd
0.10 ± 5% +0.0 0.12 ± 3% perf-profile.self.cycles-pp.number
0.12 ± 3% +0.0 0.15 ± 3% perf-profile.self.cycles-pp.check_stack_object
0.08 ± 5% +0.0 0.11 ± 3% perf-profile.self.cycles-pp.vsnprintf
0.13 ± 3% +0.0 0.16 ± 3% perf-profile.self.cycles-pp.store_msg
0.12 ± 4% +0.0 0.15 ± 5% perf-profile.self.cycles-pp.security_ipc_permission
0.18 ± 9% +0.0 0.23 ± 2% perf-profile.self.cycles-pp.syscall_exit_to_user_mode_prepare
0.17 ± 4% +0.0 0.21 ± 4% perf-profile.self.cycles-pp.refill_obj_stock
0.21 +0.1 0.26 ± 3% perf-profile.self.cycles-pp.obj_cgroup_charge
0.32 ± 2% +0.1 0.40 perf-profile.self.cycles-pp.entry_SYSCALL_64
0.33 ± 2% +0.1 0.41 ± 2% perf-profile.self.cycles-pp.ipcperms
0.28 ± 4% +0.1 0.36 ± 2% perf-profile.self.cycles-pp.syscall_exit_to_user_mode
0.37 ± 3% +0.1 0.46 ± 5% perf-profile.self.cycles-pp.__check_heap_object
0.28 ± 2% +0.1 0.38 ± 3% perf-profile.self.cycles-pp.load_msg
0.35 ± 2% +0.1 0.45 ± 4% perf-profile.self.cycles-pp.__virt_addr_valid
0.36 ± 3% +0.1 0.47 ± 3% perf-profile.self.cycles-pp.syscall_return_via_sysret
0.41 +0.1 0.52 perf-profile.self.cycles-pp.do_msg_fill
0.42 +0.1 0.53 ± 2% perf-profile.self.cycles-pp.__put_user_8
0.58 ± 3% +0.1 0.69 perf-profile.self.cycles-pp.do_syscall_64
0.16 ± 2% +0.1 0.28 ± 6% perf-profile.self.cycles-pp.free_msg
0.34 +0.1 0.46 ± 3% perf-profile.self.cycles-pp.ss_wakeup
0.42 +0.1 0.54 ± 2% perf-profile.self.cycles-pp.__get_user_8
0.51 +0.1 0.64 ± 2% perf-profile.self.cycles-pp.__libc_msgrcv
0.51 ± 2% +0.1 0.64 perf-profile.self.cycles-pp.__libc_msgsnd
0.45 ± 2% +0.2 0.60 ± 2% perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
0.67 +0.2 0.87 ± 3% perf-profile.self.cycles-pp.__percpu_counter_sum
0.72 ± 2% +0.2 0.92 perf-profile.self.cycles-pp.mod_objcg_state
0.89 +0.2 1.11 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
0.96 +0.3 1.21 perf-profile.self.cycles-pp.__check_object_size
0.59 +0.3 0.86 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.90 +0.3 1.18 perf-profile.self.cycles-pp.kfree
2.08 +0.3 2.36 ± 4% perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook
1.56 +0.3 1.90 ± 2% perf-profile.self.cycles-pp.__kmalloc
2.18 +0.4 2.61 ± 3% perf-profile.self.cycles-pp.check_heap_object
2.20 +0.4 2.64 ± 3% perf-profile.self.cycles-pp._raw_spin_lock
1.47 +0.5 1.94 perf-profile.self.cycles-pp.wake_up_q
0.68 +0.5 1.21 perf-profile.self.cycles-pp.___slab_alloc
2.32 +0.6 2.90 ± 2% perf-profile.self.cycles-pp.__memcg_slab_free_hook
3.10 +0.6 3.73 ± 4% perf-profile.self.cycles-pp.__slab_free
0.81 +0.9 1.69 ± 56% perf-profile.self.cycles-pp._copy_from_user
2.64 +0.9 3.56 perf-profile.self.cycles-pp.stress_msg_receiver
4.04 +0.9 4.96 perf-profile.self.cycles-pp._copy_to_user
2.69 +1.0 3.67 perf-profile.self.cycles-pp.stress_msg_sender
3.02 +1.1 4.11 perf-profile.self.cycles-pp.do_msgrcv
6.32 +1.6 7.95 perf-profile.self.cycles-pp.percpu_counter_add_batch
9.42 +2.1 11.52 ± 2% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
6.63 +2.6 9.21 perf-profile.self.cycles-pp.ipc_obtain_object_check
3.75 +2.7 6.46 perf-profile.self.cycles-pp.do_msgsnd
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
reply other threads:[~2024-02-22 2:00 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202402220952.7141823b-oliver.sang@intel.com \
--to=oliver.sang@intel.com \
--cc=atenart@kernel.org \
--cc=edumazet@google.com \
--cc=feng.tang@intel.com \
--cc=fengwei.yin@intel.com \
--cc=kuba@kernel.org \
--cc=linux-mm@kvack.org \
--cc=lkp@intel.com \
--cc=netdev@vger.kernel.org \
--cc=oe-lkp@lists.linux.dev \
--cc=ying.huang@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).