From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752054AbbINVlQ (ORCPT ); Mon, 14 Sep 2015 17:41:16 -0400 Received: from mx2.suse.de ([195.135.220.15]:36259 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751927AbbINVlO (ORCPT ); Mon, 14 Sep 2015 17:41:14 -0400 Date: Mon, 14 Sep 2015 14:41:05 -0700 From: Davidlohr Bueso To: Waiman Long Cc: Peter Zijlstra , Ingo Molnar , Thomas Gleixner , "H. Peter Anvin" , x86@kernel.org, linux-kernel@vger.kernel.org, Scott J Norton , Douglas Hatch Subject: Re: [PATCH v6 4/6] locking/pvqspinlock: Collect slowpath lock statistics Message-ID: <20150914214105.GH19736@linux-q0g1.site> References: <1441996658-62854-1-git-send-email-Waiman.Long@hpe.com> <1441996658-62854-5-git-send-email-Waiman.Long@hpe.com> <20150911231355.GD19736@linux-q0g1.site> <55F6E6F2.10102@hpe.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <55F6E6F2.10102@hpe.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 14 Sep 2015, Waiman Long wrote: >You can't use debugfs if we want to have per-cpu stats. We will have >to use sysfs instead. This will require more code changes. It is >certainly doable, but we have to choose between simplicity and >performance overhead. Right now, I am assuming that lock PV lockstat >is used primarily for debugging purpose and won't be enabled on >production system. If we want to have this capability in production >systems, we will certainly need to change it to per-cpu stats and use >sysfs instead. > >The original PV ticketlock code used debugfs and I was just following >its footstep. Do you think it is worthwhile to have this capability >available on production system by default? If we can prove that the overhead is small enough, and do it correctly (ie see how we do vmstats), it would be _very_ useful data to have enabled by default for debugging performance issues; methinks. But right now we have nowhere near that kind of data, not even with this atomic variant -- although I recall you did mention a workload in a previous iteration (which would be good to have in the changelog).