android_kernel_lge_bullhead

Commit Graph

Author	SHA1	Message	Date
Linux Build Service Account	4e4ff5940f	Merge "cpufreq: Interactive: Implement per policy instances of governor"	2014-05-11 06:43:22 -07:00
Linux Build Service Account	81e0b11676	Merge "cpufreq: interactive: Move definition of cpufreq_gov_interactive downwards"	2014-05-11 06:43:21 -07:00
Linux Build Service Account	1ed610bcd4	Merge "cpufreq: interactive: Remove unnecessary cpu_online() check"	2014-05-11 06:43:20 -07:00
Linux Build Service Account	0afd9d2ff6	Merge "qcom-cpufreq: Fix frequency table registration"	2014-05-11 01:24:33 -07:00
Minsung Kim	ababac55b2	cpufreq: interactive: fix NULL pointer dereference at sysfs ops sysfs ops for target_loads and above_hispeed_delay can be called before initializing tunables at CPUFREQ_GOV_POLICY_INIT. Create sysfs entries after initialization. Change-Id: I50356198d7629731c0d32a3066d61fe8354e0001 Signed-off-by: Minsung Kim <ms925.kim@samsung.com> Git-commit: 0ac276ebfca1d405153f4a3476aa1f7f66bbbec8 Git-Repo: https://android.googlesource.com/kernel/common.git [junjiew@codeaurora.org: Resolve merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-09 17:58:22 -07:00
Chih-Wei Huang	4b3ea78abc	cpufreq: interactive: fix compiling warnings The gcc warns like: cpufreq_interactive.c:745:6: warning: operation on 'ret' may be undefined [-Wsequence-point] It was introduced by commit cf0fad49d17cb8273ce555dd5b7afab67d7923bf. Since sprintf(...) just return 1 (one character) in this case, ret should not changed. Just discarding the result of sprintf(...) leads to the result that the committer of cf0fad49d17cb8273ce555dd5b7afab67d7923bf wants. Change-Id: Ifed1cef6d6a31c3ed23dad03a567b3b9eddf3a57 Signed-off-by: Chih-Wei Huang <cwhuang@android-x86.org> Git-commit: 0715d10b9e6c04327adb189e13ecd6a3c2df48ce Git-Repo: https://android.googlesource.com/kernel/common.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-09 17:58:21 -07:00
Viresh Kumar	d956226a7c	cpufreq: Interactive: Implement per policy instances of governor If we have a multi-package system, where we have multiple instances of struct policy (per package), currently we can't have multiple instances of same governor. i.e. We can't have multiple instances of Interactive governor for multiple packages. This is a bottleneck for multicluster system, where we want different packages to use Interactive governor, but with different tunables. This patch uses the infrastructure provided by earlier patches pushed in Mainline in v3.10-rc1/rc2 and implements per policy instances of Interactive governor. Change-Id: I70436d4a5a45c6cb6edf37f3e46d0b9fbc930982 [toddpoynor@google.com: merge with later code, minor changes] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: 65f53ba0525cf92f397b22aea94ee637542a6757 Git-repo: https://android.googlesource.com/kernel/common/ [junjiew@codeaurora.org: resolved many conflicts to keep our previous modifications to interactive governor] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-09 17:58:21 -07:00
Viresh Kumar	6c59f48ad4	cpufreq: interactive: Move definition of cpufreq_gov_interactive downwards This moves definition of cpufreq_gov_interactive towards the bottom of file, so that we don't have to add prototype of cpufreq_governor_interactive() in the beginning of file. Change-Id: I04bd1004954eb36502c5cd7e35d3d7274cddaf95 Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: e50d640cacfe61e69f1b4fa92fa0174c688e919c Git-Repo: https://android.googlesource.com/kernel/common.git [junjiew@codeaurora.org: Resolve merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-09 17:58:21 -07:00
Viresh Kumar	60caf9c686	cpufreq: interactive: Remove unnecessary cpu_online() check Cpufreq no longer calls governor callback for offlined cpus. i.e. All policy->cpus are guaranteed to be online. Hence we don't need explicit check to see if cpu is online or not. Change-Id: I9ad85ea4addd5b4a40952e59ed730dd15e328690 Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: ad818d962134c16b1d8aff56edc8a2c4e12fa7b7 Git-Repo: https://android.googlesource.com/kernel/common.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-09 17:58:21 -07:00
Linux Build Service Account	f1f3117134	Merge "cpufreq: cpu-boost: Consider only task load to decide on sync frequency"	2014-05-08 20:06:56 -07:00
Junjie Wu	36223a22b7	qcom-cpufreq: Fix frequency table registration CPUfreq device is the provider of CPU frequency table, and it should always keep track of freq tables for every CPU itself. cpufreq_frequency_get_table() is an API for frequency table user. qcom-cpufreq should not use this API to obtain CPU freq table in the init function. Otherwise, the following sequence will lead to init failure. Assume CPU 0 and 1 are under control of same policy and online. Say policy->cpu is CPU 0. 1) Hot unplug CPU0 update_policy_cpu() will transfer ownership of policy to CPU1 cpufreq_frequency_table_update_policy_cpu() is called during the process to set CPU1's freq table to be the same as CPU0's freq table. CPU0's freq table is now NULL. 2) Hot unplug CPU1 All CPUs in policy are now offline. 3) Hot plug in CPU0 cpufreq_driver->init() is called for initialization. If this init function calls cpufreq_frequency_get_table() on CPU0, the result will be NULL. Current qcom-cpufreq checks return value of cpufreq_frequency_get_table() and returns an error in init. Remove the usage of cpufreq_frequency_get_table() and fill in frequency table for policy->cpu in msm_cpufreq_init(). Change-Id: I2d41af776938c31b4444c9f280a341dd1bed1548 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-08 18:08:52 -07:00
Junjie Wu	73308b21a4	cpufreq: persistent_stats: Use correct CPU to get freq table If a policy manages multiple CPUs, frequency table might only be available on policy->cpu. Use the correct CPU to obtain frequency table. Change-Id: I3a52bb88f64df70e32e511a37d8364a0f62b507c Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-06 14:00:56 -07:00
Girish S Ghongdemath	d77dfa7f35	cpufreq: cpu-boost: Consider only task load to decide on sync frequency Currently we take the maximum between the source CPU frequency and the calculated frequency based on the migrating task load to decide on the frequency to sync the destination CPU to. This was done to handle short bursts in workloads of tasks which migrated immediately after causing a ramp up on the source CPU. Since their load history wasn't high enough, the destination CPU synced to a lower frequency which wasn't sufficient for the spike in workload. But as such cases are rare, taking the higher of source and calculated frequency can lead to destination CPU unnecessarily spending a considerable amount of time at higher frequencies which in turn can hurt power. With this change we make sure only the migrating task load is used to calculate the sync frequency for destination CPU when load based syncing is enabled. Change-Id: Ib1489d256c42ea7712aad2179aebffc87c549836 Signed-off-by: Girish Ghongdemath <girishsg@codeaurora.org>	2014-05-05 16:39:14 -07:00
Linux Build Service Account	16296e0178	Merge "Merge upstream linux-stable v3.10.36 into msm-3.10"	2014-05-05 15:09:00 -07:00
Junjie Wu	750f0b5399	qcom-cpufreq: Add support for separate governor tunables per policy CPUs under different policies might need different governor tunables on certain targets. Enable support for this feature by parsing "qcom,governor-per-policy" DT property. Change-Id: I670135822d67d392030b54f40feab9a3aef11542 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-05-02 19:19:42 -07:00
Saravana Kannan	527ca23a10	cpufreq: Delete unused CPUBW and cache scaling code from cpufreq driver The devfreq cpufreq governor and the devfreq devices for CPUBW and cache now handle CPUBW and cache frequency scaling based on CPU frequency. So, delete all the unused code in the cpufreq driver that relates to scaling CPUBW and cache. Also update the msm-cpufreq DT format/data accordingly. Change-Id: I30bf7d8478c964ffa300dd78039cd8c280ec63fc Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-05-01 19:52:26 -07:00
Matt Wagantall	ee99855fde	qcom-cpufreq: Remove use of device_suspended in the hotplug path The CPU_DOWN_PREPARE notifier in cpufreq.c already takes care of performing a CPUFREQ_GOV_STOP, which waits for in-progress frequency switches to complete and prevents any additional frequency switches from being performed. Remove the redundant code for accomplishing the same thing from qcom-cpufreq. Besides being good cleanup, this change should also resolve a potential deadlock in out-of-memory conditions. The scenario arises if CPU1, executing in msm_cpufreq_target(), holds the qcom-cpufreq suspend_mutex while waiting for the completion at the end of that same function. In an out-of-memory condition, the work performing that completion may not be able to immediately run. At the same time, CPU2 may be executing CPU_DOWN_PREPARE hotplug notifiers. It will be holding the global hotplug mutex at this time, and will eventually become blocked in msm_cpufreq_cpu_callback() while waiting for the CPU1 to release the device_suspended lock. If invoked, the out-of-memory killer will attempt to acquire the hotplug mutex via get_online_cpus() as part of out_of_memory(). This lock is already held by CPU2. CPU2 will not release the lock until the device_suspended lock is released by CPU1. The device_suspended lock will not be release until the completion in msm_cpufreq_target() is marked as complete, which won't happen until the out-of-memory killer is able to free some memory. Nasty deadlock. Clearly, the out-of-memory condition itself needs to be resolved, but there is no harm in resolving the deadlock also. Change-Id: I5f59864aeb12a8c2aa81de16446af09a6d514da4 Signed-off-by: Matt Wagantall <mattw@codeaurora.org>	2014-04-24 15:30:47 -07:00
Ian Maund	356fb13538	Merge upstream linux-stable v3.10.36 into msm-3.10 * commit 'v3.10.36': (494 commits) Linux 3.10.36 netfilter: nf_conntrack_dccp: fix skb_header_pointer API usages mm: close PageTail race net: mvneta: rename MVNETA_GMAC2_PSC_ENABLE to MVNETA_GMAC2_PCS_ENABLE x86: fix boot on uniprocessor systems Input: cypress_ps2 - don't report as a button pads Input: synaptics - add manual min/max quirk for ThinkPad X240 Input: synaptics - add manual min/max quirk Input: mousedev - fix race when creating mixed device ext4: atomically set inode->i_flags in ext4_set_inode_flags() Linux 3.10.35 sched/autogroup: Fix race with task_groups list e100: Fix "disabling already-disabled device" warning xhci: Fix resume issues on Renesas chips in Samsung laptops Input: wacom - make sure touch_max is set for touch devices KVM: VMX: fix use after free of vmx->loaded_vmcs KVM: x86: handle invalid root_hpa everywhere KVM: MMU: handle invalid root_hpa at __direct_map Input: elantech - improve clickpad detection ARM: highbank: avoid L2 cache smc calls when PL310 is not present ... Change-Id: Ib68f565291702c53df09e914e637930c5d3e5310 Signed-off-by: Ian Maund <imaund@codeaurora.org>	2014-04-23 16:23:49 -07:00
Stephen Boyd	d8996f63ab	cpufreq: Fix timer/workqueue corruption due to double queueing commit 3617f2ca6d0eba48114308532945a7f1577816a4 upstream. When a CPU is hot removed we'll cancel all the delayed work items via gov_cancel_work(). Normally this will just cancels a delayed timer on each CPU that the policy is managing and the work won't run, but if the work is already running the workqueue code will wait for the work to finish before continuing to prevent the work items from re-queuing themselves like they normally do. This scheme will work most of the time, except for the case where the work function determines that it should adjust the delay for all other CPUs that the policy is managing. If this scenario occurs, the canceling CPU will cancel its own work but queue up the other CPUs works to run. For example: CPU0 CPU1 ---- ---- cpu_down() ... __cpufreq_remove_dev() cpufreq_governor_dbs() case CPUFREQ_GOV_STOP: gov_cancel_work(dbs_data, policy); cpu0 work is canceled timer is canceled cpu1 work is canceled <work runs> <waits for cpu1> od_dbs_timer() gov_queue_work(, , true); cpu0 work queued cpu1 work queued cpu2 work queued ... cpu1 work is canceled cpu2 work is canceled ... At the end of the GOV_STOP case cpu0 still has a work queued to run although the code is expecting all of the works to be canceled. __cpufreq_remove_dev() will then proceed to re-initialize all the other CPUs works except for the CPU that is going down. The CPUFREQ_GOV_START case in cpufreq_governor_dbs() will trample over the queued work and debugobjects will spit out a warning: WARNING: at lib/debugobjects.c:260 debug_print_object+0x94/0xbc() ODEBUG: init active (active state 0) object type: timer_list hint: delayed_work_timer_fn+0x0/0x10 Modules linked in: CPU: 0 PID: 1491 Comm: sh Tainted: G W 3.10.0 #19 [<c010c178>] (unwind_backtrace+0x0/0x11c) from [<c0109dec>] (show_stack+0x10/0x14) [<c0109dec>] (show_stack+0x10/0x14) from [<c01904cc>] (warn_slowpath_common+0x4c/0x6c) [<c01904cc>] (warn_slowpath_common+0x4c/0x6c) from [<c019056c>] (warn_slowpath_fmt+0x2c/0x3c) [<c019056c>] (warn_slowpath_fmt+0x2c/0x3c) from [<c0388a7c>] (debug_print_object+0x94/0xbc) [<c0388a7c>] (debug_print_object+0x94/0xbc) from [<c0388e34>] (__debug_object_init+0x2d0/0x340) [<c0388e34>] (__debug_object_init+0x2d0/0x340) from [<c019e3b0>] (init_timer_key+0x14/0xb0) [<c019e3b0>] (init_timer_key+0x14/0xb0) from [<c0635f78>] (cpufreq_governor_dbs+0x3e8/0x5f8) [<c0635f78>] (cpufreq_governor_dbs+0x3e8/0x5f8) from [<c06325a0>] (__cpufreq_governor+0xdc/0x1a4) [<c06325a0>] (__cpufreq_governor+0xdc/0x1a4) from [<c0633704>] (__cpufreq_remove_dev.isra.10+0x3b4/0x434) [<c0633704>] (__cpufreq_remove_dev.isra.10+0x3b4/0x434) from [<c08989f4>] (cpufreq_cpu_callback+0x60/0x80) [<c08989f4>] (cpufreq_cpu_callback+0x60/0x80) from [<c08a43c0>] (notifier_call_chain+0x38/0x68) [<c08a43c0>] (notifier_call_chain+0x38/0x68) from [<c01938e0>] (__cpu_notify+0x28/0x40) [<c01938e0>] (__cpu_notify+0x28/0x40) from [<c0892ad4>] (_cpu_down+0x7c/0x2c0) [<c0892ad4>] (_cpu_down+0x7c/0x2c0) from [<c0892d3c>] (cpu_down+0x24/0x40) [<c0892d3c>] (cpu_down+0x24/0x40) from [<c0893ea8>] (store_online+0x2c/0x74) [<c0893ea8>] (store_online+0x2c/0x74) from [<c04519d8>] (dev_attr_store+0x18/0x24) [<c04519d8>] (dev_attr_store+0x18/0x24) from [<c02a69d4>] (sysfs_write_file+0x100/0x148) [<c02a69d4>] (sysfs_write_file+0x100/0x148) from [<c0255c18>] (vfs_write+0xcc/0x174) [<c0255c18>] (vfs_write+0xcc/0x174) from [<c0255f70>] (SyS_write+0x38/0x64) [<c0255f70>] (SyS_write+0x38/0x64) from [<c0106120>] (ret_fast_syscall+0x0/0x30) Signed-off-by: Stephen Boyd <sboyd@codeaurora.org> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-04-14 06:42:19 -07:00
Xiaoguang Chen	ba17ca46b9	cpufreq: Fix governor start/stop race condition commit 95731ebb114c5f0c028459388560fc2a72fe5049 upstream. Cpufreq governors' stop and start operations should be carried out in sequence. Otherwise, there will be unexpected behavior, like in the example below. Suppose there are 4 CPUs and policy->cpu=CPU0, CPU1/2/3 are linked to CPU0. The normal sequence is: 1) Current governor is userspace. An application tries to set the governor to ondemand. It will call __cpufreq_set_policy() in which it will stop the userspace governor and then start the ondemand governor. 2) Current governor is userspace. The online of CPU3 runs on CPU0. It will call cpufreq_add_policy_cpu() in which it will first stop the userspace governor, and then start it again. If the sequence of the above two cases interleaves, it becomes: 1) Application stops userspace governor 2) Hotplug stops userspace governor which is a problem, because the governor shouldn't be stopped twice in a row. What happens next is: 3) Application starts ondemand governor 4) Hotplug starts a governor In step 4, the hotplug is supposed to start the userspace governor, but now the governor has been changed by the application to ondemand, so the ondemand governor is started once again, which is incorrect. The solution is to prevent policy governors from being stopped multiple times in a row. A governor should only be stopped once for one policy. After it has been stopped, no more governor stop operations should be executed. Also add a mutex to serialize governor operations. [rjw: Changelog. And you owe me a beverage of my choice.] Signed-off-by: Xiaoguang Chen <chenxg@marvell.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Cc: Krzysztof Kozlowski <k.kozlowski@samsung.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-04-14 06:42:19 -07:00
Mikulas Patocka	45deaa3ba8	powernow-k6: reorder frequencies commit 22c73795b101597051924556dce019385a1e2fa0 upstream. This patch reorders reported frequencies from the highest to the lowest, just like in other frequency drivers. Signed-off-by: Mikulas Patocka <mpatocka@redhat.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-04-14 06:42:14 -07:00
Mikulas Patocka	59b61f4dd5	powernow-k6: correctly initialize default parameters commit d82b922a4acc1781d368aceac2f9da43b038cab2 upstream. The powernow-k6 driver used to read the initial multiplier from the powernow register. However, there is a problem with this: * If there was a frequency transition before, the multiplier read from the register corresponds to the current multiplier. * If there was no frequency transition since reset, the field in the register always reads as zero, regardless of the current multiplier that is set using switches on the mainboard and that the CPU is running at. The zero value corresponds to multiplier 4.5, so as a consequence, the powernow-k6 driver always assumes multiplier 4.5. For example, if we have 550MHz CPU with bus frequency 100MHz and multiplier 5.5, the powernow-k6 driver thinks that the multiplier is 4.5 and bus frequency is 122MHz. The powernow-k6 driver then sets the multiplier to 4.5, underclocking the CPU to 450MHz, but reports the current frequency as 550MHz. There is no reliable way how to read the initial multiplier. I modified the driver so that it contains a table of known frequencies (based on parameters of existing CPUs and some common overclocking schemes) and sets the multiplier according to the frequency. If the frequency is unknown (because of unusual overclocking or underclocking), the user must supply the bus speed and maximum multiplier as module parameters. This patch should be backported to all stable kernels. If it doesn't apply cleanly, change it, or ask me to change it. Signed-off-by: Mikulas Patocka <mpatocka@redhat.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-04-14 06:42:14 -07:00
Mikulas Patocka	cfc83ee694	powernow-k6: disable cache when changing frequency commit e20e1d0ac02308e2211306fc67abcd0b2668fb8b upstream. I found out that a system with k6-3+ processor is unstable during network server load. The system locks up or the network card stops receiving. The reason for the instability is the CPU frequency scaling. During frequency transition the processor is in "EPM Stop Grant" state. The documentation says that the processor doesn't respond to inquiry requests in this state. Consequently, coherency of processor caches and bus master devices is not maintained, causing the system instability. This patch flushes the cache during frequency transition. It fixes the instability. Other minor changes: * u64 invalue changed to unsigned long because the variable is 32-bit * move the logic to set the multiplier to a separate function powernow_k6_set_cpu_multiplier * preserve lower 5 bits of the powernow port instead of 4 (the voltage field has 5 bits) * mask interrupts when reading the multiplier, so that the port is not open during other activity (running other kernel code with the port open shouldn't cause any misbehavior, but we should better be safe and keep the port closed) This patch should be backported to all stable kernels. If it doesn't apply cleanly, change it, or ask me to change it. Signed-off-by: Mikulas Patocka <mpatocka@redhat.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-04-14 06:42:14 -07:00
Junjie Wu	07098af553	qcom-cpufreq: Allow all CPUs to get cpu_clk on targets with sync CPUs For targets with synchronous CPU clocks, only CPU0 has a clock pointer in cpu_clks[]. This forces CPU0 to be the last CPU to be hotplugged out, or go to certain low power mode that needs to disable CPU clocks. Remove this requirement by adding clock pointer for all CPUs. Drivers that ask for CPU clocks will get same clock for CPUs in sync. Change-Id: Ib97af2c3b0fdbc8dbb78364178225cc64ed02036 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-04-04 11:37:44 -07:00
Junjie Wu	b638af72c6	qcom-cpufreq: Fix hotplug blocking logic Change `2b63b10` qcom-cpufreq: Block hotplug until cpufreq is ready intended to block hotplug until qcom-cpufreq driver has got all clocks. However, its implementation still leaves a small window when CPU 0 has got its clock but others haven't. If a hotplug happens on another CPU that doesn't have its clock pointer, refcount of that CPU clock will again be wrong. Fix the issue by introducing a ready flag. Change-Id: I1e05b063b0584088f717b82709a3f1d55bc6561b Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-04-04 11:37:44 -07:00
Linux Build Service Account	b198cb9af1	Merge "Merge upstream linux-stable v3.10.28 into msm-3.10"	2014-03-26 23:36:07 -07:00
Linux Build Service Account	72bd72cf80	Merge "cpufreq: persistent_stats: make enable node writable only by root"	2014-03-26 11:12:38 -07:00
Linux Build Service Account	37a95a299f	Merge "qcom-cpufreq: Move the enabling of the CPU/L2 clocks to CPU_STARTING"	2014-03-26 11:11:21 -07:00
Anurag Singh	fa7c796bd6	cpufreq: persistent_stats: make enable node writable only by root The 'enable' node in the sysfs structure is currently writable by all users. This is not desirable from an access control perspective. Changing permissions to provide write access to root only. Change-Id: I92525cc63571607a38c7aada2a6853cf89b5b38f Signed-off-by: Anurag Singh <anursing@codeaurora.org>	2014-03-24 18:13:46 -07:00
Ian Maund	f1b32d4e47	Merge upstream linux-stable v3.10.28 into msm-3.10 The following commits have been reverted from this merge, as they are known to introduce new bugs and are currently incompatible with our audio implementation. Investigation of these commits is ongoing, and they are expected to be brought in at a later time: `86e6de7` ALSA: compress: fix drain calls blocking other compress functions (v6) `16442d4` ALSA: compress: fix drain calls blocking other compress functions This merge commit also includes a change in block, necessary for compilation. Upstream has modified elevator_init_fn to prevent race conditions, requring updates to row_init_queue and test_init_queue. * commit 'v3.10.28': (1964 commits) Linux 3.10.28 ARM: 7938/1: OMAP4/highbank: Flush L2 cache before disabling drm/i915: Don't grab crtc mutexes in intel_modeset_gem_init() serial: amba-pl011: use port lock to guard control register access mm: Make {,set}page_address() static inline if WANT_PAGE_VIRTUAL md/raid5: Fix possible confusion when multiple write errors occur. md/raid10: fix two bugs in handling of known-bad-blocks. md/raid10: fix bug when raid10 recovery fails to recover a block. md: fix problem when adding device to read-only array with bitmap. drm/i915: fix DDI PLLs HW state readout code nilfs2: fix segctor bug that causes file system corruption thp: fix copy_page_rep GPF by testing is_huge_zero_pmd once only ftrace/x86: Load ftrace_ops in parameter not the variable holding it SELinux: Fix possible NULL pointer dereference in selinux_inode_permission() writeback: Fix data corruption on NFS hwmon: (coretemp) Fix truncated name of alarm attributes vfs: In d_path don't call d_dname on a mount point staging: comedi: adl_pci9111: fix incorrect irq passed to request_irq() staging: comedi: addi_apci_1032: fix subdevice type/flags bug mm/memory-failure.c: recheck PageHuge() after hugetlb page migrate successfully GFS2: Increase i_writecount during gfs2_setattr_chown perf/x86/amd/ibs: Fix waking up from S3 for AMD family 10h perf scripting perl: Fix build error on Fedora 12 ARM: 7815/1: kexec: offline non panic CPUs on Kdump panic Linux 3.10.27 sched: Guarantee new group-entities always have weight sched: Fix hrtimer_cancel()/rq->lock deadlock sched: Fix cfs_bandwidth misuse of hrtimer_expires_remaining sched: Fix race on toggling cfs_bandwidth_used x86, fpu, amd: Clear exceptions in AMD FXSAVE workaround netfilter: nf_nat: fix access to uninitialized buffer in IRC NAT helper SCSI: sd: Reduce buffer size for vpd request intel_pstate: Add X86_FEATURE_APERFMPERF to cpu match parameters. mac80211: move "bufferable MMPDU" check to fix AP mode scan ACPI / Battery: Add a _BIX quirk for NEC LZ750/LS ACPI / TPM: fix memory leak when walking ACPI namespace mfd: rtsx_pcr: Disable interrupts before cancelling delayed works clk: exynos5250: fix sysmmu_mfc{l,r} gate clocks clk: samsung: exynos5250: Add CLK_IGNORE_UNUSED flag for the sysreg clock clk: samsung: exynos4: Correct SRC_MFC register clk: clk-divider: fix divisor > 255 bug ahci: add PCI ID for Marvell 88SE9170 SATA controller parisc: Ensure full cache coherency for kmap/kunmap drm/nouveau/bios: make jump conditional ARM: shmobile: mackerel: Fix coherent DMA mask ARM: shmobile: armadillo: Fix coherent DMA mask ARM: shmobile: kzm9g: Fix coherent DMA mask ARM: dts: exynos5250: Fix MDMA0 clock number ARM: fix "bad mode in ... handler" message for undefined instructions ARM: fix footbridge clockevent device net: Loosen constraints for recalculating checksum in skb_segment() bridge: use spin_lock_bh() in br_multicast_set_hash_max netpoll: Fix missing TXQ unlock and and OOPS. net: llc: fix use after free in llc_ui_recvmsg virtio-net: fix refill races during restore virtio_net: don't leak memory or block when too many frags virtio-net: make all RX paths handle errors consistently virtio_net: fix error handling for mergeable buffers vlan: Fix header ops passthru when doing TX VLAN offload. net: rose: restore old recvmsg behavior rds: prevent dereference of a NULL device ipv6: always set the new created dst's from in ip6_rt_copy net: fec: fix potential use after free hamradio/yam: fix info leak in ioctl drivers/net/hamradio: Integer overflow in hdlcdrv_ioctl() net: inet_diag: zero out uninitialized idiag_{src,dst} fields ip_gre: fix msg_name parsing for recvfrom/recvmsg net: unix: allow bind to fail on mutex lock ipv6: fix illegal mac_header comparison on 32bit netvsc: don't flush peers notifying work during setting mtu tg3: Initialize REG_BASE_ADDR at PCI config offset 120 to 0 net: unix: allow set_peek_off to fail net: drop_monitor: fix the value of maxattr ipv6: don't count addrconf generated routes against gc limit packet: fix send path when running with proto == 0 virtio: delete napi structures from netdev before releasing memory macvtap: signal truncated packets tun: update file current position macvtap: update file current position macvtap: Do not double-count received packets rds: prevent BUG_ON triggered on congestion update to loopback net: do not pretend FRAGLIST support IPv6: Fixed support for blackhole and prohibit routes HID: Revert "Revert "HID: Fix logitech-dj: missing Unifying device issue"" gpio-rcar: R-Car GPIO IRQ share interrupt clocksource: em_sti: Set cpu_possible_mask to fix SMP broadcast irqchip: renesas-irqc: Fix irqc_probe error handling Linux 3.10.26 sh: add EXPORT_SYMBOL(min_low_pfn) and EXPORT_SYMBOL(max_low_pfn) to sh_ksyms_32.c ext4: fix bigalloc regression arm64: Use Normal NonCacheable memory for writecombine arm64: Do not flush the D-cache for anonymous pages arm64: Avoid cache flushing in flush_dcache_page() ARM: KVM: arch_timers: zero CNTVOFF upon return to host ARM: hyp: initialize CNTVOFF to zero clocksource: arch_timer: use virtual counters arm64: Remove unused cpu_name ascii in arch/arm64/mm/proc.S arm64: dts: Reserve the memory used for secondary CPU release address arm64: check for number of arguments in syscall_get/set_arguments() arm64: fix possible invalid FPSIMD initialization state ... Change-Id: Ia0e5d71b536ab49ec3a1179d59238c05bdd03106 Signed-off-by: Ian Maund <imaund@codeaurora.org>	2014-03-24 14:28:34 -07:00
Linux Build Service Account	d556a8f203	Merge changes Ifbdd3ecc,Ieca413c1,Id2ba91cc,Ib83207b9 into msm-3.10 * changes: cpufreq: cpu-boost: Handle wakeup hints received for foreground tasks sched: Call the notify_on_migrate notifier chain for wakeups as well cpufreq: cpu-boost: Introduce scheduler assisted load based syncs sched: window-based load stats for tasks	2014-03-22 04:36:25 -07:00
Vikram Mulukutla	6760936311	qcom-cpufreq: Move the enabling of the CPU/L2 clocks to CPU_STARTING It is a requirement of PLLs in MSM SoCs that the PLL power supplies be enabled while the PLL is running. If the PLL is locked and outputs enabled, turning off the regulator(s) supplying the PLL will cause the PLL to enter an unpredictable state. Now in the CPU_UP_PREPARE notifier, the CPU clocks are prepared and enabled, causing the source HFPLLs to also turn on. Note that the CPU isn't clocked yet. It is possible that execution is pre-empted and the CPU running the notifier enters power collapse. If all other CPUs also enter power collapse, then it is possible for an RPM notification to go out, allowing the RPM to transition the Apps processor to its sleep set. This can result in the HFPLL supplies being turned off while the HFPLL is running, violating the requirement mentioned above. Once the CPU is unclamped, the CPU is effectively unclocked, due to the HFPLL being in an unknown state. There is a check that is enabled in the PM code's CPU_UP_PREPARE notifier callback. This check ensures that the problematic RPM notification cannot occur until the core that is being brought online also enters power collapse. However, there is no ordering guarantee between that PM's hotplug notifier callback's execution and the cpufreq hotplug notifier callback's execution. This ordering depends on program link order, which is unreliable. It is necessary to ensure that once the HFPLL is enabled, the RPM cannot transition apps to its sleep set. Move the enabling of the CPU clocks to the CPU_STARTING notifier, which runs on the CPU bring brought online. The CPU_STARTING notifier is guaranteed to run after the CPU_UP_PREPARE notifier, which implies that the aforementioned do-not-notify-rpm check is executed before the HFPLL is enabled. Therefore even if all cores enter power collapse after the HFPLL is enabled, the HFPLL supplies are guaranteed to stay on, or the CPU clock is switched to the safe source and the HFPLL is turned off. CRs-Fixed: 622738 Change-Id: I136841405806c07e755919859e649ded0c719abb Signed-off-by: Vikram Mulukutla <markivx@codeaurora.org>	2014-03-21 10:55:57 -07:00
Rohit Gupta	c09decae9b	cpufreq: cpu-boost: Handle wakeup hints received for foreground tasks A previous change modifies the notification conditions in the scheduler to call the notifier chain even on foreground_thread wakeups for tasks having load more than a threshold value. If load_based_syncs is turned OFF then we do not need to perform cpu boost for wakeup hints for foreground tasks from scheduler Change-Id: Ifbdd3eccac5c9892dfc3a3c3edbfc0df766478ed Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2014-03-20 16:30:45 -07:00
Rohit Gupta	1f25d291e3	cpufreq: cpu-boost: Introduce scheduler assisted load based syncs Previously, on getting a migration notification cpu-boost changed the scaling min of the destination frequency to match that of the source frequency or sync_threshold whichever was minimum. If the scheduler migration notification is extended with task load (cpu demand) information, the cpu boost driver can use this load to compute a suitable frequency for the migrating task. The required frequency for the task is calculated by taking the load percentage of the max frequency and no sync is performed if the load is less than a particular value (migration_load_threshold).This change is beneficial for both perf and power as demand of a task is taken into consideration while making cpufreq decisions and unnecessary syncs for lightweight tasks are avoided. The task load information provided by scheduler comes from a window-based load collection mechanism which also normalizes the load collected by the scheduler to the max possible frequency across all CPUs. Change-Id: Id2ba91cc4139c90602557f9b3801fb06b3c38992 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2014-03-20 16:30:34 -07:00
Rohit Gupta	ec269d9945	cpufreq: interactive: Don't reset sampling down factor at max freq A previous patch sets the min_sample_time to zero if the cpufreq min/max limits have changed which would bring down the frequency in the next sample even if min_sample_time doesnt allow it to do so. This could severly affect benchmarks if this is applied to max frequencies as min_sample_time for max frequencies is set to sampling_down_factor. This patch resets the limits_changed flag for cases where timer fires while running at max frequencies so that sampling_down_factor is still applicable as before. Change-Id: I397133cb15eea4ab999d0f69617b4922438c6e11 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2014-03-20 14:29:52 -07:00
Linux Build Service Account	dd8aa0bf59	Merge "cpufreq: Kconfig: Move CPU_FREQ_MSM from arch to cpufreq"	2014-03-19 18:59:01 -07:00
Patrick Cain	f8b276565c	cpufreq: Sync on thread migration optimizations When threads migrate between CPUs the destination CPU will temporarily have a boosted frequency and the source CPU will have a residual higher frequency even though the load could have just been reduced. To avoid leaving CPUs at a high frequencies notify the source CPU after a thread migration and make the interactive governor's timer reschedule conditionally. Change-Id: Id55055f2a116ca7ffd2a34dc3d0ec519c888a111 Signed-off-by: Patrick Cain <pcain@codeaurora.org>	2014-03-19 13:42:21 -07:00
Patrick Cain	2d5582aca2	cpufreq: cpu-boost: Re-issue boosts above minimum frequency Frequency boosts where the source CPU frequency is greater than CPU's minimum frequency should always go through regardless of the destination CPU's current frequency. This fixes a performance issue where the governor lowers the CPU frequency shortly after a thread is migrated to it because the boost wasn't re-issued. Change-Id: I449545a688d84b0a6e834f5a51dcb499caa84d29 Signed-off-by: Patrick Cain <pcain@codeaurora.org>	2014-03-18 20:11:26 -07:00
Junjie Wu	a8d0e9e173	cpufreq: Kconfig: Move CPU_FREQ_MSM from arch to cpufreq drivers/cpufreq already has Kconfig.arm for vendor specific Kconfig definition. Move CPU_FREQ_MSM from arch/arm/Kconfig to drivers/cpufreq/Kconfig.arm so that ARM64 can share the same configuration option. Change-Id: I68a0a170cd676541ecba325722e0d0e0fe15aefc Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-03-18 11:41:48 -07:00
Junjie Wu	87e8465ab1	qcom-cpufreq: Use CONFIG_CPU_FREQ_MSM to compile qcom-cpufreq The config string used for compiling qcom-cpufreq was changed from CONFIG_CPU_FREQ_MSM to CONFIG_ARCH_MSM in `d0e097389e` (msm: cpufreq: Move cpufreq to drivers/cpufreq/) However, not all targets with CONFIG_ARCH_MSM will always have cpufreq enabled all the time. Previous change leads to definition errors when CONFIG_CPU_FREQ_MSM is not selected on certain MSM targets. Change-Id: I1f99e27282032ab62216100e5712b751dcc10fe9 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-03-13 14:02:43 -07:00
Saravana Kannan	71f5d65630	cpufreq: stats: Fix error handling in __cpufreq_stats_create_table() Remove sysfs group if __cpufreq_stats_create_table() fails after creating one. Change-Id: Icb0b44424cc4eb6c88be255e2839ef51c3f8779c Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-03-07 11:18:11 -08:00
Saravana Kannan	cdeeafddd7	cpufreq: stats: Remove redundant cpufreq_cpu_get() call __cpufreq_stats_create_table always gets pass the valid and real policy struct. So, there's no need to call cpufreq_cpu_get() to get the policy again. Change-Id: I0136b3e67018ee3af2335906407f55d8c6219f71 Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-03-07 11:18:10 -08:00
Junjie Wu	f900539bcf	Revert "cpufreq: Call cpufreq_update_policy() during cpufreq_stats_init()" This reverts commit `f30be559b1`. cpufreq_stats now calls cpufreq_stats_create_table() directly in cpufreq_stats_init(). It's no longer necessary to call cpufreq_update_policy(). Change-Id: I28bb752f2995e569d55a1a6f238a8e671d673891 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-03-07 11:18:10 -08:00
viresh kumar	ab4d000190	cpufreq: remove sysfs link when a cpu != policy->cpu, is removed Commit 42f921a (cpufreq: remove sysfs files for CPUs which failed to come back after resume) tried to do this but missed this piece of code to fix. Currently we are getting this on suspend/resume: ------------[ cut here ]------------ WARNING: CPU: 0 PID: 877 at fs/sysfs/dir.c:52 sysfs_warn_dup+0x68/0x84() sysfs: cannot create duplicate filename '/devices/system/cpu/cpu1/cpufreq' Modules linked in: brcmfmac brcmutil CPU: 0 PID: 877 Comm: test-rtc-resume Not tainted 3.14.0-rc2-00259-g9398a10cd964 #12 [<c0015bac>] (unwind_backtrace) from [<c0011850>] (show_stack+0x10/0x14) [<c0011850>] (show_stack) from [<c056e018>] (dump_stack+0x80/0xcc) [<c056e018>] (dump_stack) from [<c0025e44>] (warn_slowpath_common+0x64/0x88) [<c0025e44>] (warn_slowpath_common) from [<c0025efc>] (warn_slowpath_fmt+0x30/0x40) [<c0025efc>] (warn_slowpath_fmt) from [<c012776c>] (sysfs_warn_dup+0x68/0x84) [<c012776c>] (sysfs_warn_dup) from [<c0127a54>] (sysfs_do_create_link_sd+0xb0/0xb8) [<c0127a54>] (sysfs_do_create_link_sd) from [<c038ef64>] (__cpufreq_add_dev.isra.27+0x2a8/0x814) [<c038ef64>] (__cpufreq_add_dev.isra.27) from [<c038f548>] (cpufreq_cpu_callback+0x70/0x8c) [<c038f548>] (cpufreq_cpu_callback) from [<c0043864>] (notifier_call_chain+0x44/0x84) [<c0043864>] (notifier_call_chain) from [<c0025f60>] (__cpu_notify+0x28/0x44) [<c0025f60>] (__cpu_notify) from [<c00261e8>] (_cpu_up+0xf0/0x140) [<c00261e8>] (_cpu_up) from [<c0569eb8>] (enable_nonboot_cpus+0x68/0xb0) [<c0569eb8>] (enable_nonboot_cpus) from [<c006339c>] (suspend_devices_and_enter+0x198/0x2dc) [<c006339c>] (suspend_devices_and_enter) from [<c0063654>] (pm_suspend+0x174/0x1e8) [<c0063654>] (pm_suspend) from [<c00624e0>] (state_store+0x6c/0xbc) [<c00624e0>] (state_store) from [<c01fc200>] (kobj_attr_store+0x14/0x20) [<c01fc200>] (kobj_attr_store) from [<c0126e50>] (sysfs_kf_write+0x44/0x48) [<c0126e50>] (sysfs_kf_write) from [<c012a274>] (kernfs_fop_write+0xb4/0x14c) [<c012a274>] (kernfs_fop_write) from [<c00d4818>] (vfs_write+0xa8/0x180) [<c00d4818>] (vfs_write) from [<c00d4bb8>] (SyS_write+0x3c/0x70) [<c00d4bb8>] (SyS_write) from [<c000e620>] (ret_fast_syscall+0x0/0x30) ---[ end trace 76969904b614c18f ]--- Fix this by removing sysfs link for cpufreq directory when cpu removed isn't policy->cpu. Change-Id: Iea78fcb07670306a47c8f712e93c39897966c2d0 Revamps: 42f921a (cpufreq: remove sysfs files for CPUs which failed to come back after resume) Reported-and-tested-by: Stephen Warren <swarren@nvidia.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 6964d91db2becfe80658f50584d264708ca7f49e Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-03-07 11:17:29 -08:00
Srivatsa S. Bhat	1bcccca64c	cpufreq: powernow-k8: Initialize per-cpu data-structures properly commit c3274763bfc3bf1ececa269ed6e6c4d7ec1c3e5e upstream. The powernow-k8 driver maintains a per-cpu data-structure called powernow_data that is used to perform the frequency transitions. It initializes this data structure only for the policy->cpu. So, accesses to this data structure by other CPUs results in various problems because they would have been uninitialized. Specifically, if a cpu (!= policy->cpu) invokes the drivers' ->get() function, it returns 0 as the KHz value, since its per-cpu memory doesn't point to anything valid. This causes problems during suspend/resume since cpufreq_update_policy() tries to enforce this (0 KHz) as the current frequency of the CPU, and this madness gets propagated to adjust_jiffies() as well. Eventually, lots of things start breaking down, including the r8169 ethernet card, in one particularly interesting case reported by Pierre Ossman. Fix this by initializing the per-cpu data-structures of all the CPUs in the policy appropriately. References: https://bugzilla.kernel.org/show_bug.cgi?id=70311 Reported-by: Pierre Ossman <pierre@ossman.eu> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-03-06 21:30:09 -08:00
Viresh Kumar	5d4eda83b8	cpufreq: stats: create sysfs entries when cpufreq_stats is a module When cpufreq_stats is compiled in as a module, cpufreq driver would have already been registered. And so the CPUFREQ_CREATE_POLICY notifiers wouldn't be called for it. Hence no sysfs entries for stats. :( This patch calls cpufreq_stats_create_table() for each online CPU from cpufreq_stats_init() and so if policy is already created for CPUx then we will register sysfs stats for it. When its not compiled as module, we will return early as policy wouldn't be found for any of the CPUs. Change-Id: Ie8841c8a9c06b61231afd3f5377b73be24473527 Acked-by: Nicolas Pitre <nico@linaro.org> Tested-by: Nicolas Pitre <nico@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: b3f9ff88db6e149086dc111e5652106005fd5923 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-27 18:07:50 -08:00
Viresh Kumar	4fa438995a	cpufreq: stats: free table and remove sysfs entry in a single routine We don't have code paths now where we need to do these two things separately, so it is better do them in a single routine. Just as they are allocated in a single routine. Change-Id: I42f5245d0f9a0c38583f20397b453ae2a0e4f268 Acked-by: Nicolas Pitre <nico@linaro.org> Tested-by: Nicolas Pitre <nico@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 2d13594dcb7eaa41fa180e8a3b27a8f10845381f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-27 18:07:50 -08:00
Viresh Kumar	c76d47f089	cpufreq: stats: remove hotplug notifiers Either CPUs are hot-unplugged or suspend/resume occurs, cpufreq core will send notifications to cpufreq-stats and stats structure and sysfs entries would be correctly handled.. And so we don't actually need hotcpu notifiers in cpufreq-stats anymore. We were only handling cpu hot-unplug events here and that are already taken care of by POLICY notifiers. Change-Id: Icc5df84bdec5704976d24f14c36c8485bdbf0dce Acked-by: Nicolas Pitre <nico@linaro.org> Tested-by: Nicolas Pitre <nico@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 027cc2e4a6127fdf29e17a422aacb5ecd9830cbb Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve merge conflict] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-27 18:07:49 -08:00
Viresh Kumar	aaae89ea07	cpufreq: stats: handle cpufreq_unregister_driver() and suspend/resume properly There are several problems with cpufreq stats in the way it handles cpufreq_unregister_driver() and suspend/resume.. - We must not lose data collected so far when suspend/resume happens and so stats directories must not be removed/allocated during these operations, which is done currently. - cpufreq_stat has registered notifiers with both cpufreq and hotplug. It adds sysfs stats directory with a cpufreq notifier: CPUFREQ_NOTIFY and removes this directory with a notifier from hotplug core. In case cpufreq_unregister_driver() is called (on rmmod cpufreq driver), stats directories per cpu aren't removed as CPUs are still online. The only call cpufreq_stats gets is cpufreq_stats_update_policy_cpu() for all CPUs except the last of each policy. And pointer to stat information is stored in the entry for last CPU in the per-cpu cpufreq_stats_table. But policy structure would be freed inside cpufreq core and so that will result in memory leak inside cpufreq stats (as we are never freeing memory for stats). Now if we again insert the module cpufreq_register_driver() will be called and we will again allocate stats data and put it on for first CPU of every policy. In case we only have a single CPU per policy, we will return with a error from cpufreq_stats_create_table() due to this code: if (per_cpu(cpufreq_stats_table, cpu)) return -EBUSY; And so probably cpufreq stats directory would not show up anymore (as it was added inside last policies->kobj which doesn't exist anymore). I haven't tested it, though. Also the values in stats files wouldn't be refreshed as we are using the earlier stats structure. - CPUFREQ_NOTIFY is called from cpufreq_set_policy() which is called for scenarios where we don't really want cpufreq_stat_notifier_policy() to get called. For example whenever we are changing anything related to a policy: min/max/current freq, etc. cpufreq_set_policy() is called and so cpufreq stats is notified. Where we don't do any useful stuff other than simply returning with -EBUSY from cpufreq_stats_create_table(). And so this isn't the right notifier that cpufreq stats.. Due to all above reasons this patch does following changes: - Add new notifiers CPUFREQ_CREATE_POLICY and CPUFREQ_REMOVE_POLICY, which are only called when policy is created/destroyed. They aren't called for suspend/resume paths.. - Use these notifiers in cpufreq_stat_notifier_policy() to create/destory stats sysfs entries. And so cpufreq_unregister_driver() or suspend/resume shouldn't be a problem for cpufreq_stats. - Return early from cpufreq_stat_cpu_callback() for suspend/resume sequence, so that we don't free stats structure. Change-Id: I03340f9fe3f79cb4df3ef94e94b9ff573c1df422 Acked-by: Nicolas Pitre <nico@linaro.org> Tested-by: Nicolas Pitre <nico@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: fcd7af917abba798cd954419030142e95139359f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-27 18:07:48 -08:00
Viresh Kumar	65bf3806ef	cpufreq: remove sysfs files for CPUs which failed to come back after resume There are cases where cpufreq_add_dev() may fail for some CPUs during system resume. With the current code we will still have sysfs cpufreq files for those CPUs and struct cpufreq_policy would be already freed for them. Hence any operation on those sysfs files would result in kernel warnings. Example of problems resulting from resume errors (from Bjørn Mork): WARNING: CPU: 0 PID: 6055 at fs/sysfs/file.c:343 sysfs_open_file+0x77/0x212() missing sysfs attribute operations for kobject: (null) Modules linked in: [stripped as irrelevant] CPU: 0 PID: 6055 Comm: grep Tainted: G D 3.13.0-rc2 #153 Hardware name: LENOVO 2776LEG/2776LEG, BIOS 6EET55WW (3.15 ) 12/19/2011 0000000000000009 ffff8802327ebb78 ffffffff81380b0e 0000000000000006 ffff8802327ebbc8 ffff8802327ebbb8 ffffffff81038635 0000000000000000 ffffffff811823c7 ffff88021a19e688 ffff88021a19e688 ffff8802302f9310 Call Trace: [<ffffffff81380b0e>] dump_stack+0x55/0x76 [<ffffffff81038635>] warn_slowpath_common+0x7c/0x96 [<ffffffff811823c7>] ? sysfs_open_file+0x77/0x212 [<ffffffff810386e3>] warn_slowpath_fmt+0x41/0x43 [<ffffffff81182dec>] ? sysfs_get_active+0x6b/0x82 [<ffffffff81182382>] ? sysfs_open_file+0x32/0x212 [<ffffffff811823c7>] sysfs_open_file+0x77/0x212 [<ffffffff81182350>] ? sysfs_schedule_callback+0x1ac/0x1ac [<ffffffff81122562>] do_dentry_open+0x17c/0x257 [<ffffffff8112267e>] finish_open+0x41/0x4f [<ffffffff81130225>] do_last+0x80c/0x9ba [<ffffffff8112dbbd>] ? inode_permission+0x40/0x42 [<ffffffff81130606>] path_openat+0x233/0x4a1 [<ffffffff81130b7e>] do_filp_open+0x35/0x85 [<ffffffff8113b787>] ? __alloc_fd+0x172/0x184 [<ffffffff811232ea>] do_sys_open+0x6b/0xfa [<ffffffff811233a7>] SyS_openat+0xf/0x11 [<ffffffff8138c812>] system_call_fastpath+0x16/0x1b To fix this, remove those sysfs files or put the associated kobject in case of such errors. Also, to make it simple, remove the cpufreq sysfs links from all the CPUs (except for the policy->cpu) during suspend, as that operation won't result in a loss of sysfs file permissions and we can create those links during resume just fine. Change-Id: I11a90d6b7d3422037af8147d8d9b41afad7c5423 Fixes: 5302c3fb2e62 ("cpufreq: Perform light-weight init/teardown during suspend/resume") Reported-and-tested-by: Bjørn Mork <bjorn@mork.no> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Cc: 3.12+ <stable@vger.kernel.org> # 3.12+ [rjw: Changelog] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 42f921a6f10c6c2079b093a115eb7e3c3508357f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-27 18:07:47 -08:00
Linux Build Service Account	a4331ca2b8	Merge "cpufreq: cpu-boost: Don't register for cpufreq notifiers too early"	2014-02-26 21:23:40 -08:00
Linux Build Service Account	52159dfc55	Merge "qcom-cpufreq: Block hotplug until cpufreq is ready"	2014-02-26 17:32:08 -08:00
Linux Build Service Account	663435db62	Merge "cpufreq: interactive: delete timers for GOV_START"	2014-02-26 06:55:27 -08:00
Junjie Wu	2b63b109fc	qcom-cpufreq: Block hotplug until cpufreq is ready Hotplug before qcom-cpufreq is ready could lead to inconsistent CPU clock state. Block hotplug by returning NOTIFY_BAD in hotplug callback until qcom-cpufreq is probed. Change-Id: I72a2f98c083c9b21b95ecafdb5a5be7a7682e842 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-25 11:22:11 -08:00
Shridhar Rasal	13f3529d5d	cpufreq: interactive: delete timers for GOV_START Make sure that timers cpu_timer and cpu_slack_timer deactivated before addition of new. Change-Id: If31c4049606871df6f00efdc24b1d713c86a6f69 Signed-off-by: Shridhar Rasal <srasal@nvidia.com> Signed-off-by: Bharat Nihalani <bnihalani@nvidia.com> Git-commit: b430123367e78a6557bac3cf1558bcb85193fb12 Git-repo: https://android.googlesource.com/kernel/common/ [mattw@codeaurora.org: resolved trivial context conflict] Signed-off-by: Matt Wagantall <mattw@codeaurora.org>	2014-02-24 16:14:50 -08:00
Junjie Wu	d0e097389e	msm: cpufreq: Move cpufreq to drivers/cpufreq/ Architecutural changes in the ARM Linux kernel tree mandate the eventual removal of the mach-* directories. Move mach/cpufreq to driver/cpufreq/. Also move related header to include/linux. Change-Id: I6dcf69e275b7ca7ba913e945353a42f0d6321731 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-02-21 11:17:03 -08:00
Saravana Kannan	d1631df89a	cpufreq: cpu-boost: Don't register for cpufreq notifiers too early The cpufreq notifiers should be registered only after all the data structures used in the notifier callbacks have been initialized. So, move the cpufreq notifier registration to a later point in the init function. Change-Id: I043ab5bc0ebb98164c40549fe151a8d801c8c186 Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-02-13 19:08:21 -08:00
Anurag Singh	a16e20e70e	cpufreq: persistent_stats: export persistent CPU frequency data CPU frequency statistics are lost when a CPU is put offline to save power. Due to this behavior, there is currently no way of knowing what frequencies a CPU ran at over the system's uptime (assuming it is hot-pluggable and was offlined/onlined multiple times). To solve this problem, export these statistics so that frequency residencies for all CPUs are preserved. These statistics can be found under /sys/devices/system/cpu/cpufreq/stats/cpuX - 'cpuX' is the CPU ID. Following are the important nodes: - time_in_state: reading this node would output all the frequencies the CPU is capable of running at and the amount of time (in jiffies) it ran at those frequencies - reset: writing 1 to this node would reset all frequency time stats to 0 - enable: writing 0 or 1 to this node disables or enables stats collection, respectively. By default, stats collection is enabled. Change-Id: I225ef89f7b359f1f94386f2f9445ece9d5119768 Signed-off-by: Anurag Singh <anursing@codeaurora.org>	2014-02-13 15:15:02 -08:00
Dirk Brandewie	0df520d459	intel_pstate: Correct calculation of min pstate value commit 7244cb62d96e735847dc9d08f870550df896898c upstream. The minimum pstate is supposed to be a percentage of the maximum P state available. Calculate min using max pstate and not the current max which may have been limited by the user Signed-off-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-02-13 13:48:04 -08:00
Brennan Shacklett	e34ce30f32	intel_pstate: Improve accuracy by not truncating until final result commit d253d2a52676cfa3d89b8f0737a08ce7db665207 upstream. This patch addresses Bug 60727 (https://bugzilla.kernel.org/show_bug.cgi?id=60727) which was due to the truncation of intermediate values in the calculations, which causes the code to consistently underestimate the current cpu frequency, specifically 100% cpu utilization was truncated down to the setpoint of 97%. This patch fixes the problem by keeping the results of all intermediate calculations as fixed point numbers rather scaling them back and forth between integers and fixed point. References: https://bugzilla.kernel.org/show_bug.cgi?id=60727 Signed-off-by: Brennan Shacklett <bpshacklett@gmail.com> Acked-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-02-13 13:48:04 -08:00
Srinivas Pandruvada	3dc642a398	intel_pstate: fix no_turbo commit 1ccf7a1cdafadd02e33e8f3d74370685a0600ec6 upstream. When sysfs for no_turbo is set, then also some p states in turbo regions are observed. This patch will set IDA Engage bit when no_turbo is set to explicitly disengage turbo. Signed-off-by: Srinivas Pandruvada <srinivas.pandruvada@linux.intel.com> Acked-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-02-13 13:48:04 -08:00
Nell Hardcastle	0b977de88f	intel_pstate: Add Haswell CPU models commit 6cdcdb793791f776ea9408581b1242b636d43b37 upstream. Enable the intel_pstate driver for Haswell CPUs. One missing Ivy Bridge model (0x3E) is also included. Models referenced from tools/power/x86/turbostat/turbostat.c:has_nehalem_turbo_ratio_limit Signed-off-by: Nell Hardcastle <nell@spicious.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Acked-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-02-13 13:48:04 -08:00
Saravana Kannan	dd16210636	cpufreq: cpu-boost: Fix deadlock in wake_up of sync threads If wake_up() is called on the current task on a CPU, the call will wait until the current task is switched out before it wakes it up again and returns. The sync notifier for a CPU always runs on that CPU. These two together can result in a deadlock if the sync notifier on CPU A tries to wake up the sync thread of CPU A as it goes to sleep (is the current task). A previous commit fixed this by adding a check to the sync notifier to not wake up the sync thread of CPU A if it's the current task. But this is still not sufficient to prevent deadlocks. Sync thread of CPU A could be the current task on CPU B and sync thread of CPU B could be the current task on CPU A. At this point, if sync notifier of CPU A and B try to wake up the sync threads of CPU A and B, it will result in CPU A waiting for the current task in CPU B to get switched out and CPU B waiting for the current task in CPU A to get switched out. This will result in a deadlock. Prevent this scenario from happening by pinning the sync threads of each CPU to run on that CPU. By doing this, we guarantee that sync notifiers will only try to wake up sync threads running on that CPU. The fix added by "cpufreq: cpu-boost: Resolve deadlock when waking up sync thread" ensures a deadlock doesn't happen when a sync notifier tries to wake up a sync thread running on that CPU. Change-Id: I864e545529722a23886dd5a82f66089155d2d193 Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-01-31 17:22:22 -08:00
Saravana Kannan	6131333017	cpufreq: cpu-boost: Fix queue_delayed_work_on() race with hotplug Calling queue_delayed_work_on() on a CPU that's in the process of getting hotplugged out can result in that CPU infinitely looping in msm_pm_wait_cpu_shutdown(). If queue_delayed_work_on() is called after the CPU is hotplugged out, it could wake up the CPU without going through the hotplug path and cause instability. To avoid this, make sure the CPU is and stays online while queuing a work on it. Change-Id: I1b4aae3db803e476b1a7676d08f495c1f38bb154 Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-01-31 17:21:19 -08:00
Junjie Wu	0c3bb4bd90	cpufreq: Set policy for NULL before cleaning up kobj __cpufreq_remove_dev_finish() cleans up the policy->kobj while per_cpu(cpufreq_cpu_data) still contains a valid policy pointer. This causes a race between kobject_get() called from cpufreq_cpu_get() and cpu hotplug. Set cpufreq_cpu_data to NULL before cleaning up kobject so that subsequent cpufreq_cpu_get() will fail and thus not race with cpu hotplug. Change-Id: I0e2d1a64b7aac98aa69e137cc902e07d0edb786e Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-01-24 16:24:51 -08:00
Linux Build Service Account	912b94f9d2	Merge "cpufreq: interactive: Modifying sync_freq implementation"	2014-01-18 07:39:05 -08:00
Linux Build Service Account	b839dcac69	Merge "cpufreq: interactive: Use default min_sample_time if SDF is zero"	2014-01-18 00:47:53 -08:00
Rohit Gupta	878010b9b1	cpufreq: interactive: Modifying sync_freq implementation 1. Check for up_threshold_any_cpu_freq instead of sync_freq to boost the frequency to sync_freq. 2. Change the load threshold name from sync_freq_load_threshold to up_threshold_any_cpu_load 3. Do not consider CPUs with load less than up_threshold_any_cpu_load while evaluating max load across CPUs Change-Id: Ia0e537edbf38a5006c1a22f5c472daa0d086ffc9 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2014-01-16 16:45:02 -08:00
Linux Build Service Account	34c0f6190a	Merge "cpufreq: ondemand: Fix update_sampling_rate race with hotplug"	2014-01-16 13:31:21 -08:00
Linux Build Service Account	b94313f8d8	Merge "cpufreq: interactive: Fix null pointer dereference in interactive governor"	2014-01-16 13:28:56 -08:00
Dirk Brandewie	d0ccf8a115	intel_pstate: Add X86_FEATURE_APERFMPERF to cpu match parameters. commit 6cbd7ee10e2842a3d1f9b60abede1c8f3d1f1130 upstream. KVM environments do not support APERF/MPERF MSRs. intel_pstate cannot operate without these registers. The previous validity checks in intel_pstate_msrs_not_valid() are insufficent in nested KVMs. References: https://bugzilla.redhat.com/show_bug.cgi?id=1046317 Signed-off-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-01-15 15:28:53 -08:00
Vijay Ganti	d3d905242d	cpufreq: interactive: Fix null pointer dereference in interactive governor For the sync_freq feature currently we check pcpu->policy->cur frequency for each online cpu. But for a CPU that isn't using interactive governor or for an offline CPU, pcpu->policy can be null or an invalid value. This patch tries to avoid that scenario by using pcpu->target_freq instead of policy->cur to get the frequency of an online CPU. Kernel crash without this patch: [ 20.132373] Unable to handle kernel NULL pointer dereference at virtual address 00000028 [ 20.132375] pgd = c34f34c0 [ 20.132377] pgd = ef6f2440 [ 20.132383] [00000028] pgd=00000000 [ 20.132385] [ 20.132388] [00000028] pgd=2e98f003, *pmd=00000000 [ 20.132390] Internal error: Oops: 205 [#1] PREEMPT SMP ARM [ 20.132394] Modules linked in: [ 20.132398] CPU: 0 PID: 1560 Comm: chown Tainted: G W 3.10.0-perf-gb12057b-00001-ga2c6c16-dirty #7 [ 20.132401] task: ef9af300 ti: ee49c000 task.ti: ee49c000 [ 20.132411] PC is at cpufreq_interactive_timer+0x10c/0x650 [ 20.132415] LR is at cpufreq_interactive_timer+0x128/0x650 <snip> [ 20.133002] [<c07eb204>] (cpufreq_interactive_timer+0x10c/0x650) from [<c02804d8>] (call_timer_fn+0x80/0x198) [ 20.133012] [<c02804d8>] (call_timer_fn+0x80/0x198) from [<c0280acc>] (run_timer_softirq+0x1f8/0x270) [ 20.133019] [<c0280acc>] (run_timer_softirq+0x1f8/0x270) from [<c0279e20>] (__do_softirq+0x12c/0x2d4) [ 20.133025] [<c0279e20>] (__do_softirq+0x12c/0x2d4) from [<c027a2d4>] (irq_exit+0x74/0xc8) [ 20.133034] [<c027a2d4>] (irq_exit+0x74/0xc8) from [<c0206a00>] (handle_IRQ+0x68/0x8c) [ 20.133041] [<c0206a00>] (handle_IRQ+0x68/0x8c) from [<c02004b8>] (gic_handle_irq+0x3c/0x60) [ 20.133051] [<c02004b8>] (gic_handle_irq+0x3c/0x60) from [<c0ac6900>] (__irq_svc+0x40/0x70) <snip> Change-Id: Ie834f5d383de4d41e0fe6fbd40c8b0a0c05d82f5 Signed-off-by: Vijay Ganti <viganti@codeaurora.org>	2014-01-14 15:14:11 -08:00
Linux Build Service Account	8cf919625e	Merge "cpufreq: Set policy to non-NULL only after all hotplug online work is done"	2014-01-14 02:29:17 -08:00
Rohit Gupta	61c9b7b970	cpufreq: ondemand: Fix update_sampling_rate race with hotplug update_sampling_rate has a for loop which goes through each online cpu and possibly queue up the ondemand work for them. But while doing this it doesnt take any hotplug lock which could potentially cause a race condition where ondemand work is queued after the hotplug code (which sets the policy to NULL) in the governor has cancelled any pending work. This could cause a crash while trying to access the NULL policy in dbs_check_cpu. Protecting the for_each_online_cpu loop with get_online_cpus() and put_online_cpus(). Change-Id: Ia3f43ca7e4bed542834ab03ca1191d728f13311c Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2014-01-13 15:43:17 -08:00
Linux Build Service Account	d86b06e030	Merge "cpufreq: Call cpufreq_update_policy() during cpufreq_stats_init()"	2014-01-10 01:27:42 -08:00
Saravana Kannan	df2bff3319	cpufreq: Set policy to non-NULL only after all hotplug online work is done The existing code sets the per CPU policy to a non-NULL value before all the steps performed during the hotplug online path is done. Specifically, this is done before the policy min/max, governors, etc are initialized for the policy. This in turn means that calls to cpufreq_cpu_get() return a non-NULL policy before the policy/CPU is ready to be used. To fix this, move the update of per CPU policy to a valid value after all the initialization steps for the policy are completed. Example kernel panic without this fix: [ 512.146185] Unable to handle kernel NULL pointer dereference at virtual address 00000020 [ 512.146195] pgd = c0003000 [ 512.146213] [00000020] pgd=80000000004003, pmd=00000000 [ 512.146228] Internal error: Oops: 206 [#1] PREEMPT SMP ARM <snip> [ 512.146297] PC is at __cpufreq_governor+0x10/0x1ac [ 512.146312] LR is at cpufreq_update_policy+0x114/0x150 <snip> [ 512.149740] ---[ end trace f23a8defea6cd706 ]--- [ 512.149761] Kernel panic - not syncing: Fatal exception [ 513.152016] CPU0: stopping [ 513.154710] CPU: 0 PID: 7136 Comm: mpdecision Tainted: G D W 3.10.0-gd727407-00074-g979ede8 #396 <snip> [ 513.317224] [<c0afe180>] (notifier_call_chain+0x40/0x68) from [<c02a23ac>] (__blocking_notifier_call_chain+0x40/0x58) [ 513.327809] [<c02a23ac>] (__blocking_notifier_call_chain+0x40/0x58) from [<c02a23d8>] (blocking_notifier_call_chain+0x14/0x1c) [ 513.339182] [<c02a23d8>] (blocking_notifier_call_chain+0x14/0x1c) from [<c0803c68>] (cpufreq_set_policy+0xd4/0x2b8) [ 513.349594] [<c0803c68>] (cpufreq_set_policy+0xd4/0x2b8) from [<c0803e7c>] (cpufreq_init_policy+0x30/0x98) [ 513.359231] [<c0803e7c>] (cpufreq_init_policy+0x30/0x98) from [<c0805a18>] (__cpufreq_add_dev.isra.17+0x4dc/0x7a4) [ 513.369560] [<c0805a18>] (__cpufreq_add_dev.isra.17+0x4dc/0x7a4) from [<c0805d38>] (cpufreq_cpu_callback+0x58/0x84) [ 513.379978] [<c0805d38>] (cpufreq_cpu_callback+0x58/0x84) from [<c0afe180>] (notifier_call_chain+0x40/0x68) [ 513.389704] [<c0afe180>] (notifier_call_chain+0x40/0x68) from [<c02812dc>] (__cpu_notify+0x28/0x44) [ 513.398728] [<c02812dc>] (__cpu_notify+0x28/0x44) from [<c0aeed90>] (_cpu_up+0xf4/0x1dc) [ 513.406797] [<c0aeed90>] (_cpu_up+0xf4/0x1dc) from [<c0aeeed4>] (cpu_up+0x5c/0x78) [ 513.414357] [<c0aeeed4>] (cpu_up+0x5c/0x78) from [<c0aec808>] (store_online+0x44/0x74) [ 513.422253] [<c0aec808>] (store_online+0x44/0x74) from [<c03a40f4>] (sysfs_write_file+0x108/0x14c) [ 513.431195] [<c03a40f4>] (sysfs_write_file+0x108/0x14c) from [<c03517d4>] (vfs_write+0xd0/0x180) [ 513.439958] [<c03517d4>] (vfs_write+0xd0/0x180) from [<c0351ca8>] (SyS_write+0x38/0x68) [ 513.447947] [<c0351ca8>] (SyS_write+0x38/0x68) from [<c0205de0>] (ret_fast_syscall+0x0/0x30) In this specific case, CPU0 set's CPU1's policy->governor in cpufreq_init_policy() to NULL while CPU1 is using the policy->governor in __cpufreq_governor(). Change-Id: I8b8bc7114e44744f6f38925e4ed710126a45075d Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-01-09 19:51:09 -08:00
Junjie Wu	f30be559b1	cpufreq: Call cpufreq_update_policy() during cpufreq_stats_init() Commit `da12b9488e` (cpufreq: Fix misplaced call to cpufreq_update_policy()) removed cpufreq_update_policy() that was in cpufreq_stats_init(). The fix of moving cpufreq_update_policy() in hotplug notifier is still valid. However, by removing the call in cpufreq_stats_init(), it misses a corner case where cpufreq_stats register last and no hotplug or cpufreq change happens afterwards. Then a CPUFREQ_NOTIFY will never be sent to cpufreq_stats hotplug notifier, and sysfs won't be created. Put back the cpufreq_update_policy() call after registering for hotplug notifier in cpufreq_stats_init(). The call will force a CPUFREQ_NOTIFY notification to be sent and sysfs to be created. In case cpufreq driver is not registered, it will just fail silently. Change-Id: I6d792d0ba600a7d8c70ccb2adae1e2cbadc0463e Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2014-01-09 19:06:21 -08:00
Linux Build Service Account	60040c6828	Merge "cpufreq: cpu-boost: Resolve deadlock when waking up sync thread"	2014-01-09 15:55:19 -08:00
Rafael J. Wysocki	ec84b71390	intel_pstate: Fail initialization if P-state information is missing commit 98a947abdd54e5de909bebadfced1696ccad30cf upstream. If pstate.current_pstate is 0 after the initial intel_pstate_get_cpu_pstates(), this means that we were unable to obtain any useful P-state information and there is no reason to continue, so free memory and return an error in that case. This fixes the following divide error occuring in a nested KVM guest: Intel P-state driver initializing. Intel pstate controlling: cpu 0 cpufreq: __cpufreq_add_dev: ->get() failed divide error: 0000 [#1] SMP Modules linked in: CPU: 0 PID: 1 Comm: swapper/0 Not tainted 3.13.0-0.rc4.git5.1.fc21.x86_64 #1 Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011 task: ffff88001ea20000 ti: ffff88001e9bc000 task.ti: ffff88001e9bc000 RIP: 0010:[<ffffffff815c551d>] [<ffffffff815c551d>] intel_pstate_timer_func+0x11d/0x2b0 RSP: 0000:ffff88001ee03e18 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff88001a454348 RCX: 0000000000006100 RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000 RBP: ffff88001ee03e38 R08: 0000000000000000 R09: 0000000000000000 R10: ffff88001ea20000 R11: 0000000000000000 R12: 00000c0a1ea20000 R13: 1ea200001ea20000 R14: ffffffff815c5400 R15: ffff88001a454348 FS: 0000000000000000(0000) GS:ffff88001ee00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b CR2: 0000000000000000 CR3: 0000000001c0c000 CR4: 00000000000006f0 Stack: fffffffb1a454390 ffffffff821a4500 ffff88001a454390 0000000000000100 ffff88001ee03ea8 ffffffff81083e9a ffffffff81083e15 ffffffff82d5ed40 ffffffff8258cc60 0000000000000000 ffffffff81ac39de 0000000000000000 Call Trace: <IRQ> [<ffffffff81083e9a>] call_timer_fn+0x8a/0x310 [<ffffffff81083e15>] ? call_timer_fn+0x5/0x310 [<ffffffff815c5400>] ? pid_param_set+0x130/0x130 [<ffffffff81084354>] run_timer_softirq+0x234/0x380 [<ffffffff8107aee4>] __do_softirq+0x104/0x430 [<ffffffff8107b5fd>] irq_exit+0xcd/0xe0 [<ffffffff81770645>] smp_apic_timer_interrupt+0x45/0x60 [<ffffffff8176efb2>] apic_timer_interrupt+0x72/0x80 <EOI> [<ffffffff810e15cd>] ? vprintk_emit+0x1dd/0x5e0 [<ffffffff81757719>] printk+0x67/0x69 [<ffffffff815c1493>] __cpufreq_add_dev.isra.13+0x883/0x8d0 [<ffffffff815c14f0>] cpufreq_add_dev+0x10/0x20 [<ffffffff814a14d1>] subsys_interface_register+0xb1/0xf0 [<ffffffff815bf5cf>] cpufreq_register_driver+0x9f/0x210 [<ffffffff81fb19af>] intel_pstate_init+0x27d/0x3be [<ffffffff81761e3e>] ? mutex_unlock+0xe/0x10 [<ffffffff81fb1732>] ? cpufreq_gov_dbs_init+0x12/0x12 [<ffffffff8100214a>] do_one_initcall+0xfa/0x1b0 [<ffffffff8109dbf5>] ? parse_args+0x225/0x3f0 [<ffffffff81f64193>] kernel_init_freeable+0x1fc/0x287 [<ffffffff81f638d0>] ? do_early_param+0x88/0x88 [<ffffffff8174b530>] ? rest_init+0x150/0x150 [<ffffffff8174b53e>] kernel_init+0xe/0x130 [<ffffffff8176e27c>] ret_from_fork+0x7c/0xb0 [<ffffffff8174b530>] ? rest_init+0x150/0x150 Code: c1 e0 05 48 63 bc 03 10 01 00 00 48 63 83 d0 00 00 00 48 63 d6 48 c1 e2 08 c1 e1 08 4c 63 c2 48 c1 e0 08 48 98 48 c1 e0 08 48 99 <49> f7 f8 48 98 48 0f af f8 48 c1 ff 08 29 f9 89 ca c1 fa 1f 89 RIP [<ffffffff815c551d>] intel_pstate_timer_func+0x11d/0x2b0 RSP <ffff88001ee03e18> ---[ end trace f166110ed22cc37a ]--- Kernel panic - not syncing: Fatal exception in interrupt Reported-and-tested-by: Kashyap Chamarthy <kchamart@redhat.com> Cc: Josh Boyer <jwboyer@fedoraproject.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2014-01-09 12:24:23 -08:00
Saravana Kannan	93d462463f	cpufreq: Fix policy getting stuck when user & kernel min/max don't overlap Every __cpufreq_set_policy starts with checking the new policy min/max has some overlap with the current policy min/max. This works out fine until we end up with the policy min/max being set to a range that doesn't overlap with the user policy min/max. Once we get into this situation, the check at the start of __cpufreq_set_policy fails and prevents us from getting out of this state. This only happens when one of the CPUFREQ_ADJUST/CPUFREQ_INCOMPATIBLE notifiers called inside __cpufreq_set_policy pick a min/max outside the range of user policy min/max. The real intent of the check at the start of __cpufreq_set_policy is to make sure userspace can't set user policy min > user policy max. Since __cpufreq_set_policy always gets called only with current user policy min/max except when the actual user space policy min/max is changed, we can fix the issue by simply checking the new policy min/max against current user policy min/max. Change-Id: Iaac805825e64d7985c41fb9052bd96baacdf3d6f Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2014-01-07 17:43:56 -08:00
Junjie Wu	e0cc22a8a0	cpufreq: interactive: Use correct kobj when creating sysfs cpufreq_global_kobject is no longer initialized during cpufreq_core_init. Fix sysfs creation by properly requesting the kobject based on have_governor_per_policy(). Change-Id: I5f9cff68043dad8822952bd43227d948a934a1c7 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-23 14:10:55 -08:00
lan,Tianyu	43c1a00e9d	cpufreq: governor: Remove fossil comment in the cpufreq_governor_dbs() The related code has been changed and the comment is out of date. So remove it. Signed-off-by: Lan Tianyu <tianyu.lan@intel.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: aae467c79b14db0d286764ed9ddbaefe3715ebd2 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:20 -08:00
Xiaoguang Chen	8c66d27021	cpufreq: conservative: set requested_freq to policy max when it is over policy max When requested_freq is over policy->max, set it to policy->max. This can help to speed up decreasing frequency. Signed-off-by: Xiaoguang Chen <chenxg@marvell.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 6d7bcb1464a89181ddc4b4584ad6e0c7566ae31b Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:20 -08:00
Xiaoguang Chen	0d941cead1	cpufreq: conservative: fix requested_freq reduction issue When decreasing frequency, requested_freq may be less than freq_target, So requested_freq minus freq_target may be negative, But reqested_freq's unit is unsigned int, then the negative result will be one larger interger which may be even higher than requested_freq. This patch is to fix such issue. when result becomes negative, set requested_freq as the min value of policy. Signed-off-by: Xiaoguang Chen <chenxg@marvell.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3baa976ae644f76f5cdb5be0fb26754c3bfb32cb Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:19 -08:00
Stratos Karafotis	e2b0e8ad81	cpufreq: ondemand: Remove redundant return statement After commit dfa5bb622555 (cpufreq: ondemand: Change the calculation of target frequency), this return statement is no longer needed. Reported-by: Henrik Nilsson <Karl.Henrik.Nilsson@gmail.com> Signed-off-by: Stratos Karafotis <stratosk@semaphore.gr> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 880eef041655b35f9aa488726ea3c4303a4f2204 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:19 -08:00
Viresh Kumar	c0ff1bbed6	cpufreq: move freq change notifications to cpufreq core Most of the drivers do following in their ->target_index() routines: struct cpufreq_freqs freqs; freqs.old = old freq... freqs.new = new freq... cpufreq_notify_transition(policy, &freqs, CPUFREQ_PRECHANGE); /* Change rate here */ cpufreq_notify_transition(policy, &freqs, CPUFREQ_POSTCHANGE); This is replicated over all cpufreq drivers today and there doesn't exists a good enough reason why this shouldn't be moved to cpufreq core instead. There are few special cases though, like exynos5440, which doesn't do everything on the call to ->target_index() routine and call some kind of bottom halves for doing this work, work/tasklet/etc.. They may continue doing notification from their own code as flag: CPUFREQ_ASYNC_NOTIFICATION is already set for them. All drivers are also modified in this patch to avoid breaking 'git bisect', as double notification would happen otherwise. Acked-by: Hans-Christian Egtvedt <egtvedt@samfundet.no> Acked-by: Jesper Nilsson <jesper.nilsson@axis.com> Acked-by: Linus Walleij <linus.walleij@linaro.org> Acked-by: Russell King <linux@arm.linux.org.uk> Acked-by: Stephen Warren <swarren@nvidia.com> Tested-by: Andrew Lunn <andrew@lunn.ch> Tested-by: Nicolas Pitre <nicolas.pitre@linaro.org> Reviewed-by: Lan Tianyu <tianyu.lan@intel.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: d4019f0a92ab802f385cc9c8ad3ab7b5449712cb Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: dropped all conflicted changes in arch specific cpufreq files that we don't use] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:19 -08:00
Viresh Kumar	a6a7321205	cpufreq: distinguish drivers that do asynchronous notifications There are few special cases like exynos5440 which doesn't send POSTCHANGE notification from their ->target() routine and call some kind of bottom halves for doing this work, work/tasklet/etc.. From which they finally send POSTCHANGE notification. Its better if we distinguish them from other cpufreq drivers in some way so that core can handle them specially. So this patch introduces another flag: CPUFREQ_ASYNC_NOTIFICATION, which will be set by such drivers. This also changes exynos5440-cpufreq.c and powernow-k8 in order to set this flag. Acked-by: Amit Daniel Kachhap <amit.daniel@samsung.com> Acked-by: Kukjin Kim <kgene.kim@samsung.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 7dbf694db6ac7c759599316d50d7050efcbd512a Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:19 -08:00
Nicolas Pitre	4d0f60f2c7	cpufreq: arm_big_little: reconfigure switcher behavior at run time The b.L switcher can be turned on/off at run time. It is therefore necessary to change the cpufreq driver behavior accordingly. The driver must be unregistered/registered with the cpufreq core to reconfigure freq tables for the virtual or actual CPUs. This is accomplished via the b.L switcher notifier callback. Signed-off-by: Nicolas Pitre <nicolas.pitre@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 45cac118ffd7c9920b3d85bf551c2205674eb4f2 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:17 -08:00
Viresh Kumar	3b35d5501e	cpufreq: arm_big_little: add in-kernel switching (IKS) support This patch adds IKS (In Kernel Switcher) support to cpufreq driver. This creates a combined freq table for A7-A15 CPU pairs. A7 frequencies are virtualized and scaled down to half the actual frequencies to approximate a linear scale across the combined A7+A15 range. When the requested frequency change crosses the A7-A15 boundary a cluster switch is invoked. Based on earlier work from Sudeep KarkadaNagesha. Signed-off-by: Sudeep KarkadaNagesha <sudeep.karkadanagesha@arm.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Nicolas Pitre <nico@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e79a23c5b9870b7f80425793abeb10e57f7486d4 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:17 -08:00
viresh kumar	ca72f8bfb0	cpufreq: create per policy rwsem instead of per CPU cpu_policy_rwsem We have per-CPU cpu_policy_rwsem for cpufreq core, but we never use all of them. We always use rwsem of policy->cpu and so we can actually make this rwsem per policy instead. This patch does this change. With this change other tricky situations are also avoided now, like which lock to take while we are changing policy->cpu, etc. Suggested-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Tested-by: Andrew Lunn <andrew@lunn.ch> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: ad7722dab7292dbc1c4586d701ac226b68122d39 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:17 -08:00
Viresh Kumar	28ffa72403	cpufreq: Implement light weight ->target_index() routine Currently, the prototype of cpufreq_drivers target routines is: int target(struct cpufreq_policy policy, unsigned int target_freq, unsigned int relation); And most of the drivers call cpufreq_frequency_table_target() to get a valid index of their frequency table which is closest to the target_freq. And they don't use target_freq and relation after that. So, it makes sense to just do this work in cpufreq core before calling cpufreq_frequency_table_target() and simply pass index instead. But this can be done only with drivers which expose their frequency table with cpufreq core. For others we need to stick with the old prototype of target() until those drivers are converted to expose frequency tables. This patch implements the new light weight prototype for target_index() routine. It looks like this: int target_index(struct cpufreq_policy policy, unsigned int index); CPUFreq core will call cpufreq_frequency_table_target() before calling this routine and pass index to it. Because CPUFreq core now requires to call routines present in freq_table.c CONFIG_CPU_FREQ_TABLE must be enabled all the time. This also marks target() interface as deprecated. So, that new drivers avoid using it. And Documentation is updated accordingly. It also converts existing .target() to newly defined light weight .target_index() routine for many driver. Acked-by: Hans-Christian Egtvedt <egtvedt@samfundet.no> Acked-by: Jesper Nilsson <jesper.nilsson@axis.com> Acked-by: Linus Walleij <linus.walleij@linaro.org> Acked-by: Russell King <linux@arm.linux.org.uk> Acked-by: David S. Miller <davem@davemloft.net> Tested-by: Andrew Lunn <andrew@lunn.ch> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rjw@rjwysocki.net> Git-commit: 9c0ebcf78fde0ffa348a95a544c6d3f2dac5af65 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: ignored all arch specific files that generated conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:16 -08:00
Lan Tianyu	ebb72a01ef	cpufreq / governor: Remove fossil comment cpufreq_set_policy() has been changed to origin __cpufreq_set_policy() and policy->lock has been converted to rewrite lock by commit 5a01f2. So remove the comment. Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Lan Tianyu <tianyu.lan@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: a814613b9a32d9ab9578d9dab396265c826d37f0 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:16 -08:00
Srivatsa S. Bhat	0e6d14088d	cpufreq: Detect spurious invocations of update_policy_cpu() The function update_policy_cpu() is expected to be called when the policy->cpu of a cpufreq policy is to be changed: ie., the new CPU nominated to become the policy->cpu is different from the old one. Print a warning if it is invoked with new_cpu == old_cpu, since such an invocation might hint at a faulty logic in the caller. Suggested-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 99ec899eafe2ec0a7dd96e9de5fa0a2bea3032ba Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:16 -08:00
Sudeep KarkadaNagesha	afc20abfb0	cpufreq: arm-big-little: use clk_get instead of clk_get_sys Currently clk_get_sys is used with cpu-cluster.<n> as the device id which is incorrect. It should be connection/consumer ID instead. It is possible to specify input clock in the cpu device node along with the optional clock-name. clk_get_sys can't handle that. This patch replaces clk_get_sys with clk_get to extend support for clocks specified in the device tree cpu node. Signed-off-by: Sudeep KarkadaNagesha <sudeep.karkadanagesha@arm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 076dec90fc32c830184b0f0fa1842a6de1199bc6 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:16 -08:00
Viresh Kumar	67f699ebc1	cpufreq: remove CONFIG_CPU_FREQ_TABLE CONFIG_CPU_FREQ_TABLE will be always enabled when cpufreq framework is used, as cpufreq core depends on it. So, we don't need this CONFIG option anymore as it is not configurable. Remove CONFIG_CPU_FREQ_TABLE and update its users. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3bc28ab6da039f8020bbcea8e832b63a900bdb66 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolved merge conflicts for Kconfig.arm and Kconfig.powerpc by ignoring missing configs. Searched and removed CPU_FREQ_TABLE config in our tree (arch/arm/mach-tegra/Kconfig, arch/powerpc/platforms/Kconfig, Documentation/android.txt). These conflicts are generated because we don't pull Kconfig changes for archs we don't use.] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:16 -08:00
Viresh Kumar	45b1c5e57e	cpufreq: create cpufreq_generic_init() routine Many CPUFreq drivers for SMP system (where all cores share same clock lines), do similar stuff in their ->init() part. This patch creates a generic routine in cpufreq core which can be used by these so that we can remove some redundant code. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 70e9e778337973d5bf57004092b360bd3f3c412f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:15 -08:00
Viresh Kumar	b88c9a3589	cpufreq: arm_big_little: don't initialize part of policy is set by core Many common initializations of struct policy are moved to core now and hence this driver doesn't need to do it. This patch removes such code. Most recent of those changes is to call ->get() in the core after calling ->init(). Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e4c8afe3a06c682e215c3e38240126b652fa98d0 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:15 -08:00
Viresh Kumar	3db386d97a	cpufreq: call cpufreq_driver->get() after calling ->init() Almost all drivers set policy->cur with current CPU frequency in their ->init() part. This can be done for all of them at core level and so they wouldn't need to do it. This patch adds supporting code in cpufreq core for calling get() after we have called init() for a policy. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: da60ce9f2faca87013fd3cab1c3bed5183608c3d Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:15 -08:00
Viresh Kumar	1a491aee5a	cpufreq: arm_big_little: Use generic cpufreq routines Most of the CPUFreq drivers do similar things in .exit() and .verify() routines and .attr. So its better if we have generic routines for them which can be used by cpufreq drivers then. This patch uses these generic routines in the arm_big_little driver. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3c75a1503f2c5ca91279436b1f573002c869ef06 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:15 -08:00
Viresh Kumar	00d1c89bb8	cpufreq: define generic .attr, .exit() and .verify() routines Most of the CPUFreq drivers do similar things in .exit() and .verify() routines and .attr. So its better if we have generic routines for them which can be used by cpufreq drivers then. This patch introduces generic .attr, .exit() and .verify() cpufreq drivers. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 184345129c53e76069c209f9912ed7c457eceb31 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:14 -08:00
Viresh Kumar	74253297ce	cpufreq: add new routine cpufreq_verify_within_cpu_limits() Most of the users of cpufreq_verify_within_limits() calls it for limiting with min/max from policy->cpuinfo. We can make that code simple by introducing another routine which will do this for them automatically. This patch adds another routine cpufreq_verify_within_cpu_limits() and updates others to use it. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Acked-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: be49e3465f222b4b796be8a21d14afbfd8f5d20f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:14 -08:00
Viresh Kumar	9e07689844	cpufreq: use cpufreq_driver->flags to mark CPUFREQ_HAVE_GOVERNOR_PER_POLICY Use cpufreq_driver->flags to mark CPUFREQ_HAVE_GOVERNOR_PER_POLICY instead of a separate field within cpufreq_driver. This will save some bytes of memory. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 0b981e70748861a3e10ea2e2a689bdcee3e15085 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:14 -08:00
Viresh Kumar	de2f4cb19f	cpufreq: rename __cpufreq_set_policy() as cpufreq_set_policy() Earlier there used to be two functions named __cpufreq_set_policy() and cpufreq_set_policy(), but now we only have a single routine lets name it cpufreq_set_policy() instead of __cpufreq_set_policy(). This also removes some invalid comments or fixes some incorrect comments. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 037ce8397d23b2f84ccfb879cf4b43277b0454e3 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:13 -08:00
Viresh Kumar	442f568915	cpufreq: Optimize cpufreq_frequency_table_verify() cpufreq_frequency_table_verify() is rewritten here to make it more logical and efficient. - merge multiple lines for variable declarations together. - quit early if any frequency between min/max is found. - don't call cpufreq_verify_within_limits() in case any valid freq is found as it is of no use. - rename the count variable as found and change its type to boolean. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 77db50c4eb1991d6e88254390ec368e1d23a8fa5 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:13 -08:00
Viresh Kumar	4beb48071b	cpufreq: remove __cpufreq_remove_dev() Nobody except cpufreq_remove_dev() calls __cpufreq_remove_dev() and so we don't need two separate routines here. Merge code from __cpufreq_remove_dev() into cpufreq_remove_dev() and get rid of __cpufreq_remove_dev(). Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 27a862e98341226a50835f29aa26ffa528215ecc Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:13 -08:00
Viresh Kumar	780b477ed7	cpufreq: don't break string in print statements As a rule its better not to break string (quoted inside "") in a print statement even if it crosses 80 column boundary as that may introduce bugs and so this patch rewrites one of the print statements.. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 75949c9a1fe0fd07983788449059337edac2b9f6 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:13 -08:00
Viresh Kumar	59f75cc637	cpufreq: Remove extra blank line We don't need a blank line just at start of a block, lets remove it. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: bbdd04ab1f375ef46a0e2d98de439863d35e4d3e Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:12 -08:00
Viresh Kumar	0fd511e4dc	cpufreq: remove invalid comment from __cpufreq_remove_dev() Some section of kerneldoc comment for __cpufreq_remove_dev() is invalid now. Remove it. Suggested-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 67a29e558b17a923c3a53c348315c572b8ca261a Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:12 -08:00
Viresh Kumar	2f59f0d636	cpufreq: make return type of lock_policy_rwsem_{read\|write}() as void lock_policy_rwsem_{read\|write}() currently has return type of int, but it always returns zero and hence its return type should be void instead. This patch makes that change and modifies all of the users accordingly. Reported-by: Jon Medhurst<tixy@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 1b750e3bdae5b2d0f3d377b0c56e7465f85b67f2 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:12 -08:00
Viresh Kumar	807d40fea3	cpufreq: arm_big_little: call cpufreq_frequency_table_put_attr() Drivers which have an exit path must call cpufreq_frequency_table_put_attr() if they have called cpufreq_frequency_table_get_attr() in their init path. This driver was missing this part and is fixed with this patch. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 2457dac670f287b260d50792988f4788f403ca32 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:12 -08:00
Viresh Kumar	2e99c3a3d0	cpufreq: arm_big_little: use cpufreq_table_validate_and_show() Lets use cpufreq_table_validate_and_show() instead of calling cpufreq_frequency_table_cpuinfo() and cpufreq_frequency_table_get_attr(). Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 39b10ebe5d30ef46ddea1daa89ca55bd2c817d7b Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:11 -08:00
Viresh Kumar	a10db86d70	cpufreq: Add new helper cpufreq_table_validate_and_show() Almost every cpufreq driver is required to validate its frequency table with: cpufreq_frequency_table_cpuinfo() and then expose it to cpufreq core with: cpufreq_frequency_table_get_attr(). This patch creates another helper routine cpufreq_table_validate_and_show() that will do both these steps in a single call and will return 0 for success, error otherwise. This also fixes potential bugs in cpufreq drivers where people have called cpufreq_frequency_table_get_attr() before calling cpufreq_frequency_table_cpuinfo(), as the later may fail. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 27047a603645d0885bcd72d7a0b6cce6e3c94ca7 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:11 -08:00
Nishanth Menon	a2dc9b1bc9	PM / OPP: rename header to linux/pm_opp.h Since Operating Performance Points (OPP) functions are specific to device specific power management, be specific and rename opp.h to pm_opp.h Reported-by: Randy Dunlap <rdunlap@infradead.org> Signed-off-by: Nishanth Menon <nm@ti.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e4db1c7439b31993a4886b273bb9235a8eea82bf Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:11 -08:00
Nishanth Menon	4dfc0d4764	PM / OPP: rename functions to dev_pm_opp* Since Operating Performance Points (OPP) functions are specific to device specific power management, be specific and rename opp_* accessors in OPP library with dev_pm_opp_* equivalent. Affected functions are: opp_get_voltage opp_get_freq opp_get_opp_count opp_find_freq_exact opp_find_freq_floor opp_find_freq_ceil opp_add opp_enable opp_disable opp_get_notifier opp_init_cpufreq_table opp_free_cpufreq_table Reported-by: Randy Dunlap <rdunlap@infradead.org> Signed-off-by: Nishanth Menon <nm@ti.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5d4879cda67b09f086807821cf594ee079d6dfbe Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:11 -08:00
Viresh Kumar	f1093739ff	cpufreq: check cpufreq driver is valid and cpufreq isn't disabled in cpufreq_get() cpufreq_get() can be called from external drivers which might not be aware if cpufreq driver is registered or not. And so we should actually check if cpufreq driver is registered or not and also if cpufreq is active or disabled, at the beginning of cpufreq_get(). Otherwise call to lock_policy_rwsem_read() might hit BUG_ON(!policy). Reported-and-tested-by: Linus Walleij <linus.walleij@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 26ca8694344af4c833d22590c5b77d6b9cff0722 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:10 -08:00
Yinghai Lu	ed0ea642ab	cpufreq: return EEXIST instead of EBUSY for second registering On systems that support intel_pstate, acpi_cpufreq fails to load, and udev keeps trying until trace gets filled up and kernel crashes. The root cause is driver return ret from cpufreq_register_driver(), because when some other driver takes over before, it will return EBUSY and then udev will keep trying ... cpufreq_register_driver() should return EEXIST instead so that the system can boot without appending intel_pstate=disable and still use intel_pstate. Signed-off-by: Yinghai Lu <yinghai@kernel.org> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 4dea5806d332f91d640d99943db99a5539e832c3 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:10 -08:00
Viresh Kumar	eb24ac11f5	cpufreq: unlock correct rwsem while updating policy->cpu Current code looks like this: WARN_ON(lock_policy_rwsem_write(cpu)); update_policy_cpu(policy, new_cpu); unlock_policy_rwsem_write(cpu); {lock\|unlock}_policy_rwsem_write(cpu) takes/releases policy->cpu's rwsem. Because cpu is changing with the call to update_policy_cpu(), the unlock_policy_rwsem_write() will release the incorrect lock. The right solution would be to release the same lock as was taken earlier. Also update_policy_cpu() was also called from cpufreq_add_dev() without any locks and so its better if we move this locking to inside update_policy_cpu(). This patch fixes a regression introduced in 3.12 by commit f9ba680d23 (cpufreq: Extract the handover of policy cpu to a helper function). Reported-and-tested-by: Jon Medhurst<tixy@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 8efd57657d8ef666810b55e609da72de92314dc4 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:10 -08:00
Viresh Kumar	240a20eaf0	cpufreq: Clear policy->cpus bits in __cpufreq_remove_dev_finish() This broke after a recent change "cedb70a cpufreq: Split __cpufreq_remove_dev() into two parts" from Srivatsa. Consider a scenario where we have two CPUs in a policy (0 & 1) and we are removing CPU 1. On the call to __cpufreq_remove_dev_prepare() we have cleared 1 from policy->cpus and now on a call to __cpufreq_remove_dev_finish() we read cpumask_weight of policy->cpus, which will come as 1 and this code will behave as if we are removing the last CPU from policy :) Fix it by clearing the CPU mask in __cpufreq_remove_dev_finish() instead of __cpufreq_remove_dev_prepare(). Tested-by: Stephen Warren <swarren@wwwdotorg.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 9c8f1ee40b6368e6b2775c9c9f816e2a5dca3c07 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:10 -08:00
Lan Tianyu	8f4301c5f5	cpufreq: Acquire the lock in cpufreq_policy_restore() for reading In cpufreq_policy_restore() before system suspend policy is read from percpu's cpufreq_cpu_data_fallback. It's a read operation rather than a write one, so take the lock for reading in there. Signed-off-by: Lan Tianyu <tianyu.lan@intel.com> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 44871c9c7f7963f8869dd8bc9620221c9e9db153 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:10 -08:00
Srivatsa S. Bhat	005b70df3b	cpufreq: Prevent problems in update_policy_cpu() if last_cpu == new_cpu If update_policy_cpu() is invoked with the existing policy->cpu itself as the new-cpu parameter, then a lot of things can go terribly wrong. In its present form, update_policy_cpu() always assumes that the new-cpu is different from policy->cpu and invokes other functions to perform their respective updates. And those functions implement the actual update like this: per_cpu(..., new_cpu) = per_cpu(..., last_cpu); per_cpu(..., last_cpu) = NULL; Thus, when new_cpu == last_cpu, the final NULL assignment makes the per-cpu references vanish into thin air! (memory leak). From there, it leads to more problems: cpufreq_stats_create_table() now doesn't find the per-cpu reference and hence tries to create a new sysfs-group; but sysfs already had created the group earlier, so it complains that it cannot create a duplicate filename. In short, the repercussions of a rather innocuous invocation of update_policy_cpu() can turn out to be pretty nasty. Ideally update_policy_cpu() should handle this situation (new == last) gracefully, and not lead to such severe problems. So fix it by adding an appropriate check. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Tested-by: Stephen Warren <swarren@nvidia.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: cb38ed5cf1c4fdb7454e4b48fb70c396f5acfb21 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:09 -08:00
Srivatsa S. Bhat	def400bcef	cpufreq: Restructure if/else block to avoid unintended behavior In __cpufreq_remove_dev_prepare(), the code which decides whether to remove the sysfs link or nominate a new policy cpu, is governed by an if/else block with a rather complex set of conditionals. Worse, they harbor a subtlety which leads to certain unintended behavior. The code looks like this: if (cpu != policy->cpu && !frozen) { sysfs_remove_link(&dev->kobj, "cpufreq"); } else if (cpus > 1) { new_cpu = cpufreq_nominate_new_policy_cpu(...); ... update_policy_cpu(..., new_cpu); } The original intention was: If the CPU going offline is not policy->cpu, just remove the link. On the other hand, if the CPU going offline is the policy->cpu itself, handover the policy->cpu job to some other surviving CPU in that policy. But because the 'if' condition also includes the 'frozen' check, now there are two possibilities by which we can enter the 'else' block: 1. cpu == policy->cpu (intended) 2. cpu != policy->cpu && frozen (unintended) Due to the second (unintended) scenario, we end up spuriously nominating a CPU as the policy->cpu, even when the existing policy->cpu is alive and well. This can cause problems further down the line, especially when we end up nominating the same policy->cpu as the new one (ie., old == new), because it totally confuses update_policy_cpu(). To avoid this mess, restructure the if/else block to only do what was originally intended, and thus prevent any unwelcome surprises. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Tested-by: Stephen Warren <swarren@nvidia.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 61173f256a3bebfbd09b4bd2c164dde378614091 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:09 -08:00
Srivatsa S. Bhat	63384c5ec3	cpufreq: Fix crash in cpufreq-stats during suspend/resume Stephen Warren reported that the cpufreq-stats code hits a NULL pointer dereference during the second attempt to suspend a system. He also pin-pointed the problem to commit 5302c3f "cpufreq: Perform light-weight init/teardown during suspend/resume". That commit actually ensured that the cpufreq-stats table and the cpufreq-stats sysfs entries are not torn down (ie., not freed) during suspend/resume, which makes it all the more surprising. However, it turns out that the root-cause is not that we access an already freed memory, but that the reference to the allocated memory gets moved around and we lose track of that during resume, leading to the reported crash in a subsequent suspend attempt. In the suspend path, during CPU offline, the value of policy->cpu is updated by choosing one of the surviving CPUs in that policy, as long as there is atleast one CPU in that policy. And cpufreq_stats_update_policy_cpu() is invoked to update the reference to the stats structure by assigning it to the new CPU. However, in the resume path, during CPU online, we end up assigning a fresh CPU as the policy->cpu, without letting cpufreq-stats know about this. Thus the reference to the stats structure remains (incorrectly) associated with the old CPU. So, in a subsequent suspend attempt, during CPU offline, we end up accessing an incorrect location to get the stats structure, which eventually leads to the NULL pointer dereference. Fix this by letting cpufreq-stats know about the update of the policy->cpu during CPU online in the resume path. (Also, move the update_policy_cpu() function higher up in the file, so that __cpufreq_add_dev() can invoke it). Reported-and-tested-by: Stephen Warren <swarren@nvidia.com> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 0d66b91ebff49841f607a3c079984c907c8a4199 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:09 -08:00
Rafael J. Wysocki	7b13c20fec	Revert "cpufreq: make sure frequency transitions are serialized" Commit 7c30ed5 (cpufreq: make sure frequency transitions are serialized) attempted to serialize frequency transitions by adding checks to the CPUFREQ_PRECHANGE and CPUFREQ_POSTCHANGE notifications. However, it assumed that the notifications will always originate from the driver's .target() callback, but they also can be triggered by cpufreq_out_of_sync() and that leads to warnings like this on some systems: WARNING: CPU: 0 PID: 14543 at drivers/cpufreq/cpufreq.c:317 __cpufreq_notify_transition+0x238/0x260() In middle of another frequency transition accompanied by a call trace similar to this one: [<ffffffff81720daa>] dump_stack+0x46/0x58 [<ffffffff8106534c>] warn_slowpath_common+0x8c/0xc0 [<ffffffff815b8560>] ? acpi_cpufreq_target+0x320/0x320 [<ffffffff81065436>] warn_slowpath_fmt+0x46/0x50 [<ffffffff815b1ec8>] __cpufreq_notify_transition+0x238/0x260 [<ffffffff815b33be>] cpufreq_notify_transition+0x3e/0x70 [<ffffffff815b345d>] cpufreq_out_of_sync+0x6d/0xb0 [<ffffffff815b370c>] cpufreq_update_policy+0x10c/0x160 [<ffffffff815b3760>] ? cpufreq_update_policy+0x160/0x160 [<ffffffff81413813>] cpufreq_set_cur_state+0x8c/0xb5 [<ffffffff814138df>] processor_set_cur_state+0xa3/0xcf [<ffffffff8158e13c>] thermal_cdev_update+0x9c/0xb0 [<ffffffff8159046a>] step_wise_throttle+0x5a/0x90 [<ffffffff8158e21f>] handle_thermal_trip+0x4f/0x140 [<ffffffff8158e377>] thermal_zone_device_update+0x57/0xa0 [<ffffffff81415b36>] acpi_thermal_check+0x2e/0x30 [<ffffffff81415ca0>] acpi_thermal_notify+0x40/0xdc [<ffffffff813e7dbd>] acpi_device_notify+0x19/0x1b [<ffffffff813f8241>] acpi_ev_notify_dispatch+0x41/0x5c [<ffffffff813e3fbe>] acpi_os_execute_deferred+0x25/0x32 [<ffffffff81081060>] process_one_work+0x170/0x4a0 [<ffffffff81082121>] worker_thread+0x121/0x390 [<ffffffff81082000>] ? manage_workers.isra.20+0x170/0x170 [<ffffffff81088fe0>] kthread+0xc0/0xd0 [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 [<ffffffff8173582c>] ret_from_fork+0x7c/0xb0 [<ffffffff81088f20>] ? flush_kthread_worker+0xb0/0xb0 For this reason, revert commit 7c30ed5 along with the fix 266c13d (cpufreq: Fix serialization of frequency transitions) on top of it and we will revisit the serialization problem later. Reported-by: Alessandro Bono <alessandro.bono@gmail.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 798282a8718347b04a2f0a4bae7d775c48c6bcb9 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:09 -08:00
Srivatsa S. Bhat	4f03fd6c97	cpufreq: Use signed type for 'ret' variable, to store negative error values There are places where the variable 'ret' is declared as unsigned int and then used to store negative return values such as -EINVAL. Fix them by declaring the variable as a signed quantity. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5136fa56582beadb7fa71eb30bc79148bfcba5c1 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:08 -08:00
Srivatsa S. Bhat	656b745f10	cpufreq: Remove temporary fix for race between CPU hotplug and sysfs-writes Commit "cpufreq: serialize calls to __cpufreq_governor()" had been a temporary and partial solution to the race condition between writing to a cpufreq sysfs file and taking a CPU offline. Now that we have a proper and complete solution to that problem, remove the temporary fix. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 56d07db274b7b15ca38b60ea4a762d40de093000 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:08 -08:00
Srivatsa S. Bhat	d6f5eef479	cpufreq: Synchronize the cpufreq store_() routines with CPU hotplug The functions that are used to write to cpufreq sysfs files (such as store_scaling_max_freq()) are not hotplug safe. They can race with CPU hotplug tasks and lead to problems such as trying to acquire an already destroyed timer-mutex etc. Eg: __cpufreq_remove_dev() __cpufreq_governor(policy, CPUFREQ_GOV_STOP); policy->governor->governor(policy, CPUFREQ_GOV_STOP); cpufreq_governor_dbs() case CPUFREQ_GOV_STOP: mutex_destroy(&cpu_cdbs->timer_mutex) cpu_cdbs->cur_policy = NULL; <PREEMPT> store() __cpufreq_set_policy() __cpufreq_governor(policy, CPUFREQ_GOV_LIMITS); policy->governor->governor(policy, CPUFREQ_GOV_LIMITS); case CPUFREQ_GOV_LIMITS: mutex_lock(&cpu_cdbs->timer_mutex); <-- Warning (destroyed mutex) if (policy->max < cpu_cdbs->cur_policy->cur) <- cur_policy == NULL So use get_online_cpus()/put_online_cpus() in the store_() functions, to synchronize with CPU hotplug. However, there is an additional point to note here: some parts of the CPU teardown in the cpufreq subsystem are done in the CPU_POST_DEAD stage, with cpu_hotplug.lock released. So, using the get/put_online_cpus() functions alone is insufficient; we should also ensure that we don't race with those latter steps in the hotplug sequence. We can easily achieve this by checking if the CPU is online before proceeding with the store, since the CPU would have been marked offline by the time the CPU_POST_DEAD notifiers are executed. Reported-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 4f750c930822b92df74327a4d1364eff87701360 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:08 -08:00
Srivatsa S. Bhat	844015c914	cpufreq: Invoke __cpufreq_remove_dev_finish() after releasing cpu_hotplug.lock __cpufreq_remove_dev_finish() handles the kobject cleanup for a CPU going offline. But because we destroy the kobject towards the end of the CPU offline phase, there are certain race windows where a task can try to write to a cpufreq sysfs file (eg: using store_scaling_max_freq()) while we are taking that CPU offline, and this can bump up the kobject refcount, which in turn might hinder the CPU offline task from running to completion. (It can also cause other more serious problems such as trying to acquire a destroyed timer-mutex etc., depending on the exact stage of the cleanup at which the task managed to take a new refcount). To fix the race window, we will need to synchronize those store_() call-sites with CPU hotplug, using get_online_cpus()/put_online_cpus(). However, that in turn can cause a total deadlock because it can end up waiting for the CPU offline task to complete, with incremented refcount! Write to sysfs CPU offline task -------------- ---------------- kobj_refcnt++ Acquire cpu_hotplug.lock get_online_cpus(); Wait for kobj_refcnt to drop to zero DEADLOCK* A simple way to avoid this problem is to perform the kobject cleanup in the CPU offline path, with the cpu_hotplug.lock released. That is, we can perform the wait-for-kobj-refcnt-to-drop as well as the subsequent cleanup in the CPU_POST_DEAD stage of CPU offline, which is run with cpu_hotplug.lock released. Doing this helps us avoid deadlocks due to holding kobject refcounts and waiting on each other on the cpu_hotplug.lock. (Note: We can't move all of the cpufreq CPU offline steps to the CPU_POST_DEAD stage, because certain things such as stopping the governors have to be done before the outgoing CPU is marked offline. So retain those parts in the CPU_DOWN_PREPARE stage itself). Reported-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 1aee40ac9c86759c05f2ceb4523642b22fc8ea36 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:08 -08:00
Srivatsa S. Bhat	c834b85822	cpufreq: Split __cpufreq_remove_dev() into two parts During CPU offline, the cpufreq core invokes __cpufreq_remove_dev() to perform work such as stopping the cpufreq governor, clearing the CPU from the policy structure etc, and finally cleaning up the kobject. There are certain subtle issues related to the kobject cleanup, and it would be much easier to deal with them if we separate that part from the rest of the cleanup-work in the CPU offline phase. So split the __cpufreq_remove_dev() function into 2 parts: one that handles the kobject cleanup, and the other that handles the rest of the work. Reported-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: cedb70afd077b00bff7379042fdbf7eef32606c9 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:07 -08:00
Andreas Schwab	0367d22be7	cpufreq: Fix wrong time unit conversion The time spent by a CPU under a given frequency is stored in jiffies unit in the cpu var cpufreq_stats_table->time_in_state[i], i being the index of the frequency. This is what is displayed in the following file on the right column: cat /sys/devices/system/cpu/cpuX/cpufreq/stats/time_in_state `2301000` 19835820 2300000 3172 [...] Now cpufreq converts this jiffies unit delta to clock_t before returning it to the user as in the above file. And that conversion is achieved using the API cputime64_to_clock_t(). Although it accidentally works on traditional tick based cputime accounting, where cputime_t maps directly to jiffies, it doesn't work with other types of cputime accounting such as CONFIG_VIRT_CPU_ACCOUNTING_* where cputime_t can map to nsecs or any granularity preffered by the architecture. For example we get a buggy zero delta on full dyntick configurations: cat /sys/devices/system/cpu/cpuX/cpufreq/stats/time_in_state `2301000` 0 2300000 0 [...] Fix this with using the proper jiffies_64_t to clock_t conversion. Reported-and-tested-by: Carsten Emde <C.Emde@osadl.org> Signed-off-by: Andreas Schwab <schwab@linux-m68k.org> Signed-off-by: Frederic Weisbecker <fweisbec@gmail.com> Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: a857c0b9e24e39fe5be82451b65377795f9538d8 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:07 -08:00
Viresh Kumar	84886f9776	cpufreq: serialize calls to __cpufreq_governor() We can't take a big lock around __cpufreq_governor() as this causes recursive locking for some cases. But calls to this routine must be serialized for every policy. Otherwise we can see some unpredictable events. For example, consider following scenario: __cpufreq_remove_dev() __cpufreq_governor(policy, CPUFREQ_GOV_STOP); policy->governor->governor(policy, CPUFREQ_GOV_STOP); cpufreq_governor_dbs() case CPUFREQ_GOV_STOP: mutex_destroy(&cpu_cdbs->timer_mutex) cpu_cdbs->cur_policy = NULL; <PREEMPT> store() __cpufreq_set_policy() __cpufreq_governor(policy, CPUFREQ_GOV_LIMITS); policy->governor->governor(policy, CPUFREQ_GOV_LIMITS); case CPUFREQ_GOV_LIMITS: mutex_lock(&cpu_cdbs->timer_mutex); <-- Warning (destroyed mutex) if (policy->max < cpu_cdbs->cur_policy->cur) <- cur_policy == NULL And so store() will eventually result in a crash if cur_policy is NULL at this point. Introduce an additional variable which would guarantee serialization here. Reported-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 19c763031acb831a5ab9c1a701b7fedda073eb3f Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:07 -08:00
Viresh Kumar	88dca9646b	cpufreq: don't allow governor limits to be changed when it is disabled __cpufreq_governor() returns with -EBUSY when governor is already stopped and we try to stop it again, but when it is stopped we must not allow calls to CPUFREQ_GOV_LIMITS event as well. This patch adds this check in __cpufreq_governor(). Reported-by: Stephen Boyd <sboyd@codeaurora.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: f73d39338444d9915c746403bd98b145ff9d2ba4 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:07 -08:00
Stratos Karafotis	ecc0eef693	cpufreq: governor: Fix typos in comments - 'Governer' should be 'Governor'. - 'S' is used for Siemens (electrical conductance) in SI units, so use small 's' for seconds. Signed-off-by: Stratos Karafotis <stratosk@semaphore.gr> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: c4afc410942f9f0675a5431adbdb03cf5908d1df Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:06 -08:00
Stratos Karafotis	57943819d6	cpufreq: governors: Remove duplicate check of target freq in supported range Function __cpufreq_driver_target() checks if target_freq is within policy->min and policy->max range. generic_powersave_bias_target() also checks if target_freq is valid via a cpufreq_frequency_table_target() call. So, drop the unnecessary duplicate check in *_check_cpu(). Signed-off-by: Stratos Karafotis <stratosk@semaphore.gr> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 934dac1ea072bd8adff8d6a6abba561731e093cf Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:06 -08:00
Sudeep KarkadaNagesha	fcaef55c1f	cpufreq: arm_big_little: remove device tree parsing for cpu nodes Now that the cpu device registration initialises the of_node(if available) appropriately for all the cpus, parsing here is redundant. This patch removes all DT parsing and uses cpu->of_node instead. Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Sudeep KarkadaNagesha <sudeep.karkadanagesha@arm.com> Git-commit: da0eb143dbbaf26b6f084bee81d56fc64efb5390 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:06 -08:00
Li Zhong	a0a204fa01	cpufreq: fix bad unlock balance on !CONFIG_SMP This patch tries to fix lockdep complaint attached below. It seems that we should always read acquire the cpufreq_rwsem, whether CONFIG_SMP is enabled or not. And CONFIG_HOTPLUG_CPU depends on CONFIG_SMP, so it seems we don't need CONFIG_SMP for the code enabled by CONFIG_HOTPLUG_CPU. [ 0.504191] ===================================== [ 0.504627] [ BUG: bad unlock balance detected! ] [ 0.504627] 3.11.0-rc6-next-20130819 #1 Not tainted [ 0.504627] ------------------------------------- [ 0.504627] swapper/1 is trying to release lock (cpufreq_rwsem) at: [ 0.504627] [<ffffffff813d927a>] cpufreq_add_dev+0x13a/0x3e0 [ 0.504627] but there are no more locks to release! [ 0.504627] [ 0.504627] other info that might help us debug this: [ 0.504627] 1 lock held by swapper/1: [ 0.504627] #0: (subsys mutex#4){+.+.+.}, at: [<ffffffff8134a7bf>] subsys_interface_register+0x4f/0xe0 [ 0.504627] [ 0.504627] stack backtrace: [ 0.504627] CPU: 0 PID: 1 Comm: swapper Not tainted 3.11.0-rc6-next-20130819 #1 [ 0.504627] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2007 [ 0.504627] ffffffff813d927a ffff88007f847c98 ffffffff814c062b ffff88007f847cc8 [ 0.504627] ffffffff81098bce ffff88007f847cf8 ffffffff81aadc30 ffffffff813d927a [ 0.504627] 00000000ffffffff ffff88007f847d68 ffffffff8109d0be 0000000000000006 [ 0.504627] Call Trace: [ 0.504627] [<ffffffff813d927a>] ? cpufreq_add_dev+0x13a/0x3e0 [ 0.504627] [<ffffffff814c062b>] dump_stack+0x19/0x1b [ 0.504627] [<ffffffff81098bce>] print_unlock_imbalance_bug+0xfe/0x110 [ 0.504627] [<ffffffff813d927a>] ? cpufreq_add_dev+0x13a/0x3e0 [ 0.504627] [<ffffffff8109d0be>] lock_release_non_nested+0x1ee/0x310 [ 0.504627] [<ffffffff81099d0e>] ? mark_held_locks+0xae/0x120 [ 0.504627] [<ffffffff811510cb>] ? kfree+0xcb/0x1d0 [ 0.504627] [<ffffffff813d77ea>] ? cpufreq_policy_free+0x4a/0x60 [ 0.504627] [<ffffffff813d927a>] ? cpufreq_add_dev+0x13a/0x3e0 [ 0.504627] [<ffffffff8109d2a4>] lock_release+0xc4/0x250 [ 0.504627] [<ffffffff8106c9f3>] up_read+0x23/0x40 [ 0.504627] [<ffffffff813d927a>] cpufreq_add_dev+0x13a/0x3e0 [ 0.504627] [<ffffffff8134a809>] subsys_interface_register+0x99/0xe0 [ 0.504627] [<ffffffff81b19f3b>] ? cpufreq_gov_dbs_init+0x12/0x12 [ 0.504627] [<ffffffff813d7f0d>] cpufreq_register_driver+0x9d/0x1d0 [ 0.504627] [<ffffffff81b19f3b>] ? cpufreq_gov_dbs_init+0x12/0x12 [ 0.504627] [<ffffffff81b1a039>] acpi_cpufreq_init+0xfe/0x1f8 [ 0.504627] [<ffffffff810002ba>] do_one_initcall+0xda/0x180 [ 0.504627] [<ffffffff81ae301e>] kernel_init_freeable+0x12c/0x1bb [ 0.504627] [<ffffffff81ae2841>] ? do_early_param+0x8c/0x8c [ 0.504627] [<ffffffff814b4dd0>] ? rest_init+0x140/0x140 [ 0.504627] [<ffffffff814b4dde>] kernel_init+0xe/0xf0 [ 0.504627] [<ffffffff814d029a>] ret_from_fork+0x7a/0xb0 [ 0.504627] [<ffffffff814b4dd0>] ? rest_init+0x140/0x140 Signed-off-by: Li Zhong <zhong@linux.vnet.ibm.com> Acked-and-tested-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5025d628c8659fbf939f929107bf76db81dcdfff Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:06 -08:00
Viresh Kumar	e8c806d8e6	cpufreq: Use cpufreq_policy_list for iterating over policies To iterate over all policies we currently iterate over all online CPUs and then get the policy for each of them which is suboptimal. Use the newly created cpufreq_policy_list for this purpose instead. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 1b27429446f0c37353179544e844dc2086fa2353 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:05 -08:00
Viresh Kumar	982c7c772e	cpufreq: remove cpufreq_policy_cpu per-cpu variable cpufreq_policy_cpu per-cpu variables are used for storing the ID of the CPU that manages the given CPU's policy. However, we also store a policy pointer for each cpu in cpufreq_cpu_data, so the cpufreq_policy_cpu information is simply redundant. It is better to use cpufreq_cpu_data to retrieve a policy and get policy->cpu from there, so make that happen everywhere and drop the cpufreq_policy_cpu per-cpu variables which aren't necessary any more. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 474deff744c4012f07cfa994947d7c6260c9ab89 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:05 -08:00
Viresh Kumar	cd4947e249	cpufreq: remove unnecessary check in __cpufreq_governor() We don't need to check if event is CPUFREQ_GOV_POLICY_INIT and put governor module as we are sure event can only be START/STOP here. Remove the useless check. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 9e9fd801676a946b759a8669baa24ba327c8c903 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:05 -08:00
Viresh Kumar	719216221d	cpufreq: remove policy from cpufreq_policy_list during suspend cpufreq_policy_list is a list of active policies. We do remove policies from this list when all CPUs belonging to that policy are removed. But during system suspend we don't really free a policy struct as it will be used again during resume, so we didn't remove it from cpufreq_policy_list as well.. However, this is incorrect. We are saying this policy isn't valid anymore and must not be referenced (though we haven't freed it), but it can still be used by code that iterates over cpufreq_policy_list. Remove policy from this list during system suspend as well. Of course, we must add it back whenever the first CPU belonging to that policy shows up. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 9515f4d69b92feafe37581047a1bb41e41602faa Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:05 -08:00
Viresh Kumar	43a0679e2a	cpufreq: Fix white space in __cpufreq_remove_dev() Align closing brace '}' of an if block. [rjw: Subject and changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: edab2fbc21b9eb37007ad8bffe1159d536bbb451 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:04 -08:00
Rafael J. Wysocki	387bda797c	Revert "cpufreq: Use cpufreq_policy_list for iterating over policies" Revert commit eb60852 (cpufreq: Use cpufreq_policy_list for iterating over policies), because it breaks system suspend/resume on multiple machines. It either causes resume to block indefinitely or causes the BUG_ON() in lock_policy_rwsem_##mode() to trigger on sysfs accesses to cpufreq attributes. Conflicts: drivers/cpufreq/cpufreq.c Git-commit: 878f6e074e9a7784a6e351512eace4ccb3542eef Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:04 -08:00
Viresh Kumar	4c36081b1f	cpufreq: improve error checking on return values of __cpufreq_governor() The __cpufreq_governor() function can fail in rare cases especially if there are bugs in cpufreq drivers. Thus we must stop processing as soon as this routine fails, otherwise it may result in undefined behavior. This patch adds error checking code whenever this routine is called from any place. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3de9bdeb28638e164d1f0eb38dd68e3f5d2ac95c Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:04 -08:00
Viresh Kumar	326e2e9f4f	cpufreq: Drop the owner field from struct cpufreq_driver We don't need to set .owner = THIS_MODULE any more in cpufreq drivers as this field isn't used any more by the cpufreq core. This patch removes it and updates all dependent drivers accordingly. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: adc97d6a735dbb1e94cb4f1bf0b55f258b349941 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:04 -08:00
Viresh Kumar	b22d99e4b9	cpufreq: Use rwsem for protecting critical sections Critical sections of the cpufreq core are protected with the help of the driver module owner's refcount, which isn't the correct approach, because it causes rmmod to return an error when some routine has updated that refcount. Let's use rwsem for this purpose instead. Only cpufreq_unregister_driver() will use write sem and everybody else will use read sem. [rjw: Subject & changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 6eed9404ab3c4baea54ce4c7e862e69df1d39f38 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:04 -08:00
Viresh Kumar	c15e6c9a55	cpufreq: Fix broken usage of governor->owner's refcount The cpufreq governor owner refcount usage is broken. We should only increment that refcount when a CPUFREQ_GOV_POLICY_INIT event has come and it should only be decremented if CPUFREQ_GOV_POLICY_EXIT has come. Currently, there can be situations where the governor is in use, but we have allowed it to be unloaded which may result in undefined behavior. Let's fix it. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: fe492f3f0332e23cc6ca4913e5a2ed78e1888902 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:03 -08:00
Viresh Kumar	c267384a9c	cpufreq: Use cpufreq_policy_list for iterating over policies To iterate over all policies we currently iterate over all CPUs and then get the policy for each of them. Let's use the newly created cpufreq_policy_list for this purpose. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: eb608521f1e25a8c14295b6d9a3853c3cd8c6cf8 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:03 -08:00
Lukasz Majewski	6bc8f02509	cpufreq: Store cpufreq policies in a list Policies available in the cpufreq framework are now linked together. They are accessible via cpufreq_policy_list defined in the cpufreq core. [rjw: Fix from Yinghai Lu folded in] Signed-off-by: Lukasz Majewski <l.majewski@samsung.com> Signed-off-by: Myungjoo Ham <myungjoo.ham@samsung.com> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: c88a1f8b96e7384627b918dfabbfc0c615a4a914 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:03 -08:00
Viresh Kumar	d87467e6f2	cpufreq: Use sizeof(ptr) convetion for computing sizes Chapter 14 of Documentation/CodingStyle says: The preferred form for passing a size of a struct is the following: p = kmalloc(sizeof(p), ...); The alternative form where struct name is spelled out hurts readability and introduces an opportunity for a bug when the pointer variable type is changed but the corresponding sizeof that is passed to a memory allocator is not. This wasn't followed consistently in drivers/cpufreq, let's make it more consistent by always following this rule. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: d5b73cd870e2b049ef566aec2791dbf5fd26a7ec Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:03 -08:00
Viresh Kumar	068323d345	cpufreq: Give consistent names to cpufreq_policy objects They are called policy, cur_policy, new_policy, data, etc. Just call them policy wherever possible. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3a3e9e06d0c11b8efa95933a88c9e67209fa4330 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflict] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:02 -08:00
Viresh Kumar	b408618d46	cpufreq: Clean up header files included in the core This patch addresses the following issues in the header files in the cpufreq core: - Include headers in ascending order, so that we don't add same many times by mistake. - <asm/> must be included after <linux/>, so that they override whatever they need to. - Remove unnecessary includes. - Don't include files already included by cpufreq.h or cpufreq_governor.h. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5ff0a268037d344f86df690ccb994d8bc015d2d9 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:02 -08:00
Viresh Kumar	37b113da9d	cpufreq: Pass policy to cpufreq_add_policy_cpu() The caller of cpufreq_add_policy_cpu() already has a pointer to the policy structure and there is no need to look it up again in cpufreq_add_policy_cpu(). Let's pass it directly. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: d8d3b4711297e101bbad826474013edbe342c333 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:02 -08:00
Rafael J. Wysocki	b6c1774b41	cpufreq: Avoid double kobject_put() for the same kobject in error code path The only case triggering a jump to the err_out_unregister label in __cpufreq_add_dev() is when cpufreq_add_dev_interface() fails. However, if cpufreq_add_dev_interface() fails, it calls kobject_put() for the policy kobject in its error code path and since that causes the kobject's refcount to become 0, the additional kobject_put() for the same kobject under err_out_unregister and the wait_for_completion() following it are pointless, so drop them. Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: 10659ab7b50e963429f1a681882404ca37aa584c Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:01 -08:00
Rafael J. Wysocki	ba921df0d0	cpufreq: Do not hold driver module references for additional policy CPUs The cpufreq core is a little inconsistent in the way it uses the driver module refcount. Namely, if __cpufreq_add_dev() is called for a CPU that doesn't share the policy object with any other CPUs, the driver module refcount it grabs to start with will be dropped by it before returning and will be equal to whatever it had been before that function was invoked. However, if the given CPU does share the policy object with other CPUs, either cpufreq_add_policy_cpu() is called to link the new CPU to the existing policy, or cpufreq_add_dev_symlink() is used to link the other CPUs sharing the policy with it to the just created policy object. In that case, because both cpufreq_add_policy_cpu() and cpufreq_add_dev_symlink() call cpufreq_cpu_get() for the given policy (the latter possibly many times) without the balancing cpufreq_cpu_put() (unless there is an error), the driver module refcount will be left by __cpufreq_add_dev() with a nonzero value (different from the initial one). To remove that inconsistency make cpufreq_add_policy_cpu() execute cpufreq_cpu_put() for the given policy before returning, which decrements the driver module refcount so that it will be equal to its initial value after __cpufreq_add_dev() returns. Also remove the cpufreq_cpu_get() call from cpufreq_add_dev_symlink(), since both the policy refcount and the driver module refcount are nonzero when it is called and they don't need to be bumped up by it. Accordingly, drop the cpufreq_cpu_put() from __cpufreq_remove_dev(), since it is only necessary to balance the cpufreq_cpu_get() called by cpufreq_add_policy_cpu() or cpufreq_add_dev_symlink(). Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Reviewed-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: 71c3461ef7c67024792d283b88630245a6c169ba Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:01 -08:00
Viresh Kumar	8e4bf985f3	cpufreq: Don't pass CPU to cpufreq_add_dev_{symlink\|interface}() Pointer to struct cpufreq_policy is already passed to these routines and we don't need to send policy->cpu to them as well. So, get rid of this extra argument and use policy->cpu everywhere. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 308b60e71541518f3fe97171b4daf71adc607f3d Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:01 -08:00
Viresh Kumar	1a2410255c	cpufreq: Remove extra variables from cpufreq_add_dev_symlink() We call cpufreq_cpu_get() in cpufreq_add_dev_symlink() to increase usage refcount of policy, but not to get a policy for the given CPU. So, we don't really need to capture the return value of this routine. We can simply use policy passed as an argument to cpufreq_add_dev_symlink(). Moreover debug print is rewritten to make it more clear. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e8fdde1011ea45792e60f14f620b01f78cb0d34d Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:01 -08:00
Srivatsa S. Bhat	d6a685043e	cpufreq: Perform light-weight init/teardown during suspend/resume Now that we have the infrastructure to perform a light-weight init/tear-down, use that in the cpufreq CPU hotplug notifier when invoked from the suspend/resume path. This also ensures that the file permissions of the cpufreq sysfs files are preserved across suspend/resume, something which commit a66b2e (cpufreq: Preserve sysfs files across suspend/resume) originally intended to do, but had to be reverted due to other problems. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5302c3fb2e62f4ca5e43e060491ba299f58c5231 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:00 -08:00
Srivatsa S. Bhat	420f1a2fae	cpufreq: Preserve policy structure across suspend/resume To perform light-weight cpu-init and teardown in the cpufreq subsystem during suspend/resume, we need to separate out the 2 main functionalities of the cpufreq CPU hotplug callbacks, as outlined below: 1. Init/tear-down of core cpufreq and CPU-specific components, which are critical to the correct functioning of the cpufreq subsystem. 2. Init/tear-down of cpufreq sysfs files during suspend/resume. The first part requires accurate updates to the policy structure such as its ->cpus and ->related_cpus masks, whereas the second part requires that the policy->kobj structure is not released or re-initialized during suspend/resume. To handle both these requirements, we need to allow updates to the policy structure throughout suspend/resume, but prevent the structure from getting freed up. Also, we must have a mechanism by which the cpu-up callbacks can restore the policy structure, without allocating things afresh. (That also helps avoid memory leaks). To achieve this, we use 2 schemes: a. Use a fallback per-cpu storage area for preserving the policy structures during suspend, so that they can be restored during resume appropriately. b. Use the 'frozen' flag to determine when to free or allocate the policy structure vs when to restore the policy from the saved fallback storage. Thus we can successfully preserve the structure across suspend/resume. Effectively, this helps us complete the separation of the 'light-weight' and the 'full' init/tear-down sequences in the cpufreq subsystem, so that this can be made use of in the suspend/resume scenario. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 8414809c6a1e8479e331e09254adb58b33a36d25 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:00 -08:00
Srivatsa S. Bhat	a3cd334d35	cpufreq: Introduce a flag ('frozen') to separate full vs temporary init/teardown During suspend/resume we would like to do a light-weight init/teardown of CPUs in the cpufreq subsystem and preserve certain things such as sysfs files etc across suspend/resume transitions. Add a flag called 'frozen' to help distinguish the full init/teardown sequence from the light-weight one. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: a82fab292898f88ea9ca99dd10c1773dcada08b6 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:00 -08:00
Srivatsa S. Bhat	4a247c9235	cpufreq: Extract the handover of policy cpu to a helper function During cpu offline, when the policy->cpu is going down, some other CPU present in the policy->cpus mask is nominated as the new policy->cpu. Extract this functionality from __cpufreq_remove_dev() and implement it in a helper function. This helps in upcoming code reorganization. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: f9ba680d23ea7e2fc31b4b7106a482d90ec62a24 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:07:00 -08:00
Srivatsa S. Bhat	e7fd09fd43	cpufreq: Extract non-interface related stuff from cpufreq_add_dev_interface cpufreq_add_dev_interface() includes the work of exposing the interface to the device, as well as a lot of unrelated stuff. Move the latter to cpufreq_add_dev(), where it is more appropriate. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e18f1682bce701ddcf88ba3651e07c7ee9b3ed60 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:59 -08:00
Srivatsa S. Bhat	d35bcfbd1d	cpufreq: Add helper to perform alloc/free of policy structure Separate out the allocation of the cpufreq policy structure (along with its error handling) to a helper function. This makes the code easier to read and also helps with some upcoming code reorganization. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: e9698cc5d2749c5b74e137f94a95d7e505b097e8 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:59 -08:00
Srivatsa S. Bhat	da12b9488e	cpufreq: Fix misplaced call to cpufreq_update_policy() The call to cpufreq_update_policy() is placed in the CPU hotplug callback of cpufreq_stats, which has a higher priority than the CPU hotplug callback of cpufreq-core. As a result, during CPU_ONLINE/CPU_ONLINE_FROZEN, we end up calling cpufreq_update_policy() before calling cpufreq_add_dev() ! And for uninitialized CPUs, it just returns silently, not doing anything. To add to that, cpufreq_stats is not even the right place to call cpufreq_update_policy() to begin with. The cpufreq core ought to handle this in its own callback, from an elegance/relevance perspective. So move the invocation of cpufreq_update_policy() to cpufreq_cpu_callback, and place it after cpufreq_add_dev(). Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 23d328994b548d6822b88fe7e1903652afc354e0 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:59 -08:00
Viresh Kumar	cf3f085426	cpufreq: rename ignore_nice as ignore_nice_load This sysfs file was called ignore_nice_load earlier and commit `4d5dcc4` (cpufreq: governor: Implement per policy instances of governors) changed its name to ignore_nice by mistake. Lets get it renamed back to its original name. Reported-by: Martin von Gagern <Martin.vGagern@gmx.net> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Cc: 3.10+ <stable@vger.kernel.org> # 3.10+ Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 6c4640c3adfd97ce10efed7c07405f52d002b9a8 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:59 -08:00
Stratos Karafotis	a61f710c5d	cpufreq: Remove unused function __cpufreq_driver_getavg() The target frequency calculation method in the ondemand governor has changed and it is now independent of the measured average frequency. Consequently, the __cpufreq_driver_getavg() function and getavg member of struct cpufreq_driver are not used any more, so drop them. [rjw: Changelog] Signed-off-by: Stratos Karafotis <stratosk@semaphore.gr> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: cffe4e0e7413eb29fb8bd035c8b12b33a4b8522a Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:59 -08:00
Stratos Karafotis	8a061dbee2	cpufreq: ondemand: Change the calculation of target frequency The ondemand governor calculates load in terms of frequency and increases it only if load_freq is greater than up_threshold multiplied by the current or average frequency. This appears to produce oscillations of frequency between min and max because, for example, a relatively small load can easily saturate minimum frequency and lead the CPU to the max. Then, it will decrease back to the min due to small load_freq. Change the calculation method of load and target frequency on the basis of the following two observations: - Load computation should not depend on the current or average measured frequency. For example, absolute load of 80% at 100MHz is not necessarily equivalent to 8% at 1000MHz in the next sampling interval. - It should be possible to increase the target frequency to any value present in the frequency table proportional to the absolute load, rather than to the max only, so that: Target frequency = C * load where we take C = policy->cpuinfo.max_freq / 100. Tested on Intel i7-3770 CPU @ 3.40GHz and on Quad core 1500MHz Krait. Phoronix benchmark of Linux Kernel Compilation 3.1 test shows an increase ~1.5% in performance. cpufreq_stats (time_in_state) shows that middle frequencies are used more, with this patch. Highest and lowest frequencies were used less by ~9%. [rjw: We have run multiple other tests on kernels with this change applied and in the vast majority of cases it turns out that the resulting performance improvement also leads to reduced consumption of energy. The change is additionally justified by the overall simplification of the code in question.] Signed-off-by: Stratos Karafotis <stratosk@semaphore.gr> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: dfa5bb622555d9da0df21b50f46ebdeef390041b Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:58 -08:00
Paul Gortmaker	ab6ac74b49	cpufreq: delete __cpuinit usage from all cpufreq files The __cpuinit type of throwaway sections might have made sense some time ago when RAM was more constrained, but now the savings do not offset the cost and complications. For example, the fix in commit `5e427ec2d0` ("x86: Fix bit corruption at CPU resume time") is a good example of the nasty type of bugs that can be created with improper use of the various __init prefixes. After a discussion on LKML[1] it was decided that cpuinit should go the way of devinit and be phased out. Once all the users are gone, we can then finally remove the macros themselves from linux/init.h. This removes all the drivers/cpufreq uses of the __cpuinit macros from all C files. [1] https://lkml.org/lkml/2013/5/20/589 [v2: leave 2nd lines of args misaligned as requested by Viresh] Cc: "Rafael J. Wysocki" <rjw@sisk.pl> Cc: Viresh Kumar <viresh.kumar@linaro.org> Cc: cpufreq@vger.kernel.org Cc: linux-pm@vger.kernel.org Acked-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Paul Gortmaker <paul.gortmaker@windriver.com> Git-commit: 2760984f6578d5a462155bb4727766d0c8b68387 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: resolve trivial merge conflicts. Remove __cpuinit for arch/arm/mach-msm/cpufreq.c as well] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:58 -08:00
Viresh Kumar	a3ffd1408e	cpufreq: Fix serialization of frequency transitions Commit 7c30ed ("cpufreq: make sure frequency transitions are serialized") interacts poorly with systems that have a single core freqency for all cores. On such systems we have a single policy for all cores with several CPUs. When we do a frequency transition the governor calls the pre and post change notifiers which causes cpufreq_notify_transition() per CPU. Since the policy is the same for all of them all CPUs after the first and the warnings added are generated by checking a per-policy flag the warnings will be triggered for all cores after the first. Fix this by allowing notifier to be called for n times. Where n is the number of cpus in policy->cpus. Reported-and-tested-by: Mark Brown <broonie@linaro.org> Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 266c13d767be61a17d8e6f2310b9b7c46278273b Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:58 -08:00
Lan Tianyu	d0c9f30b15	acpi-cpufreq: Add new sysfs attribute freqdomain_cpus Commits `fcf8058` (cpufreq: Simplify cpufreq_add_dev()) and `aa77a52` (cpufreq: acpi-cpufreq: Don't set policy->related_cpus from .init()) changed the contents of the "related_cpus" sysfs attribute on systems where acpi-cpufreq is used and user space can't get the list of CPUs which are in the same hardware coordination CPU domain (provided by the ACPI AML method _PSD) via "related_cpus" any more. To make up for that loss add a new sysfs attribute "freqdomian_cpus" for the acpi-cpufreq driver which exposes the list of CPUs in the same domain regardless of whether it is coordinated by hardware or software. [rjw: Changelog, documentation] References: https://bugzilla.kernel.org/show_bug.cgi?id=58761 Reported-by: Jean-Philippe Halimi <jean-philippe.halimi@exascale-computing.eu> Signed-off-by: Lan Tianyu <tianyu.lan@intel.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: f4fd3797848aa04e72e942c855fd279840a47fe4 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:57 -08:00
Viresh Kumar	bdd9adf7ee	cpufreq: make sure frequency transitions are serialized Whenever we are changing frequency of a cpu, we are calling PRECHANGE and POSTCHANGE notifiers. They must be serialized. i.e. PRECHANGE or POSTCHANGE shouldn't be called twice contiguously. This can happen due to bugs in users of __cpufreq_driver_target() or actual cpufreq drivers who are sending these notifiers. This patch adds some protection against this. Now, we keep track of the last transaction and see if something went wrong. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 7c30ed532cf798a8d924562f2f44d03d7652f7a7 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:57 -08:00
Viresh Kumar	ac137ff7c1	cpufreq: arm-big-little: call CPUFREQ_POSTCHANGE notfier in error cases PRECHANGE and POSTCHANGE notifiers must be called in groups, i.e either both should be called or both shouldn't be. In case we have started PRECHANGE notifier and found an error, we must call POSTCHANGE notifier with freqs.new = freqs.old to guarantee that sequence of calling notifiers is complete. This patch fixes it. This also removes code setting policy->cur as this is also done by POSTCHANGE notifier. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Git-commit: 3d69dd50517f4a1c037298ac4af85aae1d070879 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:57 -08:00
Viresh Kumar	81266f636f	cpufreq: make __cpufreq_notify_transition() static __cpufreq_notify_transition() is used only in cpufreq.c, make it static. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 0956df9c842a534b0b36f62f3a0fdb5fca19dc96 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:57 -08:00
Viresh Kumar	0309e39a08	cpufreq: Fix minor formatting issues There were a few noticeable formatting issues in core cpufreq code. This cleans them up to make code look better. The changes include: - Whitespace cleanup. - Rearrangements of code. - Multiline comments fixes. - Formatting changes to fit 80 columns. Copyright information in cpufreq.c is also updated to include my name for 2013. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: bb176f7d038fee4d46b3293e64e173bfb05ab7b5 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:56 -08:00
Viresh Kumar	399c4791d5	cpufreq: Simplify userspace governor Userspace governor has got more code than what it needs for its functioning, so simplify it. Portions of code removed are: - Extra header files which aren't required anymore (rearrange them as well). - cpu_{max\|min\|cur\|set}_freq, as they are always the same as policy->{max\|min\|cur}. - userspace_cpufreq_notifier_block as we don't need to set cpu_cur_freq anymore. - cpus_using_userspace_governor as it was for the notifier code. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: d1922f02562fe230396400e466e6e38dfeb072f5 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:56 -08:00
Viresh Kumar	fa715b03d7	cpufreq: remove unnecessary cpufreq_cpu_{get\|put}() calls struct cpufreq_policy is already passed as argument to some routines like: __cpufreq_driver_getavg() and so we don't really need to do cpufreq_cpu_get() before and cpufreq_cpu_put() in them to get a policy structure. Remove them. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: a262e94cdcb961762e5d91e7fcb857bba7d420a0 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:56 -08:00
Viresh Kumar	2314e15b61	cpufreq: rename index as driver_data in cpufreq_frequency_table The "index" field of struct cpufreq_frequency_table was never an index and isn't used at all by the cpufreq core. It only is useful for cpufreq drivers for their internal purposes. Many people nowadays blindly set it in ascending order with the assumption that the core will use it, which is a mistake. Rename it to "driver_data" as that's what its purpose is. All of its users are updated accordingly. [rjw: Changelog] Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Acked-by: Simon Horman <horms+renesas@verge.net.au> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 5070158804b5339c71809f5e673cea1cfacd804d Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: update non-upstream files] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:56 -08:00
Viresh Kumar	0536fb861a	cpufreq: Don't create empty /sys/devices/system/cpu/cpufreq directory When we don't have any file in cpu/cpufreq directory we shouldn't create it. Specially with the introduction of per-policy governor instance patchset, even governors are moved to cpu/cpu*/cpufreq/governor-name directory and so this directory is just not required. Lets have it only when required. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 2361be23666232dbb4851a527f466c4cbf5340fc Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:55 -08:00
Viresh Kumar	aa47efbd4e	cpufreq: Move get_cpu_idle_time() to cpufreq.c Governors other than ondemand and conservative can also use get_cpu_idle_time() and they aren't required to compile cpufreq_governor.c. So, move these independent routines to cpufreq.c instead. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 72a4ce340a7ebf39e1c6fdc8f5feb4f974d6c635 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git [junjiew@codeaurora.org: update non-upstream files] Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:55 -08:00
Viresh Kumar	4b7d898329	cpufreq: governors: Move get_governor_parent_kobj() to cpufreq.c get_governor_parent_kobj() can be used by any governor, generic cpufreq governors or platform specific ones and so must be present in cpufreq.c instead of cpufreq_governor.c. This patch moves it to cpufreq.c. This also adds EXPORT_SYMBOL_GPL(get_governor_parent_kobj) so that modules can use this function too. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 944e9a0316e60bc5bc122e46c1fde36e5f6e9f56 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:55 -08:00
Viresh Kumar	4f660094e4	cpufreq: Add EXPORT_SYMBOL_GPL for have_governor_per_policy This patch adds: EXPORT_SYMBOL_GPL(have_governor_per_policy), so that this routine can be used by modules too. Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Git-commit: 3f869d6d41d032392abafe17ea5257a2514a24a7 Git-repo: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-12-20 19:06:55 -08:00
Srivatsa Vaddagiri	840cdb0958	cpufreq: cpu-boost: Resolve deadlock when waking up sync thread CPU boost driver receives notification from scheduler when threads migrate towards a cpu and in turn wakes up a sync thread associated with that cpu to handle any required frequency transitions. The wakeup call however can lead to a deadlock inside scheduler under some circumstance. The deadlock is seen when sync thread is the only thread running on a cpu and goes to sleep (say by calling wait_event() -> schedule()). Midway through this sleep (schedule()) call, while cpu is still running in context of sync thread, scheduler attempts a load balance (realizing that cpu is about to become idle) which can result in tasks being migrated towards the cpu going idle. This will cause migration notification to be issued and in turn a wakeup on sync thread. The wakeup call however gets stuck in below while() loop inside scheduler: try_to_wake_up(struct task_struct p, ...) { / * If the owning (remote) cpu is still in the middle of * schedule() with this task as prev, wait until its done * referencing the task. */ while (p->on_cpu) cpu_relax(); } A possible fix could be to teach try_to_wake_up() about this special case. Another fix, implemented in this patch and that helps minimize scheduler changes, is to have cpu boost driver not issue a wakeup under this special circumstance, which was found to occur very infrequently. Change-Id: I92bc68a22d51595a208673fe2a1eedfa97004f9e Signed-off-by: Srivatsa Vaddagiri <vatsa@codeaurora.org>	2013-12-20 18:56:07 -08:00
Rohit Gupta	24d3ed232f	cpufreq: interactive: Use default min_sample_time if SDF is zero Use min_sample_time to step down from max frequencies if sampling_down_factor is set to zero. Change-Id: I2e89bef6220557cb95efd20d658e0c05753b4c3c Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-20 17:47:47 -08:00
Linux Build Service Account	cd45ce66b0	Merge "cpufreq: interactive: Remove trace event from idle_start handler"	2013-12-10 17:15:21 -08:00
Linux Build Service Account	09efed950d	Merge "cpufreq: interactive: sync freq feature for interactive governor"	2013-12-10 17:15:09 -08:00
Rohit Gupta	cdd6914bb7	cpufreq: interactive: Remove trace event from idle_start handler Removed the trace_cpufreq_interactive_idle_start. Also fix a crash resulting from accessing NULL policy before taking the pcpu->enable_sem lock. The policy can be NULL if the core is hotplugged out before the enable_sem lock is taken. Change-Id: I7e2809cc016b3b383a44cdf3c697013e2d2b5417 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-09 19:22:27 -08:00
Linux Build Service Account	0e50cfa58c	Merge "cpufreq: Add Input Boost feature to the cpu-boost driver"	2013-12-08 13:30:02 -08:00
Linux Build Service Account	0f3ff5b57f	Merge "cpufreq: interactive: Reset floor_validate_time if busy at max for 100ms"	2013-12-07 11:10:53 -08:00
Rohit Gupta	b6cd9b3352	cpufreq: interactive: Reset floor_validate_time if busy at max for 100ms When the interactive governor selects to run at max frequency it doesn't re-schedule the timer until it hits an idle. This change checks if the CPU has been continuously busy for last 100ms on hitting an idle start. If yes, then floor_validate_time is reset so that the CPU stays at max frequency for at least another 100 ms before stepping down. This is an important feature for detecting CPU intensive workloads which require high frequencies for achieving better performance. Change-Id: I7d48ffbc3d50a80af9be3bf94667ee3d0120b763 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-06 16:43:07 -08:00
Rohit Gupta	f3d1980b4d	cpufreq: interactive: sync freq feature for interactive governor 1) Add load info to cpufreq_interactive_cpuinfo 2) If load on any other online cpu exceeds sync_freq_load_threshold, do not allow the frequency to drop below sync_freq Change-Id: I3617e10f87b85178914a18bcf04ac2a31a4f1ec1 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-06 15:44:10 -08:00
Rohit Gupta	ff6af80775	cpufreq: interactive: Allow 1 ms error in above_hispeed_delay comparisons Allow for an error of 1 ms while taking into account above_hispeed_delay for a frequency Change-Id: I744e44387152e4efb5978df4f2b9533bf79d4582 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-04 18:42:21 -08:00
Mark Langsdorf	16802f7ac3	cpufreq: highbank-cpufreq: Enable Midway/ECX-2000 commit fbbc5bfb44a22e7a8ef753a1c8dfb448d7ac8b85 upstream. Calxeda's new ECX-2000 part uses the same cpufreq interface as highbank, so add it to the driver's compatibility list. This is a minor change that can safely be applied to the 3.10 and 3.11 stable trees. Signed-off-by: Mark Langsdorf <mark.langsdorf@calxeda.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2013-12-04 10:57:18 -08:00
Rohit Gupta	c556bd1322	cpufreq: Add Input Boost feature to the cpu-boost driver On incoming input events boost the frequency of all online cpus for at least input_boost_ms ms. This is accomplished by changing the policy->min of all the online cpus to input_boost_freq. Change-Id: Idb0ab75d68ae4ceff259cbbaaec1a9bb3bc871d3 Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-03 18:28:44 -08:00
Rohit Gupta	2dd0e605ee	cpufreq: Add a sync limit to cpu-boost Perform frequency synchronization only when source CPU's frequency is less than sync_threshold, else sync to the sync_threshold. Change-Id: I544c414568d4e015b80ce5891dd215275bac95da Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-12-03 18:28:40 -08:00
Rohit Gupta	d094d23694	cpufreq: interactive: Add a sampling_down_factor for max frequencies Change min_sample_time to sampling_down_factor ms when running at max frequency. This would keep the frequency up at max for at least that much time before allowing it to step down. This is an important performance enhancement for CPU intensive workloads like benchmarks where there is consistently high load on CPU to keep it up at max frequency. Change-Id: Ia7e35b0625bedf20e7ef3a1f52e5828ffbfed93e Signed-off-by: Rohit Gupta <rohgup@codeaurora.org>	2013-11-27 17:42:17 -08:00
Saravana Kannan	6249bf152c	cpufreq: cpu-boost: Add cpu-boost driver When certain bursty and important events take place, it might take a while for the current cpufreq governor to notice the new load and react to it. That would result in poor user experience. To alleviate this, the cpu-boost driver boosts the frequency of a CPU for a short duration to maintain good user experience while the governor catches up. Specifically, this commit deals with ensuring that when "important" tasks migrate from a fast CPU to a slow CPU, the frequency of the slow CPU is boosted to be at least as high as the fast CPU for a short duration. Since this driver enforces the boost by hooking into standard cpufreq ADJUST notifiers, it has several advantages: - More portable across kernel versions where the cpufreq internals might have been rewritten. - Governor agnostic and hence works with multiple governors like conservative, ondemand, interactive, etc. - Does not affect the sampling period/logic of existing governors. - Can have the boost period adjusted independent of governor sampling period. Change-Id: Ibd814a20043d0aba64ee7637a4a79b9ffa1b0991 Signed-off-by: Saravana Kannan <skannan@codeaurora.org>	2013-11-25 17:14:47 -08:00
Ian Maund	f06163e6d0	msm: reap unused kernel files This change removes source files from the kernel tree that were not being used during make. The list of used files was generated using an annotated make log and was then compared with new files added since the public release of kernel version 3.10.00. New files which were added but not used have been removed from the tree. A diff was also run to determine the list of files that had been modified since the release of kernel version 3.10.00. These files were then scrubbed based on the current kernel configuration, removing invalid and unused conditionals. Some files which support planned functionality or are useful in debugging have been excluded from this reap. Change-Id: Ia44a224d3cea7bc78dd45e8a8279860d35d4b008 Signed-off-by: Ian Maund <imaund@codeaurora.org>	2013-11-21 17:45:28 -08:00
Minsung Kim	01ae0541f0	cpufreq: interactive: fix show_target_loads and show_above_hispeed_delay Remove a trailing whitespace from target_loads and above_hispeed_delay. Problem happens when user-space program tried to restore parameters that saved before changing parameters. In this case was returned error(EINVAL). Change-Id: I5a74e3824602cd6f2b74651adda5ec1b627e61e9 Signed-off-by: Minsung Kim <ms925.kim@samsung.com> Git-commit: cf0fad49d17cb8273ce555dd5b7afab67d7923bf Git-repo: https://android.googlesource.com/kernel/common/ Signed-off-by: Dilip Gudlur <dgudlur@codeaurora.org>	2013-11-18 11:57:24 -08:00
Dirk Brandewie	f606b358df	cpufreq / intel_pstate: Fix max_perf_pct on resume commit 52e0a509e5d6f902ec26bc2a8bb02b137dc453be upstream. If the system is suspended while max_perf_pct is less than 100 percent or no_turbo set policy->{min,max} will be set incorrectly with scaled values which turn the scaled values into hard limits. References: https://bugzilla.kernel.org/show_bug.cgi?id=61241 Reported-by: Patrick Bartels <petzicus@googlemail.com> Signed-off-by: Dirk Brandewie <dirk.j.brandewie@intel.com> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2013-11-13 12:05:30 +09:00
Junjie Wu	aa3edab571	cpufreq: Save and restore user min/max freq for hotplug In addition to restoring the previous governor after hotplug, restore min/max frequency set in user_policy. This patch combines commit: 1) 026cf76c5a2b1b84cbe12b79800f97389abfd589 (android-msm-2.6.35) cpufreq: Save and restore min and max frequencies 2) 82a8db17421e475eb662ad83674d4f7a351bc55e (msm-3.4) cpufreq: Save user policy min/max instead of policy min/max during hotplug And refactors it to apply cleanly onto the 3.10 kernel. Change-Id: Id097132765a5487618ca020bae3cc9c9ea791072 Signed-off-by: Junjie Wu <junjiew@codeaurora.org>	2013-09-20 23:32:29 -07:00
Stephen Boyd	b6d59d6bfc	cpufreq: Don't use smp_processor_id() in preemptible context Workqueues are preemptible even if works are queued on them with queue_work_on(). Let's use raw_smp_processor_id() here to silence the warning. BUG: using smp_processor_id() in preemptible [00000000] code: kworker/3:2/674 caller is gov_queue_work+0x28/0xb0 CPU: 0 PID: 674 Comm: kworker/3:2 Tainted: G W 3.10.0 #30 Workqueue: events od_dbs_timer [<c010c178>] (unwind_backtrace+0x0/0x11c) from [<c0109dec>] (show_stack+0x10/0x14) [<c0109dec>] (show_stack+0x10/0x14) from [<c03885a4>] (debug_smp_processor_id+0xbc/0xf0) [<c03885a4>] (debug_smp_processor_id+0xbc/0xf0) from [<c0635864>] (gov_queue_work+0x28/0xb0) [<c0635864>] (gov_queue_work+0x28/0xb0) from [<c0635618>] (od_dbs_timer+0x108/0x134) [<c0635618>] (od_dbs_timer+0x108/0x134) from [<c01aa8f8>] (process_one_work+0x25c/0x444) [<c01aa8f8>] (process_one_work+0x25c/0x444) from [<c01aaf88>] (worker_thread+0x200/0x344) [<c01aaf88>] (worker_thread+0x200/0x344) from [<c01b03bc>] (kthread+0xa0/0xb0) [<c01b03bc>] (kthread+0xa0/0xb0) from [<c01061b8>] (ret_from_fork+0x14/0x3c) Signed-off-by: Stephen Boyd <sboyd@codeaurora.org>	2013-09-04 17:24:51 -07:00
Stephen Boyd	c43e0a8d13	cpufreq: Fix timer/workqueue corruption due to double queueing When a CPU is hot removed we'll cancel all the delayed work items via gov_cancel_work(). Normally this will just cancel a delayed timer on each CPU that the policy is managing and the work won't run, but if the work is already running the workqueue code will wait for the work to finish before continuing to prevent the work items from re-queuing themselves like they normally do. This scheme will work most of the time, except for the case where the work function determines that it should adjust the delay for all other CPUs that the policy is managing. If this scenario occurs, the canceling CPU will cancel its own work but queue up the other CPUs works to run. For example: CPU0 CPU1 ---- ---- cpu_down() ... __cpufreq_remove_dev() cpufreq_governor_dbs() case CPUFREQ_GOV_STOP: gov_cancel_work(dbs_data, policy); cpu0 work is canceled timer is canceled cpu1 work is canceled <work runs> <waits for cpu1> od_dbs_timer() gov_queue_work(, , true); cpu0 work queued cpu1 work queued cpu2 work queued ... cpu1 work is canceled cpu2 work is canceled ... At the end of the GOV_STOP case cpu0 still has a work queued to run although the code is expecting all of the works to be canceled. __cpufreq_remove_dev() will then proceed to re-initialize all the other CPUs works except for the CPU that is going down. The CPUFREQ_GOV_START case in cpufreq_governor_dbs() will trample over the queued work and debugobjects will spit out a warning: WARNING: at lib/debugobjects.c:260 debug_print_object+0x94/0xbc() ODEBUG: init active (active state 0) object type: timer_list hint: delayed_work_timer_fn+0x0/0x10 Modules linked in: CPU: 0 PID: 1491 Comm: sh Tainted: G W 3.10.0 #19 [<c010c178>] (unwind_backtrace+0x0/0x11c) from [<c0109dec>] (show_stack+0x10/0x14) [<c0109dec>] (show_stack+0x10/0x14) from [<c01904cc>] (warn_slowpath_common+0x4c/0x6c) [<c01904cc>] (warn_slowpath_common+0x4c/0x6c) from [<c019056c>] (warn_slowpath_fmt+0x2c/0x3c) [<c019056c>] (warn_slowpath_fmt+0x2c/0x3c) from [<c0388a7c>] (debug_print_object+0x94/0xbc) [<c0388a7c>] (debug_print_object+0x94/0xbc) from [<c0388e34>] (__debug_object_init+0x2d0/0x340) [<c0388e34>] (__debug_object_init+0x2d0/0x340) from [<c019e3b0>] (init_timer_key+0x14/0xb0) [<c019e3b0>] (init_timer_key+0x14/0xb0) from [<c0635f78>] (cpufreq_governor_dbs+0x3e8/0x5f8) [<c0635f78>] (cpufreq_governor_dbs+0x3e8/0x5f8) from [<c06325a0>] (__cpufreq_governor+0xdc/0x1a4) [<c06325a0>] (__cpufreq_governor+0xdc/0x1a4) from [<c0633704>] (__cpufreq_remove_dev.isra.10+0x3b4/0x434) [<c0633704>] (__cpufreq_remove_dev.isra.10+0x3b4/0x434) from [<c08989f4>] (cpufreq_cpu_callback+0x60/0x80) [<c08989f4>] (cpufreq_cpu_callback+0x60/0x80) from [<c08a43c0>] (notifier_call_chain+0x38/0x68) [<c08a43c0>] (notifier_call_chain+0x38/0x68) from [<c01938e0>] (__cpu_notify+0x28/0x40) [<c01938e0>] (__cpu_notify+0x28/0x40) from [<c0892ad4>] (_cpu_down+0x7c/0x2c0) [<c0892ad4>] (_cpu_down+0x7c/0x2c0) from [<c0892d3c>] (cpu_down+0x24/0x40) [<c0892d3c>] (cpu_down+0x24/0x40) from [<c0893ea8>] (store_online+0x2c/0x74) [<c0893ea8>] (store_online+0x2c/0x74) from [<c04519d8>] (dev_attr_store+0x18/0x24) [<c04519d8>] (dev_attr_store+0x18/0x24) from [<c02a69d4>] (sysfs_write_file+0x100/0x148) [<c02a69d4>] (sysfs_write_file+0x100/0x148) from [<c0255c18>] (vfs_write+0xcc/0x174) [<c0255c18>] (vfs_write+0xcc/0x174) from [<c0255f70>] (SyS_write+0x38/0x64) [<c0255f70>] (SyS_write+0x38/0x64) from [<c0106120>] (ret_fast_syscall+0x0/0x30) The simplest fix is to check and see if the governor is being stopped and ignore the all_cpus flag so that only the work that's being canceled has the chance to re-queue itself. Signed-off-by: Stephen Boyd <sboyd@codeaurora.org>	2013-09-04 17:24:47 -07:00

... 2 3 4 5 6 ...

1076 Commits