]> xenbits.xensource.com Git - xen.git/log
xen.git
16 years agoImplements Guest MCE# MSR read/write virtualization
Keir Fraser [Fri, 20 Mar 2009 17:24:53 +0000 (17:24 +0000)]
Implements Guest MCE# MSR read/write virtualization

Signed-off-by: Jiang, Yunhong <yunhong.jiang@intel.com>
Signed-off-by: Ke, Liping <liping.ke@intel.com>
16 years agox86: Core support for Intel MCA support
Keir Fraser [Fri, 20 Mar 2009 17:24:29 +0000 (17:24 +0000)]
x86: Core support for Intel MCA support

Those patches based on AMD and SUN's MCA related jobs.
We have latest rebase after SUN's latest improvements.
We will have late following patches for recovery actions. This is a
basic framework for Intel.

Some implementation notes:
1) When error happens, if the error is fatal (pcc = 1) or can't be
recovered (pcc = 0, yet no good recovery methods),
    for avoiding losing logs in DOM0, we will reset machine
    immediately. Most of MCA MSRs are sticky. After reboot,
    MCA polling mechanism will send vIRQ to DOM0 for logging.
2) When MCE# happens, all CPUs enter MCA context. The first CPU who
read&clear the error MSR bank will be this
    MCE# owner. Necessary locks/synchronization will help to judge the
    owner and select most severe error.
3) For convenience, we will select the most offending CPU to do most
of processing&recovery job.
4) MCE# happens, we will do three jobs:
    a. Send vIRQ to DOM0 for logging
    b. Send vMCE# to Impacted Guest (Currently Only inject to impacted
    DOM0)
    c. Guest vMCE MSR virtualization
5) Some further improvement/adds for newer CPUs might be done  later
    a) Connection with recovery actions (cpu/memory online/offline)
    b) More software-recovery identification in severity_scan
    c) More refines and tests for HVM might be done when needed.

This patch Enable basic MCA support For Intel

Signed-off-by: Jiang, Yunhong<yunhong.jiang@intel.com>
Signed-off-by: Ke, Liping <Liping.ke@intel.com>
16 years agox86: Enable Xen C-state management by default.
Keir Fraser [Fri, 20 Mar 2009 15:38:46 +0000 (15:38 +0000)]
x86: Enable Xen C-state management by default.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agopygrub: run first entry in config when invalid default boot option provided
Keir Fraser [Fri, 20 Mar 2009 15:37:33 +0000 (15:37 +0000)]
pygrub: run first entry in config when invalid default boot option provided

This behaviour matches that of real GRUB.

Signed-off-by: Michal Novotny <minovotn@redhat.com>
16 years agoCPUIDLE: enable MSI capable HPET for timer broadcast
Keir Fraser [Fri, 20 Mar 2009 12:33:55 +0000 (12:33 +0000)]
CPUIDLE: enable MSI capable HPET for timer broadcast

The HPET broadcast was default not enabled because the old code use
HPET channel 0 in legacy replacing mode which will hide both PIT & RTC
irqs and cause issues if RTC is needed in some cases. The upstream
default broadcast timer is PIT, which is in periodic mode (100HZ) and
would be expensive to be used as oneshot. MSI capable HPET is coming
into being. It is capable to deliver interrupt through FSB directly,
and has no side effect.

This patch extends support for MSI HPET based on original legacy HPET
code. The broadcast timer selection logic becomes: 1. if MSI capable HPET
available, use multiple HPET channels (no more than possible cpu num)
in MSI mode; 2. else if legacy replacing mode available for HPET &
'hpetbroadcast' option is given in cmd line, use HPET channel 0 in
legacy mode; 3 else use PIT.

While entering broadcast, it gets a hpet channel (look for a spare one
first, if failing allocate a shared one), attach to current cpu, setup
the irq affinity & broadcast cpumask. While exiting broadcast, it
detach the used hpet channel and try to change the owner if the
broadcast mask is not empty. Some optimizations(static affinity) were
done for (MSI HPET channels >= possible cpu num) case.

A new hw_interrupt_controller 'HPET_MSI' is created for HPET MSI
interrupt handling.

Signed-off-by: Wei Gang <gang.wei@intel.com>=
16 years agoxend: specify the slot for pass-through devices
Keir Fraser [Fri, 20 Mar 2009 12:09:20 +0000 (12:09 +0000)]
xend: specify the slot for pass-through devices

Currently a slot may be specified for a hot-plug device,
but not for a pass-through device that is inserted at boot time.
This patch adds support for the latter.

The syntax is:
     BUS:DEV.FUNC[@VSLOT]
e.g: 0000:00:1d:0@7

This may be important as recent changes that allow any free PCI
slot to be used for pass-through (and hotplug) may case pass-through
devices to be assigned in different locations to before. Amongst
other things, specifying the slot will allow users to move them
back, if there is a need.

Signed-off-by: Simon Horman <horms@verge.net.au>
16 years agoxenstore: Don't hardcode install path
Keir Fraser [Fri, 20 Mar 2009 10:37:12 +0000 (10:37 +0000)]
xenstore: Don't hardcode install path

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoxmexample: fix cpuid example
Keir Fraser [Fri, 20 Mar 2009 10:36:40 +0000 (10:36 +0000)]
xmexample: fix cpuid example

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agovtd: fix Dom0 S3 when VT-d is enabled.
Keir Fraser [Fri, 20 Mar 2009 09:36:57 +0000 (09:36 +0000)]
vtd: fix Dom0 S3 when VT-d is enabled.

On some platforms that support Queued Invalidation and Interrupt
Remapping, Dom0 S3 doesn't work. The patch fixes the issue.

1) In device_power_down(), we should invoke iommu_suspend() after
ioapic_suspend(); in device_power_up(), we should invoke
iommu_resume() before ioapic_resume().

2) Add 2 functions: disable_qinval() and disable_intremap(); in
iommu_suspend(), we invoke them and iommu_disable_translation().
   Rename qinval_setup() to enable_qinval() and rename
   intremap_setup() to enable_intremap().

3) In iommu_resume(), remove the unnecessary
iommu_flush_{context, iotlb}_global() -- actually we mustn't do that
if Queued Invalidation was enabled before S3 because at this point of
S3 resume, Queued Invalidation hasn't been re-enabled.

4) Add a static global array ioapic_pin_to_intremap_index[] to
remember what intremap_index an ioapic pin uses -- during S3 resume,
ioapic_resume() re-writes all the ioapic RTEs, so we can use the array
to re-use the previously-allocated IRTE;

5) Some cleanups:
   a) Change some failure handlings in enable_intremap() to panic().
   b) Remove the unnecessary local variable iec_cap in
   __iommu_flush_iec().
   c) Add a dmar_writeq(iommu->reg, DMAR_IQT_REG, 0) in
   enable_qinval().

Signed-off-by: Dexuan Cui <dexuan.cui@intel.com>
16 years agoDo not pass DIRECTMAP_VIRT_END to virt_to_mfn(). It's out of range.
Keir Fraser [Fri, 20 Mar 2009 09:34:09 +0000 (09:34 +0000)]
Do not pass DIRECTMAP_VIRT_END to virt_to_mfn(). It's out of range.

Signed-off-by: Xiaowei Yang <xiaowei.yang@intel.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agovtd: only enable Interrupt Remapping if Queued Invalidation is also enabled.
Keir Fraser [Fri, 20 Mar 2009 09:10:55 +0000 (09:10 +0000)]
vtd: only enable Interrupt Remapping if Queued Invalidation is also enabled.

If Queued Invalidation is not supported or not enabled, we should not
enable Interrupt Remapping even if HW supports it, because Interrupt
Remapping needs Queued Invalidation to invalidate Interrupt Remapping
Cache.

Signed-off-by: Dexuan Cui <dexuan.cui@intel.com>
16 years agoxenpm: Add a small scheduler knob "sched_smt_power_savings"
Keir Fraser [Fri, 20 Mar 2009 08:59:47 +0000 (08:59 +0000)]
xenpm: Add a small scheduler knob "sched_smt_power_savings"

Current scheduler only care performance, thus always picks pCPU from
the most idle package. This knob provides another option to pick pCPU from
least idle package, for user who want performance power balance.

Signed-off-by: Yu Ke <ke.yu@intel.com>
Signed-off-by: Tian Kevin <kevin.tian@intel.com>
16 years agoTypo fix for request_irq
Keir Fraser [Fri, 20 Mar 2009 08:57:45 +0000 (08:57 +0000)]
Typo fix for request_irq

Signed-off-by: Wei Gang <gang.wei@intel.com>
16 years agox86_64: Fix handling of DIRECTMAP_VIRT_START in virt_to_maddr().
Keir Fraser [Fri, 20 Mar 2009 08:57:23 +0000 (08:57 +0000)]
x86_64: Fix handling of DIRECTMAP_VIRT_START in virt_to_maddr().

Signed-off-by: Xiaowei Yang <xiaowei.yang@intel.com>
16 years agoAdd cpufreq actual average freq information to xenpm tools
Keir Fraser [Fri, 20 Mar 2009 08:49:17 +0000 (08:49 +0000)]
Add cpufreq actual average freq information to xenpm tools

Signed-off-by: Liu, Jinsong <jinsong.liu@intel.com>
16 years agoDeliver cpufreq actual average freq to libxc interface
Keir Fraser [Fri, 20 Mar 2009 08:48:57 +0000 (08:48 +0000)]
Deliver cpufreq actual average freq to libxc interface

Calculate cpu actual average freq according to aperf and mperf MSR and
then deliver to libxc interface

Signed-off-by: Liu, Jinsong <jinsong.liu@intel.com>
16 years agocpufreq: Update cpufreq aperf and mperf read, so that it can be used
Keir Fraser [Fri, 20 Mar 2009 08:48:17 +0000 (08:48 +0000)]
cpufreq: Update cpufreq aperf and mperf read, so that it can be used
by both ondemand gov and user program

Current __get_measured_perf read aperf and mperf MSR and then clear
them for the sake of ondemand governor. This solution block user
program to get aperf and mperf on their purpose. In this patch, it no
longer clear aperf and mperf MSR, so that it can be used by both
ondemand gov and user program.

Signed-off-by: Liu, Jinsong <jinsong.liu@intel.com>
16 years agomcheck: Small fix to Intel CMCI code
Keir Fraser [Fri, 20 Mar 2009 08:44:54 +0000 (08:44 +0000)]
mcheck: Small fix to Intel CMCI code

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agofs-back: unmapp all shared ring pages when a domain dies
Keir Fraser [Fri, 20 Mar 2009 08:43:53 +0000 (08:43 +0000)]
fs-back: unmapp all shared ring pages when a domain dies

This bug causes some pages not to be properly freed by xen, hence
after a cycle of 50 save\restore we don't have enough spare memory to
start 2 VMs any more. This patch fixes the issue properly unmapping
all the pages needed by the shared ring.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agoxend: fix vscsi error path
Keir Fraser [Fri, 20 Mar 2009 08:42:59 +0000 (08:42 +0000)]
xend: fix vscsi error path

Fix xend start problem on non-Linux Dom0 where /proc/mount doesn't exist.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agohvm: Allow HVM guests direct access to 0xed debug/delay port
Keir Fraser [Fri, 20 Mar 2009 08:42:14 +0000 (08:42 +0000)]
hvm: Allow HVM guests direct access to 0xed debug/delay port

Port 0x80 is already passed-through.  Writes to port 0x80 are
traditionally used as IO delay.  Recent Linux kernel versions can be
configured to use port 0xed instead (CONFIG_IO_DELAY_0XED=3Dy).
Ubuntu 8.04 and later kernels are configured to use port 0xed.

In Linux IO delay is implemented by (in|out)[bwl]_p.  From a first
glance at Linux it looks like the only relevant usage of delayed IO is
in drivers/video/console/vgacon.c.  Affected is VGA console
initialization and font setup.  Output / scrolling is not affected.

Signed-off-by: Thomas Friebel <thomas.friebel@amd.com>
16 years agotmem: Placeholder hypercall.
Keir Fraser [Fri, 20 Mar 2009 08:40:18 +0000 (08:40 +0000)]
tmem: Placeholder hypercall.

Signed-off-by: Dan Magenheimer <dan.magenheimer@oracle.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86: Fix the 32-bit build after c/s 19400.
Keir Fraser [Fri, 20 Mar 2009 08:34:24 +0000 (08:34 +0000)]
x86: Fix the 32-bit build after c/s 19400.
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86: Ensure we execute LTR when running on non-compat GDT which has
Keir Fraser [Thu, 19 Mar 2009 17:04:06 +0000 (17:04 +0000)]
x86: Ensure we execute LTR when running on non-compat GDT which has
busy bit clear in TSS descriptor.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agobuild system cleanup - get rid of some hardcoded paths
Keir Fraser [Thu, 19 Mar 2009 14:01:56 +0000 (14:01 +0000)]
build system cleanup - get rid of some hardcoded paths

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoClean up physdev.h public header.
Keir Fraser [Thu, 19 Mar 2009 10:24:48 +0000 (10:24 +0000)]
Clean up physdev.h public header.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoXen: use proper device ID to search VT-d unit for ARI and SR-IOV device
Keir Fraser [Thu, 19 Mar 2009 10:20:11 +0000 (10:20 +0000)]
Xen: use proper device ID to search VT-d unit for ARI and SR-IOV device

PCIe Alternative Routing-ID Interpretation (ARI) ECN defines the Extended
Function -- a function whose function number is greater than 7 within an
ARI Device. Intel VT-d spec 1.2 section 8.3.2 specifies that the Extended
Function is under the scope of the same remapping unit as the traditional
function. The hypervisor needs to know if a function is Extended
Function so it can find proper DMAR for it.

And section 8.3.3 specifies that the SR-IOV Virtual Function is under the
scope of the same remapping unit as the Physical Function. The hypervisor
also needs to know if a function is the Virtual Function and which
Physical Function it's associated with for same reason.

Signed-off-by: Yu Zhao <yu.zhao@intel.com>
16 years agoXen: documentation for VT-d/SR-IOV
Keir Fraser [Thu, 19 Mar 2009 10:10:59 +0000 (10:10 +0000)]
Xen: documentation for VT-d/SR-IOV

Add a section about how to use the SR-IOV device with VT-d.

Signed-off-by: Yu Zhao <yu.zhao@intel.com>
16 years agocpuidle and xenpm small cleanups.
Keir Fraser [Thu, 19 Mar 2009 10:10:31 +0000 (10:10 +0000)]
cpuidle and xenpm small cleanups.

Signed-off-by: Guanqun Lu <guanqun.lu@intel.com>
16 years agoxenpm: add timeout option to 'xenpm start' command.
Keir Fraser [Thu, 19 Mar 2009 10:09:59 +0000 (10:09 +0000)]
xenpm: add timeout option to 'xenpm start' command.

Thus we can sample a fixed time of period without manual interruption.

Signed-off-by: Guanqun Lu <guanqun.lu@intel.com>
16 years agoFix a cpufreq userspace limitation bug
Keir Fraser [Thu, 19 Mar 2009 10:09:24 +0000 (10:09 +0000)]
Fix a cpufreq userspace limitation bug

Fix a cpufreq userspace limitation bug, so that userspace freq can
return to correct freq when freq_limitation return to high value (like
ppc event)

Signed-off-by: Liu, Jinsong <jinsong.liu@intel.com>=
16 years agoxentrace: Add acpi pm tick output to idle tracing
Keir Fraser [Thu, 19 Mar 2009 10:08:48 +0000 (10:08 +0000)]
xentrace: Add acpi pm tick output to idle tracing

The reason is that tsc stops and it causes the inaccuracy.
And later we can write some scripts based on this patch.

Signed-off-by: Guanqun Lu <guanqun.lu@intel.com>
16 years agox86, hvm: Fix double-free of vpmu->context
Keir Fraser [Thu, 19 Mar 2009 10:05:01 +0000 (10:05 +0000)]
x86, hvm: Fix double-free of vpmu->context

When `opcontrol --shutdown' is called after xenoprof is used on Dom0,
the vpmu owner becomes PMU_OWNER_NONE. So it is possible to acquire
the owner as PMU_OWNER_HVM and to allocate vpmu->context twice. As a
result, the hypervisor panics because of double-alloc/free of
vpmu->context.

Signed-off-by: Kazuhiro Suzuki <kaz@jp.fujitsu.com>
16 years agox86 dom builder: Allocate domU pages in batches.
Keir Fraser [Thu, 19 Mar 2009 10:04:15 +0000 (10:04 +0000)]
x86 dom builder: Allocate domU pages in batches.

Allows a 32-bit dom0 to create very large guests.

Signed-off-by: Mukesh Rathor <mukesh.rathor@oracle.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoioemu-stubdom: Use xen-setup-stubdom rather than configure
Keir Fraser [Wed, 18 Mar 2009 17:30:13 +0000 (17:30 +0000)]
ioemu-stubdom: Use xen-setup-stubdom rather than configure

xen-setup runs the upstream configure script, and ends up
autodetecting various properties of the host and then trying to use
them in stubdom, which works badly.

In commit c609854fe18aab3216f28e1e5291b75970d398d5 in
qemu-xen-unstable we provide a new `xen-setup-stubdom' script, which
hardcodes the relevant answers.

With these two changes, stubdom should compile again.

Signed-off-by: Ian Jackson <ian.jackson@eu.citrix.com>
16 years agolibxc: Use write_exact() in lock_suspend_event().
Keir Fraser [Wed, 18 Mar 2009 17:25:54 +0000 (17:25 +0000)]
libxc: Use write_exact() in lock_suspend_event().
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoAdd PVUSB and PVSCSI interface headers.
Keir Fraser [Wed, 18 Mar 2009 15:42:47 +0000 (15:42 +0000)]
Add PVUSB and PVSCSI interface headers.
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agofs-back: build fixes
Keir Fraser [Wed, 18 Mar 2009 13:25:25 +0000 (13:25 +0000)]
fs-back: build fixes

Remove some unused variables and replaces read and write
to the pipe with read_exact and write_exact (these two functions are
implemented in libxc, that we have to link anyway).

This allows fs-backed to be compiled with -D_FORTIFY_SOURCE=2, hence
should fix the problems reported by Boris.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agoActually set a HVM domain's time offset when it sets the RTC
Keir Fraser [Wed, 18 Mar 2009 13:24:42 +0000 (13:24 +0000)]
Actually set a HVM domain's time offset when it sets the RTC

Signed-off-by: Tim Deegan <Tim.Deegan@citrix.com>
16 years agofs-backend.c misses the unistd.h include for read() and write().
Keir Fraser [Wed, 18 Mar 2009 11:51:30 +0000 (11:51 +0000)]
fs-backend.c misses the unistd.h include for read() and write().

Signed-off-by: Stefan Berger <stefanb@us.ibm.com>
16 years agox86: Change the mod_lx_entry to accept the vcpu parameter
Keir Fraser [Wed, 18 Mar 2009 11:50:16 +0000 (11:50 +0000)]
x86: Change the mod_lx_entry to accept the vcpu parameter

Signed-off-by: Jiang, Yunhong <yunhong.jiang@intel.com>
16 years agoMove the suspend event channel function to libxc, it will use the
Keir Fraser [Wed, 18 Mar 2009 11:49:26 +0000 (11:49 +0000)]
Move the suspend event channel function to libxc, it will use the
/var/lib/xen/suspend_evtchn_lock.d to protect the access.

Signed-off-by: Jiang Yunhong <yunhong.jiang@intel.com>
16 years agoxc_save: remove the dependency on the global si structure
Keir Fraser [Wed, 18 Mar 2009 11:48:48 +0000 (11:48 +0000)]
xc_save: remove the dependency on the global si structure

Signed-off-by: Jiang Yunhong <yunhong.jiang@intel.com>
16 years agovtd: Only enable some VT-d features if all VT-d engines support them.
Keir Fraser [Wed, 18 Mar 2009 11:46:32 +0000 (11:46 +0000)]
vtd: Only enable some VT-d features if all VT-d engines support them.

By default, we enable snoop control, queued invalidation and interrupt
remapping if all VT-d engines support them, and for DMA passthrough we
don't enable it by default.

A user can use 'iommu=passthrough' to enable DMA passthrough (only
for Dom0). A user can use 'iommu=no-snoop,no-qinval,no-intremap' to
disable the 3 features.

Signed-off-by: Dexuan Cui <dexuan.cui@intel.com>=
16 years agoDisable xen platform PCI device when xen_platform_pci=0 is specified
Keir Fraser [Wed, 18 Mar 2009 11:37:59 +0000 (11:37 +0000)]
Disable xen platform PCI device when xen_platform_pci=0 is specified

 - Change guest firmware to use new fixed byte port 0x10.
 - Add "xen_platform_pci" option into guest config file and APIs.
    xen_platform_pci=0: Disable xen platform device.
    xen_platform_pci=1: Enable xen platform device. (default)
 - Add "disable_pf" entry into xenstore.

Signed-off-by: Yuji Shimada <shimada-yxb@necst.nec.co.jp>
16 years agoxend: Fix vscsi assignment when XenAPI is available
Keir Fraser [Wed, 18 Mar 2009 11:34:20 +0000 (11:34 +0000)]
xend: Fix vscsi assignment when XenAPI is available

When XenAPI is available, xm scsi-attach fails.  Also xm new and
xm create fail if a vscsi parameter is defined into domain
configuration files.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
16 years agoxend: Use vnc-listen setting specified in xend config file
Keir Fraser [Wed, 18 Mar 2009 11:32:46 +0000 (11:32 +0000)]
xend: Use vnc-listen setting specified in xend config file

If domU frame buffer configuration has no vnclisten attribute set, use
the global vnc-listen address specified in xend configuration file
when updating vfb location.  Currently the hardcoded value 'localhost'
is used.

Signed-off-by: Jim Fehlig <jfehlig@novell.com>
16 years agox86: Assert non-zero refcount on entry to put_page().
Keir Fraser [Tue, 17 Mar 2009 15:40:25 +0000 (15:40 +0000)]
x86: Assert non-zero refcount on entry to put_page().
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoRevert debugging patch c/s 19344.
Keir Fraser [Tue, 17 Mar 2009 15:29:57 +0000 (15:29 +0000)]
Revert debugging patch c/s 19344.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86: Fix get_page() to not drop reference count if it wasn't incremented.
Keir Fraser [Tue, 17 Mar 2009 15:29:20 +0000 (15:29 +0000)]
x86: Fix get_page() to not drop reference count if it wasn't incremented.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86 mcheck: Fix the build.
Keir Fraser [Tue, 17 Mar 2009 14:32:52 +0000 (14:32 +0000)]
x86 mcheck: Fix the build.
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86 shadow: Fix 2-on-3 mode.
Keir Fraser [Tue, 17 Mar 2009 14:30:15 +0000 (14:30 +0000)]
x86 shadow: Fix 2-on-3 mode.

Signed-off-by: Gianluca Guida <Gianluca.Guida@eu.citrix.com>
16 years agox86 shadow: Prevent in-sync L1s to become writable
Keir Fraser [Tue, 17 Mar 2009 14:29:26 +0000 (14:29 +0000)]
x86 shadow: Prevent in-sync L1s to become writable
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agofs-back: Fixes and cleanups
Keir Fraser [Tue, 17 Mar 2009 14:28:33 +0000 (14:28 +0000)]
fs-back: Fixes and cleanups

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agoAdd CPU status info and a status call to the CPU on/offline sysctls.
Keir Fraser [Tue, 17 Mar 2009 14:25:41 +0000 (14:25 +0000)]
Add CPU status info and a status call to the CPU on/offline sysctls.

Signed-off-by: Frank van der Linden <frank.vanderlinden@sun.com>
16 years agox86 mcheck: Provide MCA "injection" hypervisor services.
Keir Fraser [Tue, 17 Mar 2009 14:24:08 +0000 (14:24 +0000)]
x86 mcheck: Provide MCA "injection" hypervisor services.

Signed-off-by: Gavin Maltby <gavin.maltby@sun.com>
16 years agox86 mcheck: Replace hypervisor MCA telemetry structures with something
Keir Fraser [Tue, 17 Mar 2009 14:22:50 +0000 (14:22 +0000)]
x86 mcheck: Replace hypervisor MCA telemetry structures with something
more robust and designed to make terminal error telemetry available to
the dom0 panic flow for diagnosis on reboot.

Use common code for a lot of the AMD and Intel MCE handling code.

Signed-off-by: Gavin Maltby <gavin.maltby@sun.com>
Signed-off-by: Frank van der Linden <frank.vanderlinden@sun.com>
16 years agoMake stubdom compile on systems with libvde installed
Keir Fraser [Tue, 17 Mar 2009 14:21:18 +0000 (14:21 +0000)]
Make stubdom compile on systems with libvde installed

We need to disable libvde in the stubdom qemu-xen configuration.
Thanks to Jun Koi for the report and testing.

Signed-off-by: Ian Jackson <ian.jackson@eu.citrix.com>
16 years agoFix MAX_EVTCHNS() definition.
Keir Fraser [Tue, 17 Mar 2009 13:21:46 +0000 (13:21 +0000)]
Fix MAX_EVTCHNS() definition.

Pointed out by Jan Beulich.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86, cpuidle: disable ARB_DISABLE access for latest intel platforms
Keir Fraser [Tue, 17 Mar 2009 10:50:16 +0000 (10:50 +0000)]
x86, cpuidle: disable ARB_DISABLE access for latest intel platforms

ARB_DISABLE is a nop on all of the recent Intel platforms. Disable
ARB_DISABLE and attached c3_lock on C3 entry exit for such platforms.

Signed-off-by: Venkatesh Pallipadi <venkatesh.pallipadi@intel.com>
Signed-off-by: Wei Gang <gang.wei@intel.com>
16 years agoxend: Add s3_integrity attribute for XenAPI
Keir Fraser [Tue, 17 Mar 2009 10:49:42 +0000 (10:49 +0000)]
xend: Add s3_integrity attribute for XenAPI

When XenAPI is available, xm new and xm create fail.

# xm new vm1
Using config file "/etc/xen/vm1".
Attribute 's3_integrity' not declared
# xm create vm1
Using config file "/etc/xen/vm1".
Attribute 's3_integrity' not declared

This patch adds an s3_integrity into create.dtd.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
16 years agohvmloader: Allow any unused PCI device to be used for PCI hotplug
Keir Fraser [Tue, 17 Mar 2009 10:48:52 +0000 (10:48 +0000)]
hvmloader: Allow any unused PCI device to be used for PCI hotplug

Signed-off-by: Simon Horman <horms@verge.net.au>
16 years agoxend: Use AUTO_PHP_SLOT_STR=0xff rather than 0x0 for auto pci allocation
Keir Fraser [Tue, 17 Mar 2009 10:42:32 +0000 (10:42 +0000)]
xend: Use AUTO_PHP_SLOT_STR=0xff rather than 0x0 for auto pci allocation

Rather than use "0x0" to signify that the hotplug slot should
be automatically selected by qemu-dm, define AUTO_PHP_SLOT_STR.

At the same time, change the magic value form 0x0 to 0xff,
in line with changes made to qemu-dm to allow any unused PCI
device to be used for hotplug.

Signed-off-by: Simon Horman <horms@verge.net.au>
16 years agoxend: Accept udev events and update physical resource information
Keir Fraser [Tue, 17 Mar 2009 10:40:47 +0000 (10:40 +0000)]
xend: Accept udev events and update physical resource information

When a udev event is received, udevevent.py parses the udev data and
tells XendNode.py to update the physical resource information.
This patch also add a boolean parameter 'xend-udev-event-server', to
let users indicate whether we should enable this function or not.

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoAdd udev rules to deliver hw events to xend
Keir Fraser [Tue, 17 Mar 2009 10:38:06 +0000 (10:38 +0000)]
Add udev rules to deliver hw events to xend

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoxend: Add lock for lspci_info in pci.py
Keir Fraser [Tue, 17 Mar 2009 10:37:12 +0000 (10:37 +0000)]
xend: Add lock for lspci_info in pci.py

Add lspci_info_lock to protect lspci_info.

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoxend: Add lock for xen-api class instances in XendAPIStore.py
Keir Fraser [Tue, 17 Mar 2009 10:36:51 +0000 (10:36 +0000)]
xend: Add lock for xen-api class instances in XendAPIStore.py

Add __classes_lock to protect __classes, since it can be modified by
the udev listener thread.

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoxend: Implement DGRAM (connectionless) type socket listeners
Keir Fraser [Tue, 17 Mar 2009 10:36:20 +0000 (10:36 +0000)]
xend: Implement DGRAM (connectionless) type socket listeners

Introduce SocketDgramListener and UnixDgramListener classes.
We already have STREAM (connection) type socket listener classes in
the source tree, but we need DGRAM (connectionless) type listeners to
receive udev events.

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoxenpm: Add CPU topology info (thread/core/socket)
Keir Fraser [Fri, 13 Mar 2009 10:09:25 +0000 (10:09 +0000)]
xenpm: Add CPU topology info (thread/core/socket)

CPU topology info is necessary for power management analysis. For
example, to analysis the effect of Px state coordination, Cx
package/core coordination,  the thread/core/socket topology
information is needed.

This patch add new command "get-cpu-topology" in xenpm to print the
CPU topology info:

Signed-off-by: Yu Ke <ke.yu@intel.com>
16 years ago[SVM] Always read zero AMD C1E control MSR to allow cross-vendor migration
Keir Fraser [Fri, 13 Mar 2009 07:45:11 +0000 (07:45 +0000)]
[SVM] Always read zero AMD C1E control MSR to allow cross-vendor migration

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years ago[SVM] fix segment descriptors to allow migration to Intel VT systems
Keir Fraser [Fri, 13 Mar 2009 07:43:45 +0000 (07:43 +0000)]
[SVM] fix segment descriptors to allow migration to Intel VT systems

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoxend: Fix xm scsi-attach for host OS without lsscsi
Keir Fraser [Fri, 13 Mar 2009 07:38:47 +0000 (07:38 +0000)]
xend: Fix xm scsi-attach for host OS without lsscsi

In the case of a host OS without lsscsi command, xm scsi-attach
command fails even if you specify existing SCSI device.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
16 years agoxend: Fix domain core-dumping about reset option
Keir Fraser [Fri, 13 Mar 2009 07:38:05 +0000 (07:38 +0000)]
xend: Fix domain core-dumping about reset option

This patch moves a call processing of a domain reset processing
from xm to xend.  Also, it adds a reset argument to do_dump() of
SrvDomain.py.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
16 years agominios: allow to allocate machine contiguous pages
Keir Fraser [Fri, 13 Mar 2009 07:37:24 +0000 (07:37 +0000)]
minios: allow to allocate machine contiguous pages

This is a port of XenLinux xen_alloc_contig_memory() to mini-os. A
sufficiently privileged mini-os guest can exchange a small number of
its pages with machine contiguous pages.

Signed-off-by: Rolf Neugebauer <rolf.neugebauer@netronome.com>
16 years agoblktap: Move error signaling to blktapctrl
Keir Fraser [Thu, 12 Mar 2009 18:48:09 +0000 (18:48 +0000)]
blktap: Move error signaling to blktapctrl

Until now the udev script for blktap devices needs to decide if to
signal success or failure to xend. As this script runs completely
independent of blktapctrl and tapdisk/ioemu which do the real work,
the udev script can't even theoretically know if tapdisk is happy.

This patch removes the udev script and replaces its checks by new
ones in libblktap.

Signed-off-by: Kevin Wolf <kwolf@suse.de>
16 years agoblktapctrl: Fix too early close of pipes
Keir Fraser [Thu, 12 Mar 2009 18:46:26 +0000 (18:46 +0000)]
blktapctrl: Fix too early close of pipes

Connections to ioemu have single_handler set, so they are closed as
soon as all images of a certain type are closed. This is wrong with
ioemu: All images that belong to the same domain are handled by the
same backend process (usually qemu-dm, but also tapdisk-ioemu for
domains without device model), regardless of the image type.

This patch checks for the same-domain condition for ioemu connections.

Signed-off-by: Kevin Wolf <kwolf@suse.de>
16 years agoblktap: Export disk type constants for ioemu
Keir Fraser [Thu, 12 Mar 2009 18:42:59 +0000 (18:42 +0000)]
blktap: Export disk type constants for ioemu

Currently all disk types that are supported are defined in a header
file private to blktapctrl and tapdisk. When restoring ioemu as a
backend for blktap these constants are needed by ioemu, so move them
to a more public header file.

Signed-off-by: Kevin Wolf <kwolf@suse.de>
16 years agoblktapctrl: Select backend by prefix
Keir Fraser [Thu, 12 Mar 2009 18:42:31 +0000 (18:42 +0000)]
blktapctrl: Select backend by prefix

This patch adds support for specifying the backend (tapdisk or ioemu)
to blktapctrl. Images can be specified e.g. as tap:tapdisk:aio,
tap:ioemu:qcow2 or tap:vmdk. When omitting the backend, a default is
chosen based on the image type (currently always tapdisk because ioemu
as a backend is broken until a follow-up patch series against qemu-xen
is applied)

Signed-off-by: Kevin Wolf <kwolf@suse.de>
16 years agoxend: fix regression in c/s 19330
Keir Fraser [Thu, 12 Mar 2009 15:40:52 +0000 (15:40 +0000)]
xend: fix regression in c/s 19330

attached patch fixes a regression in c/s 19330 which
prevents to start guests on a Linux Dom0.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoAdd debug code to catch count_info errors in page_alloc.c
Keir Fraser [Thu, 12 Mar 2009 15:37:12 +0000 (15:37 +0000)]
Add debug code to catch count_info errors in page_alloc.c

This patch is temporary.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoCode cleanups after page offline patch.
Keir Fraser [Thu, 12 Mar 2009 15:31:36 +0000 (15:31 +0000)]
Code cleanups after page offline patch.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoxend: Fix removing /vm/UUID/device paths when device cannot be disconnected
Keir Fraser [Thu, 12 Mar 2009 15:08:08 +0000 (15:08 +0000)]
xend: Fix removing /vm/UUID/device paths when device cannot be disconnected

This is a corrected version of a patch commited as c/s 19250 and
reverted by c/s 19314. Thanks to "trap sigerr ERR" in
xen-hotplug-common.sh the xen-hotplug-cleanup would exit when reading
/local/domain/ID/vm fails thus skipping all the xenstore-rm lines in
the rest of the script.

Change deviceDestroy behavior to remove /vm/UUID/device/...
path only when force was used (as it already does so for both frontend
and backend) and do the removing from xen-hotplug-cleanup script when
we are sure the device is really not attached to the guest any more.

Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
16 years agoxentrace: error handling on non-existent formats file
Keir Fraser [Thu, 12 Mar 2009 15:07:20 +0000 (15:07 +0000)]
xentrace: error handling on non-existent formats file

Attached patch does proper error handling when specified
formats file doesn't exist.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoxend: Use parse_hex() in XendPPCI.py
Keir Fraser [Thu, 12 Mar 2009 11:33:35 +0000 (11:33 +0000)]
xend: Use parse_hex() in XendPPCI.py

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agoxend: Do not create XendNetwork instance of tmpbridge
Keir Fraser [Thu, 12 Mar 2009 11:33:06 +0000 (11:33 +0000)]
xend: Do not create XendNetwork instance of tmpbridge

This patch prevents xend from creating a XendNetwork instance of
tmpbridge (a temporary bridge used in network-bridge script).

Signed-off-by: Yosuke Iwamatsu <y-iwamatsu@ab.jp.nec.com>
16 years agox86: Fix a typo in vpmu_core2.c
Keir Fraser [Thu, 12 Mar 2009 11:26:32 +0000 (11:26 +0000)]
x86: Fix a typo in vpmu_core2.c

When the HVM guest which is xenoprof's passive domain, is going to
shutdown, the Hypervisor panics because of typo in vpmu_core2.c

Signed-off-by: Kazuhiro Suzuki <kaz@jp.fujitsu.com>
16 years agoPV-on-HVM: Add a compatibility linux/scatterlist.h for kernels before 2.6.12
Keir Fraser [Thu, 12 Mar 2009 11:25:40 +0000 (11:25 +0000)]
PV-on-HVM: Add a compatibility linux/scatterlist.h for kernels before 2.6.12

Signed-off-by: KUWAMURA Shin'ya <kuwa@jp.fujitsu.com>
16 years agoFix Makefile portability: head -n -1 isn't portable.
Keir Fraser [Thu, 12 Mar 2009 11:24:25 +0000 (11:24 +0000)]
Fix Makefile portability: head -n -1 isn't portable.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agox86: Fix APIC 0x40 error when CPU online and Host s3 resume
Keir Fraser [Thu, 12 Mar 2009 11:16:54 +0000 (11:16 +0000)]
x86: Fix APIC 0x40 error when CPU online and Host s3 resume

disable_APIC_timer actually is not useful here. Actually it will
trigger a local APIC error when masking the LVT entry when vector is
zero (before timer is inited) on Intel P6 family. This APIC error(40)
appears when online the offlined CPU and Host S3 resume.

Signed-off-by: Liping Ke <liping.ke@intel.com>
Signed-off-by: Gang Wei <gang.wei@intel.com>
16 years agoFix xenstore permissions for Solaris domUs.
Keir Fraser [Thu, 12 Mar 2009 11:14:22 +0000 (11:14 +0000)]
Fix xenstore permissions for Solaris domUs.

Solaris domUs use hvmpv/ and guest/ frontend xenstore directories.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agorombios: __STRING() is Linux-specific
Keir Fraser [Thu, 12 Mar 2009 11:12:43 +0000 (11:12 +0000)]
rombios: __STRING() is Linux-specific

Signed-off-by: John Levon <john.levon@sun.com>
16 years agopygrub: Detect NetWare PV in pygrub
Keir Fraser [Thu, 12 Mar 2009 11:10:22 +0000 (11:10 +0000)]
pygrub: Detect NetWare PV in pygrub

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoDomain core-dumping fixes
Keir Fraser [Thu, 12 Mar 2009 11:09:57 +0000 (11:09 +0000)]
Domain core-dumping fixes

The code was attempting to use the domain's current number of pages
(info.nr_pages) as a maximum index.  We then walk the memory map and
can easily over-write past the end of the nr_pages-sized array, if the
domain has more pages mapped in than earlier (live dump).  Restrict
ourselves to the current number of pages.

Also fix the dump core method in xend to actually implement the crash
and live options.  In particular this means that xend clients other
than xm now get non-live dumps by default.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoxend: Skip tap pipe creation on Solaris
Keir Fraser [Thu, 12 Mar 2009 11:08:45 +0000 (11:08 +0000)]
xend: Skip tap pipe creation on Solaris

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoxend: Crash the domain if qemu-dm stops running
Keir Fraser [Thu, 12 Mar 2009 11:07:34 +0000 (11:07 +0000)]
xend: Crash the domain if qemu-dm stops running

The comment about the lack of restart prevention is not accurate.

Bump the minimum restart time to 60 seconds.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoFix qemu spawn for Solaris
Keir Fraser [Thu, 12 Mar 2009 11:07:00 +0000 (11:07 +0000)]
Fix qemu spawn for Solaris

On Solaris, xend runs in a 'process contract' such that all children
are killed when the service is restarted.  Spawn qemu processes in a
new contract to avoid this.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoxend: Allow replacement of a cdrom device on a device_add
Keir Fraser [Thu, 12 Mar 2009 11:01:25 +0000 (11:01 +0000)]
xend: Allow replacement of a cdrom device on a device_add

When adding a device, allow a cd to replace another if the device is a
cdrom.

Signed-off by: Ryan Scott <ryan.scott@sun.com>

16 years agoxend: Time-out if guest fails to suspend
Keir Fraser [Thu, 12 Mar 2009 11:00:52 +0000 (11:00 +0000)]
xend: Time-out if guest fails to suspend

If a guest fails to re-write control/shutdown node within a minute,
fail the suspend operation.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoxend: Allow off-line NIC removal
Keir Fraser [Thu, 12 Mar 2009 10:59:53 +0000 (10:59 +0000)]
xend: Allow off-line NIC removal

Match a MAC address so device_destroy can remove off-line NICs.

Signed-off-by: John Levon <john.levon@sun.com>
16 years agoxend: Correctly catch invalid domain lookups
Keir Fraser [Thu, 12 Mar 2009 10:59:19 +0000 (10:59 +0000)]
xend: Correctly catch invalid domain lookups

If a client asks about a non-existent domain, report the error back
properly instead of just backtracing within xend.

Signed-off-by: John Levon <john.levon@sun.com>