]> xenbits.xensource.com Git - people/liuw/libxenctrl-split/xen.git/log
people/liuw/libxenctrl-split/xen.git
15 years agox86 hvm: Allow cross-vendor migration
Keir Fraser [Tue, 26 May 2009 14:01:36 +0000 (15:01 +0100)]
x86 hvm: Allow cross-vendor migration

Intercept #UD and emulate SYSCALL/SYSENTER/SYSEXIT as necessary.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agoblktap2: a completely rewritten blktap implementation
Keir Fraser [Tue, 26 May 2009 10:52:31 +0000 (11:52 +0100)]
blktap2: a completely rewritten blktap implementation

Benefits to blktap2 over the old version of blktap:

* Isolation from xenstore - Blktap devices are now created directly on
   the linux dom0 command line, rather than being spawned in response
   to XenStore events.  This is handy for debugging, makes blktap
   generally easier to work with, and is a step toward a generic
   user-level block device implementation that is not Xen-specific.

* Improved tapdisk infrastructure: simpler request forwarding, new
   request scheduler, request merging, more efficient use of AIO.

* Improved tapdisk error handling and memory management.  No
   allocations on the block data path, IO retry logic to protect
   guests
   transient block device failures.  This has been tested and is known
   to work on weird environments such as NFS soft mounts.

* Pause and snapshot of live virtual disks (see xmsnap script).

* VHD support.  The VHD code in this release has been rigorously
   tested, and represents a very mature implementation of the VHD
   image
   format.

* No more duplication of mechanism with blkback.  The blktap kernel
   module has changed dramatically from the original blktap.  Blkback
   is now always used to talk to Xen guests, blktap just presents a
   Linux gendisk that blkback can export.  This is done while
   preserving the zero-copy data path from domU to physical device.

These patches deprecate the old blktap code, which can hopefully be
removed from the tree completely at some point in the future.

Signed-off-by: Jake Wires <jake.wires@citrix.com>
Signed-off-by: Dutch Meyer <dmeyer@cs.ubc.ca>
15 years agoTranscendent memory ("tmem") for Xen.
Keir Fraser [Tue, 26 May 2009 10:05:04 +0000 (11:05 +0100)]
Transcendent memory ("tmem") for Xen.

Tmem, when called from a tmem-capable (paravirtualized) guest, makes
use of otherwise unutilized ("fallow") memory to create and manage
pools of pages that can be accessed from the guest either as
"ephemeral" pages or as "persistent" pages.  In either case, the pages
are not directly addressible by the guest, only copied to and fro via
the tmem interface.  Ephemeral pages are a nice place for a guest to
put recently evicted clean pages that it might need again; these pages
can be reclaimed synchronously by Xen for other guests or other uses.
Persistent pages are a nice place for a guest to put "swap" pages to
avoid sending them to disk.  These pages retain data as long as the
guest lives, but count against the guest memory allocation.

Tmem pages may optionally be compressed and, in certain cases, can be
shared between guests.  Tmem also handles concurrency nicely and
provides limited QoS settings to combat malicious DoS attempts.
Save/restore and live migration support is not yet provided.

Tmem is primarily targeted for an x86 64-bit hypervisor.  On a 32-bit
x86 hypervisor, it has limited functionality and testing due to
limitations of the xen heap.  Nearly all of tmem is
architecture-independent; three routines remain to be ported to ia64
and it should work on that architecture too.  It is also structured to
be portable to non-Xen environments.

Tmem defaults off (for now) and must be enabled with a "tmem" xen boot
option (and does nothing unless a tmem-capable guest is running).  The
"tmem_compress" boot option enables compression which takes about 10x
more CPU but approximately doubles the number of pages that can be
stored.

Tmem can be controlled via several "xm" commands and many interesting
tmem statistics can be obtained.  A README and internal specification
will follow, but lots of useful prose about tmem, as well as Linux
patches, can be found at http://oss.oracle.com/projects/tmem .

Signed-off-by: Dan Magenheimer <dan.magenheimer@oracle.com>
15 years agouninstall: get rid of hardcoded pathes
Keir Fraser [Tue, 26 May 2009 09:14:34 +0000 (10:14 +0100)]
uninstall: get rid of hardcoded pathes

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agox86 hvm viridian: Provide dummy support for APIC assist page to satisfy Win7.
Keir Fraser [Tue, 26 May 2009 09:13:43 +0000 (10:13 +0100)]
x86 hvm viridian: Provide dummy support for APIC assist page to satisfy Win7.

From: Tim Deegan <tim.deegan@citrix.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agopvgrub: few lines in shutdown_blkfront were removed by mistake.
Keir Fraser [Tue, 26 May 2009 09:05:27 +0000 (10:05 +0100)]
pvgrub: few lines in shutdown_blkfront were removed by mistake.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agoxm, xend: passthrough: Add assigned_or_requested_vslot()
Keir Fraser [Tue, 26 May 2009 09:04:10 +0000 (10:04 +0100)]
xm, xend: passthrough: Add assigned_or_requested_vslot()

Add an accessor to simplify accessing vslot if available,
otherwise requested_vslot.

Signed-off-by: Simon Horman <horms@verge.net.au>
15 years agoxend: Fix xm pci-detach for inactive devices
Keir Fraser [Tue, 26 May 2009 09:03:09 +0000 (10:03 +0100)]
xend: Fix xm pci-detach for inactive devices

In the case where a device is attached to an inactive domain
and then removed before the domain is activated it won't have
a vslot assigned, but it should still be valid to remove it.

I don't think that there are any other cases where vslot can
be invalid.
Signed-off-by: Simon Horman <horms@verge.net.au>
15 years agoblkif: Clarify units for 'sector'-sized blkif request params.
Keir Fraser [Tue, 26 May 2009 09:01:54 +0000 (10:01 +0100)]
blkif: Clarify units for 'sector'-sized blkif request params.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agoAdd support for superpages (hugepages) in PV domain
Keir Fraser [Tue, 26 May 2009 08:58:38 +0000 (09:58 +0100)]
Add support for superpages (hugepages) in PV domain

This patch adds the option "superpages" to the domain configuration
file. If it is set, the domain is populated using 2M pages.

This code does not support fallback to small pages.  If the domain can
not be created with 2M pages, the  create will fail.

The patch also includes support for saving and restoring domains with
the superpage flag set.  However, if a domain has freed small pages
within its physical page array and then extended the array, the
restore will fill in those freed pages.  It will then attempt to
allocate more than its memory limit and will fail.  This is
significant because apparently Linux does this during boot, thus a
freshly booted Linux image can not be saved and restored successfully.

Signed-off-by: Dave McCracken <dcm@mccr.org>
15 years agominios: replace mktime implementation
Keir Fraser [Tue, 26 May 2009 08:54:53 +0000 (09:54 +0100)]
minios: replace mktime implementation

In the efforts to clarify MiniOS license it came to my attention that
few portions of MiniOS were taken from other GPL projects, one of them
is the mktime implementation. This patch replaces the current GPL
licensed mktime implementation with a different and BSD licensed
version.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agoPV-on-HVM: Define atomic_cmpxchg() for old Linux kernels.
Keir Fraser [Tue, 26 May 2009 08:52:59 +0000 (09:52 +0100)]
PV-on-HVM: Define atomic_cmpxchg() for old Linux kernels.

Signed-off-by: KUWAMURA Shin'ya <kuwa@jp.fujitsu.com>
15 years agostubdom: 'file' based disk sharing
Keir Fraser [Tue, 26 May 2009 08:50:35 +0000 (09:50 +0100)]
stubdom: 'file' based disk sharing

Allow 'file' based disks, that are blkback based disks, to be shared
between the guest domain and the stubdom. It does so exploiting the
same exception introduced in the previous patch "stubdoms phy disks
sharing". Now we can remove the hack in stubdom-dm that forces "file"
disks to be opened using blktap instead of blkback.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agominios: Fix blkfront driver when sector_size != 512
Keir Fraser [Tue, 26 May 2009 08:49:19 +0000 (09:49 +0100)]
minios: Fix blkfront driver when sector_size != 512

The first and last sector as well as the sector number of the request
is expressed in 512 bytes units, independently from the real sector
size.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agoxend: Fix typo in usage of new auxbin.xen_configdir() function
Keir Fraser [Thu, 21 May 2009 03:31:47 +0000 (04:31 +0100)]
xend: Fix typo in usage of new auxbin.xen_configdir() function

Signed-off-by: Keir Fraser <keir.fraser@eu.citrix.com>
15 years agoACPI/NUMA: Improve SRAT parsing
Keir Fraser [Wed, 20 May 2009 15:02:50 +0000 (16:02 +0100)]
ACPI/NUMA: Improve SRAT parsing

This is to properly handle SRAT rev 2 extended proximity domain
values.

Also a first step to eliminate the redundant definitions of
ACPI provided table structures (Linux eliminated all of the duplicates
from include/linux/acpi.h in 2.6.21).

Portions based on a Linux patch from Kurt Garloff <garloff@suse.de>
and Alexey Starikovskiy <astarikovskiy@suse.de>.

IA64 build tested only.

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agox86-64: also handle virtual aliases of Xen image pages
Keir Fraser [Wed, 20 May 2009 14:38:34 +0000 (15:38 +0100)]
x86-64: also handle virtual aliases of Xen image pages

With the unification of the heaps, the pages freed from the Xen boot
image now can also end up being allocated to a domain, and hence the
respective aliases need handling when such pages get their
cacheability attributes changed.

Rather than establishing multiple mappings with non-WB attributes
(which temporarily still can cause aliasing issues), simply unmap
those pages from the Xen virtual space, and re-map them (to allow re-
establishing of eventual large page mappings) when the cachability
attribute for them gets restored to normal (WB).

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agox86: don't map more than the allocated space for frame_table
Keir Fraser [Wed, 20 May 2009 14:35:32 +0000 (15:35 +0100)]
x86: don't map more than the allocated space for frame_table

This is to avoid undue virtual address aliases in case the over-mapped
pages happen to get allocated to a domain, and then get their
cacheability attributes changed.

At the same time, use 1Gb mappings if possible and reasonable.

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agox86: some assorted irq related cleanups
Keir Fraser [Wed, 20 May 2009 14:30:36 +0000 (15:30 +0100)]
x86: some assorted irq related cleanups

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agoIntroduce __must_be_array
Keir Fraser [Wed, 20 May 2009 14:29:25 +0000 (15:29 +0100)]
Introduce __must_be_array

Cloning the similar construct from Linux, allowing to detect improper
uses of ARRAY_SIZE() at build time.

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agotools: get rid of hardcoded config dirs
Keir Fraser [Wed, 20 May 2009 14:27:30 +0000 (15:27 +0100)]
tools: get rid of hardcoded config dirs

Remove *all* hardcoded "/etc/xen" strings in python code.

Additionally, it removes pygrub_path from osdep.py. Its use has been
replaced with auxbin.pathTo("pygrub").

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agoxend: remove spurious pci_len from getDeviceSxprs
Keir Fraser [Wed, 20 May 2009 14:13:36 +0000 (15:13 +0100)]
xend: remove spurious pci_len from getDeviceSxprs

Cc: Zhai Edwin <edwin.zhai@intel.com>
Cc: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
Cc: Dexuan Cui <dexuan.cui@intel.com>
Signed-off-by: Simon Horman <horms@verge.net.au>
15 years agognttab: for sanity's sake, don't access the active grant table
Keir Fraser [Tue, 19 May 2009 22:44:28 +0000 (23:44 +0100)]
gnttab: for sanity's sake, don't access the active grant table
structure when we don't hold the lock.

Signed-off-by: Steven Smith <steven.smith@citrix.com>
15 years agognttab: Remove unused per-domain map_count field
Keir Fraser [Tue, 19 May 2009 22:30:32 +0000 (23:30 +0100)]
gnttab: Remove unused per-domain map_count field

Signed-off-by: Steven Smith <steven.smith@eu.citrix.com>
15 years agox86: Fix the P2M audit code.
Keir Fraser [Tue, 19 May 2009 22:28:25 +0000 (23:28 +0100)]
x86: Fix the P2M audit code.

It currently doesn't even compile; with this patch applied, it
compiles and didn't immediately explode as soon as I started a VM.
I've not given it much testing beyond that, though.

Signed-off-by: Steven Smith <steven.smith@citrix.com>
15 years agostubdom: Rebuild the ocaml runtime libraries with the options needed
Keir Fraser [Tue, 19 May 2009 13:17:56 +0000 (14:17 +0100)]
stubdom: Rebuild the ocaml runtime libraries with the options needed
if they are to be linked with object files created by ocamlc and the minios
kernel.

This is needed to build stubdoms written in ocaml.

Signed-off-by: Alex Zeffertt <alex.zeffertt@eu.citrix.com>
15 years agominios: Implement some extra lib functions for ocaml xenstored stubdom
Keir Fraser [Tue, 19 May 2009 13:15:43 +0000 (14:15 +0100)]
minios: Implement some extra lib functions for ocaml xenstored stubdom

We could stub them out as unsupported, but we may as well implement
them as they are very simple.

Signed-off-by: Alex Zeffertt <alex.zeffertt@eu.citrix.com>
15 years agominios: Define extra stub functions to allow build of ocaml xenstored stubdom
Keir Fraser [Tue, 19 May 2009 13:14:41 +0000 (14:14 +0100)]
minios: Define extra stub functions to allow build of ocaml xenstored stubdom

These are referenced by the libs that ocamlopt links xenstored to, but
ought not actually get called.

Signed-off-by: Alex Zeffertt <alex.zeffertt@eu.citrix.com>
15 years agoocaml-xenstored: Allow to build ocaml xenstored instead of C version
Keir Fraser [Tue, 19 May 2009 13:13:20 +0000 (14:13 +0100)]
ocaml-xenstored: Allow to build ocaml xenstored instead of C version

To use, set CONFIG_OCAML_XENSTORED=y at build time.
Then the build system will automatically download the remote repo
to tools/ocaml-xenstored.

Signed-off-by: Alex Zeffertt <alex.zeffertt@eu.citrix.com>
15 years agotools: Fix an obvious xgettext build typo.
Keir Fraser [Tue, 19 May 2009 12:38:05 +0000 (13:38 +0100)]
tools: Fix an obvious xgettext build typo.
Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agoxend: Fix xm pci commands for inactive managed domains.
Keir Fraser [Tue, 19 May 2009 01:23:32 +0000 (02:23 +0100)]
xend: Fix xm pci commands for inactive managed domains.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
15 years agoxend: Make hotplug script timeouts configurable
Keir Fraser [Tue, 19 May 2009 01:18:48 +0000 (02:18 +0100)]
xend: Make hotplug script timeouts configurable

In some configurations, when dom0 is busy with I/O, it may take
several minutes to complete all hotplug scripts required when a new
domain is being created. As device create timeout is set to 100
seconds, users get "hotplug scripts not working" error instead of a
new domain.

This patch makes both DEVICE_CREATE_TIMEOUT and DEVICE_DESTROY_TIMEOUT
configurable in xend-config.sxp to allow users to easily adapt hotplug
timeouts to their environment.

Signed-off-by: Jiri Denemark <jdenemar@redhat.com>
15 years agopython: get rid of hardcoded search pathes in python code.
Keir Fraser [Tue, 19 May 2009 01:16:37 +0000 (02:16 +0100)]
python: get rid of hardcoded search pathes in python code.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agox86 hvm: Correctly emulate task switches into vm86 mode.
Keir Fraser [Tue, 19 May 2009 01:12:04 +0000 (02:12 +0100)]
x86 hvm: Correctly emulate task switches into vm86 mode.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agox86_emulate: Emulate LLDT and LTR instructions.
Keir Fraser [Tue, 19 May 2009 01:09:36 +0000 (02:09 +0100)]
x86_emulate: Emulate LLDT and LTR instructions.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agoxentrace: Trace IRQs and entry/exit timestamps.
Keir Fraser [Tue, 19 May 2009 01:08:01 +0000 (02:08 +0100)]
xentrace: Trace IRQs and entry/exit timestamps.
From: Dulloor <dulloor@gmail.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agoxend: adjust relocation buffer size
Keir Fraser [Tue, 19 May 2009 00:50:00 +0000 (01:50 +0100)]
xend: adjust relocation buffer size

This can greatly improve ssl relocation performance (to about 1/3
compared with buffersize 1024).

Signed-off-by: Zhigang Wang <zhigang.x.wang@oracle.com>
15 years agotools: clean up handling of xen config and scripts directories.
Keir Fraser [Tue, 19 May 2009 00:48:18 +0000 (01:48 +0100)]
tools: clean up handling of xen config and scripts directories.

For now hardcode /etc w/o a prefix as there are hardcoded
config paths in the code which would break otherwise.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agoxend: solve issues with xm block-configure command.
Keir Fraser [Tue, 19 May 2009 00:37:19 +0000 (01:37 +0100)]
xend: solve issues with xm block-configure command.

In the case of inactive managed domains:
 The following error occurs currently.  We cannot change the
 configuration of the VBD by using xm block-configure.  Of course,
 using xm block-detach and xm block-attach instead of xm
 block-configure, we can change it. However, I'd like to change it by
 using xm block-configure.

In the case of active domains:
 Another problem occurs after a domain was rebooted.  Even if we
 change a configuration of a VBD in the domain by using xm
 block-configure, the configuration of the VBD is reverted to previous
 configuration after the domain was rebooted.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
15 years agoxend: Avoid deprecation warnings with Python 2.6.
Keir Fraser [Tue, 19 May 2009 00:34:34 +0000 (01:34 +0100)]
xend: Avoid deprecation warnings with Python 2.6.

Signed-off-by: Brendan Cully <brendan@cs.ubc.ca>
15 years agox86, cpufreq: fix ondemand governor to take aperf/mperf feedback
Keir Fraser [Tue, 19 May 2009 00:31:26 +0000 (01:31 +0100)]
x86, cpufreq: fix ondemand governor to take aperf/mperf feedback

APERF/MPERF MSRs provides feedback about actual freq in
eplased time, which could be different from requested freq by
governor. However currently ondemand governor only takes that
feedback at freq down path. We should do that for scale up too.

Signed-off-by: Liu, Jinsong <jinsong.liu@intel.com>
15 years agoioemu: Export PREFIX variable to ioemu build environment.
Keir Fraser [Tue, 19 May 2009 00:27:36 +0000 (01:27 +0100)]
ioemu: Export PREFIX variable to ioemu build environment.
Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
15 years agox86 hvm: Provide x2APIC identifier in CPUID leaf 0xb.
Keir Fraser [Tue, 19 May 2009 00:25:59 +0000 (01:25 +0100)]
x86 hvm: Provide x2APIC identifier in CPUID leaf 0xb.

Signed-off-by: Nitin A Kamble <nitin.a.kamble@intel.com>
15 years agostubdom: fix issue with phy disks
Keir Fraser [Tue, 19 May 2009 00:18:35 +0000 (01:18 +0100)]
stubdom: fix issue with phy disks

Add an exception for stubdoms in the same_vm hotplug script function.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
15 years agox86: Use PGC_cacheattr_mask rather than a hardcoded constant.
Keir Fraser [Tue, 19 May 2009 00:17:16 +0000 (01:17 +0100)]
x86: Use PGC_cacheattr_mask rather than a hardcoded constant.

Signed-off-by: Patrick Colp <Patrick.Colp@citrix.com>
15 years agoUpdate README for 3.4 branch.
Keir Fraser [Mon, 18 May 2009 12:53:55 +0000 (13:53 +0100)]
Update README for 3.4 branch.
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agovt-d: Fix interrupt remapping for multiple IOAPICs
Keir Fraser [Fri, 15 May 2009 07:12:39 +0000 (08:12 +0100)]
vt-d: Fix interrupt remapping for multiple IOAPICs

Current IOAPIC interrupt remapping code assumes there is only one
IOAPIC in system. It brings problem when there are more than one
IOAPIC in system. This patch extends ioapic_pin_to_intremap_index[]
array to handle multiple IOAPICs case.

Signed-off-by: Weidong Han <weidong.han@intel.com>
15 years agocpuidle: Fix possible false judge caused by type casting
Keir Fraser [Thu, 14 May 2009 14:46:43 +0000 (15:46 +0100)]
cpuidle: Fix possible false judge caused by type casting

For timer_deadline == 0 or timer_deadline - now > largest u32
case, the expected_us (in u32 type) may become wrong.

Add a tunable option to ease conditional adjustment.

Signed-off-by: Wei Gang <gang.wei@intel.com>
15 years agoxen public: make mmuext_op's vcpumask field const
Keir Fraser [Thu, 14 May 2009 14:46:04 +0000 (15:46 +0100)]
xen public: make mmuext_op's vcpumask field const

Linux started to pass around pointers to 'const cpumask_t' a while ago,
and passing such a pointer to set_xen_guest_handle() requires that the
field be a handle for a constant type in order to avoid compiler
warnings.

Signed-off-by: Jan Beulich <jbeulich@novell.com>
15 years agox86 vmx: Ensure debug-mode intercept for int3 and debug exceptions are
Keir Fraser [Wed, 13 May 2009 09:39:44 +0000 (10:39 +0100)]
x86 vmx: Ensure debug-mode intercept for int3 and debug exceptions are
reinstated when resetting EXCEPTION_BIRTMAP entry in VMCS after
exiting real mode.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
15 years agopassthrough: Fix PCI hot-plug option parsing
Keir Fraser [Wed, 13 May 2009 09:28:35 +0000 (10:28 +0100)]
passthrough: Fix PCI hot-plug option parsing

When a PCI function is passed-through extra options may be passed
through.
In the case of boot-time PCI pass-through the documented format is:
    [dom:]bus:dev.slot[@vslot][[,opt]...]
    e.g.
    00:01.00.1@7,msitranslate=3D1

In the case of PCI hot-plug the xm pci-attach command take the
following arguments:
    [-o opt[,opt]...] [dom:]bus:dev.slot [vslot]
    e.g.
    -o msitranslate=3D1 00:01.00.1 7

    These xm ends up passing these to xem-qemu as:
    [dom:]bus:dev.slot[[,opt]...][@vslot]
    e.g.
    00:01.00.1,msitranslate=3D1@7

    Note that the option and the vslot have are transposed when
    compared to the format used by boot-time PCI pass-through.

The parser inside qemu-xen can only handle the format used by
boot-time PCI pass-through and because of this ignores
any options passed by hot-plug.

This patch alters format used by hot-plug to match the parser.

Signed-off-by: Simon Horman <horms@verge.net.au>
15 years agox86: update ioport_quirks_tbl[] for the proliant BL2xx
Keir Fraser [Mon, 11 May 2009 12:52:04 +0000 (13:52 +0100)]
x86: update ioport_quirks_tbl[] for the proliant BL2xx

Signed-off-by: Ian Campbell <ian.campbell@citrix.com>
15 years agostubdom docs: Update to reflect the new way of configuring stubdom domains.
Keir Fraser [Mon, 11 May 2009 12:50:34 +0000 (13:50 +0100)]
stubdom docs: Update to reflect the new way of configuring stubdom domains.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agox86 hvm: hvm_set_callback_irq_level() must not be called in IRQ
Keir Fraser [Fri, 8 May 2009 10:50:12 +0000 (11:50 +0100)]
x86 hvm: hvm_set_callback_irq_level() must not be called in IRQ
context or with IRQs disabled. Ensure this by deferring to tasklet
(softirq) context if required.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoPermit user to suppress passing --prefix to setup.py
Keir Fraser [Thu, 7 May 2009 18:32:10 +0000 (19:32 +0100)]
Permit user to suppress passing --prefix to setup.py

We change all invocations of setup.py as follows:

* use $(PYTHON) instead of `python' so that the user can specify
  an alternative python version if they need to.  If not set it
  defaults to `python' in Config.mk.

* pass --prefix=$(PREFIX) via a new make variable
  $(PYTHON_PREFIX_ARG).  This allows a user to suppress the
  --prefix=... argument entirely by setting PYTHON_PREFIX_ARG=''.
  This will work around the bug described here
     https://bugs.launchpad.net/ubuntu/+bug/362570
  where passing --prefix=/usr/local (which ought to have no effect as
  /usr/local is the default prefix) changes which subdirectory
  distutils chooses, and results in the files being installed in
  site-packages which is not on the default search path.

  Users not affected by this python packaging bug should not set
  PYTHON_PREFIX_ARG and their builds will not be affected.  (Provided
  PREFIX did not contain spaces.  People who put spaces in PREFIX are
  being quite optimistic.)

Signed-off-by: Ian Jackson <ian.jackson@eu.citrix.com>
16 years agoxend: fix the pci BDF format string
Keir Fraser [Thu, 7 May 2009 09:03:18 +0000 (10:03 +0100)]
xend: fix the pci BDF format string

Fix the typo introduced by changeset 19446:2ef77b4bdf58.

Signed-off-by: Dexuan Cui <dexuan.cui@intel.com>
16 years agox86: Extend VMX_PERF_EXIT_REASON_SIZE to sync with latest SDM
Keir Fraser [Wed, 6 May 2009 11:40:51 +0000 (12:40 +0100)]
x86: Extend VMX_PERF_EXIT_REASON_SIZE to sync with latest SDM

Signed-off-by: Yang Xiaowei <xiaowei.yang@intel.com>
16 years agox86: Fix a couple of asm constraints.
Keir Fraser [Mon, 4 May 2009 21:52:18 +0000 (22:52 +0100)]
x86: Fix a couple of asm constraints.
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoUpdate Xen version for 3.4.0-rc4-pre
Keir Fraser [Thu, 30 Apr 2009 13:34:15 +0000 (14:34 +0100)]
Update Xen version for 3.4.0-rc4-pre

16 years agoAdded tag 3.4.0-rc3 for changeset 22e01301ff64
Keir Fraser [Thu, 30 Apr 2009 13:33:45 +0000 (14:33 +0100)]
Added tag 3.4.0-rc3 for changeset 22e01301ff64

16 years agoUpdate Xen version for 3.4.0-rc3
Keir Fraser [Thu, 30 Apr 2009 13:33:32 +0000 (14:33 +0100)]
Update Xen version for 3.4.0-rc3

16 years agocpuidle: Fix for timer_deadline==0 case
Keir Fraser [Thu, 30 Apr 2009 09:05:00 +0000 (10:05 +0100)]
cpuidle: Fix for timer_deadline==0 case

After the scheduler timer became suspended before entering cpu idle
state, the percpu timer_deadline is possible to be 0, i.e. no soft
timer in the queue. This case will cause unexpected large residency
percentage in C1 for the purely idle cpu.

Signed-off-by: Wei Gang <gang.wei@intel.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoxenstore: Add brief notice regarding xenstore protocol limits to xs.h.
Keir Fraser [Thu, 30 Apr 2009 08:43:29 +0000 (09:43 +0100)]
xenstore: Add brief notice regarding xenstore protocol limits to xs.h.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86_32: Fix build after e820 verbosity printks.
Keir Fraser [Wed, 29 Apr 2009 12:54:57 +0000 (13:54 +0100)]
x86_32: Fix build after e820 verbosity printks.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86 shadow: fix audit mode.
Keir Fraser [Wed, 29 Apr 2009 10:53:23 +0000 (11:53 +0100)]
x86 shadow: fix audit mode.

Splintered L1s can have the dirty bit clean because of vram tracking.

Signed-off-by: Gianluca Guida <gianluca.guida@eu.citrix.com>
16 years agox86: Add boot options regarding e820 parsinga nd clipping.
Keir Fraser [Wed, 29 Apr 2009 10:52:14 +0000 (11:52 +0100)]
x86: Add boot options regarding e820 parsinga nd clipping.

 [no-]e820-mtrr-clip: Clip e820 RAM areas to top-of-memory according
 to MTRRs. Prefix no- to disable the clip.

 e820-verbose: Extra e820 info printed during boot.

Signed-off-by: George Dunlap <george.dunlap@eu.citrix.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86 vmx: Reuse idtv_info to avoid double read of IDT_VECTORING.
Keir Fraser [Tue, 28 Apr 2009 12:45:08 +0000 (13:45 +0100)]
x86 vmx: Reuse idtv_info to avoid double read of IDT_VECTORING.

Signed-off-by: Eddie Dong <eddie.dong@intel.com>
16 years agoUpdate XEN_LINUX_GIT_REMOTEBRANCH to match changes made in upstream
Keir Fraser [Mon, 27 Apr 2009 17:59:56 +0000 (18:59 +0100)]
Update XEN_LINUX_GIT_REMOTEBRANCH to match changes made in upstream
repo. Needed if you want setting KERNELS=linux-2.6-pvops in
config/Linux.mk to work.

Signed-off-by: Alex Zeffert <alex.zeffert@eu.citrix.com>
Acked-by: Ian Campbell <ian.campbell@citrix.com>
16 years agoxend: Fix for op_pincpu in SrvDomain.py
Keir Fraser [Mon, 27 Apr 2009 14:42:38 +0000 (15:42 +0100)]
xend: Fix for op_pincpu in SrvDomain.py

op_pincpu method in SrvDomain.py does not currently work because
op_pincpu method gives string objects to a cpumap argument of
domain_pincpu method in XendDomain.py though the cpumap argument
expects list objects.

This patch solves the above problem as follows.

op_pincpu method gives string objects to the cpumap argument as is,
because op_pincpu method cannot give list objects to the cpumap
argument.
Instead, domain_pincpu method expects that the cpumap argument is
string objects, then domain_pincpu method converts the cpumap
argument into list objects.
Also, the patch modifies two methods (except for op_pincpu method)
calling domain_pincpu method.  The methods give string objects to
the cpumap argument instead of list objects.

Signed-off-by: Masaki Kanno <kanno.masaki@jp.fujitsu.com>
16 years agoxend: fix cset 19547:b2c43b0fba71 (a managed domain may not shutdown)
Keir Fraser [Mon, 27 Apr 2009 14:41:28 +0000 (15:41 +0100)]
xend: fix cset 19547:b2c43b0fba71 (a managed domain may not shutdown)

Signed-off-by: Kouya Shimura <kouya@jp.fujitsu.com>
16 years agox86 hvm: During task switch, LDT must be loaded before all other
Keir Fraser [Mon, 27 Apr 2009 14:40:09 +0000 (15:40 +0100)]
x86 hvm: During task switch, LDT must be loaded before all other
segments, as some may be loaded from the LDT.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86 vtd: Do not exclude freed percpu areas from dom0 vtd tables.
Keir Fraser [Sat, 25 Apr 2009 07:19:14 +0000 (08:19 +0100)]
x86 vtd: Do not exclude freed percpu areas from dom0 vtd tables.

Signed-off-by: Qing He <qing.he@intel.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86: Detect bogus BIOS e820 tables which are not fully covered as
Keir Fraser [Fri, 24 Apr 2009 16:51:56 +0000 (17:51 +0100)]
x86: Detect bogus BIOS e820 tables which are not fully covered as
cacheable (WB) by MTRRs. Fix up by truncating the memory map.

This fixes performance issues on certain systems.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoxentrace: Clean up HVM I/O tracing.
Keir Fraser [Fri, 24 Apr 2009 12:46:17 +0000 (13:46 +0100)]
xentrace: Clean up HVM I/O tracing.

Signed-off-by: Andre Przywara <andre.przywara@amd.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoxentrace: fix trivial typo.
Keir Fraser [Fri, 24 Apr 2009 12:20:55 +0000 (13:20 +0100)]
xentrace: fix trivial typo.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agoxend: modify the order of resetting/deassigning device.
Keir Fraser [Fri, 24 Apr 2009 12:19:37 +0000 (13:19 +0100)]
xend: modify the order of resetting/deassigning device.

  When guest OS shutdowns or a device is hotremoved.
   1. Xend resets devices.
   2. Xend deassigns devices.

Because if devices are deassigned before the reset, dom0 memory may be
overwritten by DMA.

Signed-off-by: Yuji Shimada <shimada-yxb@necst.nec.co.jp>
16 years agostubdoms: handle cdrom devices in the config file
Keir Fraser [Thu, 23 Apr 2009 15:22:48 +0000 (16:22 +0100)]
stubdoms: handle cdrom devices in the config file

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agox86 numa: Fix left shift overflows
Keir Fraser [Thu, 23 Apr 2009 11:21:29 +0000 (12:21 +0100)]
x86 numa: Fix left shift overflows

Signed-off-by: Yang Xiaowei <xiaowei.yang@intel.com>
16 years agostubdom: drop lwip support
Keir Fraser [Thu, 23 Apr 2009 11:19:14 +0000 (12:19 +0100)]
stubdom: drop lwip support

Network support is still provided the same way: using the tap
interface, created in qemu using netfront.
The lwip stack is still available to avoid additional compilation
issues.
However the stubdom is not going to have its own vif anymore,
this means that the only vnc server supported is the one in dom0.
You can still enable the vnc server in a stubdom at compile time, if
you want so.

Probably the most important change caused by this patch to xen users
is that you don't have to specify two vif in the stubdom config file
anymore, but just one:

-vif = [ '', 'ip=10.0.1.1,mac=aa:00:00:12:23:34']
+vif = ['ip=10.0.1.1,mac=aa:00:00:12:23:34']

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agoxm: Fix hot-unplug of statically-assigned devices
Keir Fraser [Thu, 23 Apr 2009 09:11:33 +0000 (10:11 +0100)]
xm: Fix hot-unplug of statically-assigned devices

Prior to changset 19510:5c69f98c348e - 'xm, xend: Replace "vslt" with
"vslot"', both vslt and vslot were used in the xm code, often fairly
arbitrarily.

However, in the dictionary that describes a pci function both vslt and
vslot were present. vslt stored the slot assigned to the function. And
vslot stored the slot the user requested for the function, or
AUTO_PHP_SLOT if no slot was requested.

With the renaming these two values got merged into a single entry.
This patch un-merges them by renaming the what was vslot to
requested_vslot.

So an out of chronological order list of name changes is:

'vslot' -> 'requested_vslot'
'vslt'  -> 'vslot'

Signed-off-by: Simon Horman <horms@verge.net.au>
16 years agox86: Remove bogus vtd frametable range check
Keir Fraser [Thu, 23 Apr 2009 09:10:11 +0000 (10:10 +0100)]
x86: Remove bogus vtd frametable range check

Signed-off-by: Joseph Cihula <joseph.cihula@intel.com>
16 years agoxend: Do not overwrite xauthority and display with empty values
Keir Fraser [Thu, 23 Apr 2009 09:09:16 +0000 (10:09 +0100)]
xend: Do not overwrite xauthority and display with empty values

Display and xauthority vars are read from vmConfig['platform'] first,
then they are read again from dev_info.
However if the user does not set those variable in the config file,
dev_info won't contain them, hence we are going to overwrite the
current significant values with null.
This patch fixes the problem setting display and xauthority to the
current values if dev_info does not contain them.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agostubdoms: generate stubdom config file
Keir Fraser [Thu, 23 Apr 2009 09:08:04 +0000 (10:08 +0100)]
stubdoms: generate stubdom config file

This patch removes the need for a second configuration file for
stubdoms: it is going to be automatically generated by the script
stubdom-dm using command line options and xenstore to find any needed
information.
The configuration script will be placed under /etc/xen/stubdoms and
automatically removed when the domain is destroyed.
The only change needed in xend is not to write on xenstore sdl,
opengl and serial command line options for qemu, because stubdoms do
not support them.
It is safe to remove those two options from xenstore because qemu does
not use xenstore to read commans line options.
Finally this patch fixes blkfront disconnections from backends and
display and xauthority variables for pv guests.

Signed-off-by: Stefano Stabellini <stefano.stabellini@eu.citrix.com>
16 years agoxc_save: fixes typo in error message.
Keir Fraser [Wed, 22 Apr 2009 12:46:50 +0000 (13:46 +0100)]
xc_save: fixes typo in error message.

Signed-off-by: Christoph Egger <Christoph.Egger@amd.com>
16 years agonetwork-bridge: Fix for bonding
Keir Fraser [Wed, 22 Apr 2009 10:26:37 +0000 (11:26 +0100)]
network-bridge: Fix for bonding

If ${netdev} is bonding, brctl addif ${bridge} ${pdev} fails:
  can't add ${pdev} to bridge ${bridge}: Invalid argument
Because ${pdev} has no slaves at this point.=20
# Notice that ifdown ${netdev} clears slaves of ${netdev}.

This patch restores slaves before add_to_bridge2 ${bridge} ${pdev}.

Signed-off-by: KUWAMURA Shin'ya <kuwa@jp.fujitsu.com>
16 years agointel vtd: Remove the BUG_ON in domain_context_mapping().
Keir Fraser [Tue, 21 Apr 2009 15:31:26 +0000 (16:31 +0100)]
intel vtd: Remove the BUG_ON in domain_context_mapping().

Signed-off-by: Dexuan Cui <dexuan.cui@intel.com>
16 years ago[IA64] fix early access to per cpu area.
Isaku Yamahata [Tue, 21 Apr 2009 09:27:59 +0000 (18:27 +0900)]
[IA64] fix early access to per cpu area.

The following changeset broke booting xen-ia64 on some kinds of ia64 boxes.
http://xenbits.xensource.com/ext/ia64/xen-unstable.hg/rev/3fd8f9b34941

The tasklet_schedule call raise_softirq().
Because raise_softirq() use per_cpu, if we access per_cpu before cpu_init()
the behavior would be unexpected.

There was a similar issue on Linux/ia64.
The following change sets resolved it.
10617bbe84628eb18ab5f723d3ba35005adde143
c459ce8b5a7d933a3bcf6915ab17ac1e036e2ac4

This patch fixes the issue following the linux/ia64 solution.
Allocate per cpu area for cpu0 in .data section and initialize
it early.

reported-by: Akio Takebe <takebe_akio@jp.fujitsu.com>
Signed-off-by: Isaku Yamahata <yamahata@valinux.co.jp>
--HG--
rename : xen/include/asm-ia64/linux/asm/sections.h => xen/include/asm-ia64/linux-xen/asm/sections.h

16 years agoxenconsoled: Fixes to domain management.
Keir Fraser [Mon, 20 Apr 2009 14:38:29 +0000 (15:38 +0100)]
xenconsoled: Fixes to domain management.

Event-channel setup: Re-bind if the connection becomes unbound (e.g.,
due to 'slow' domain suspend cancellation), even if the remote port
identifier has not changed.

Domain logging: Only open log file once (don't leak fds) and fix a
small memory leak.

Evtchn changes based on a patch by Jiri Denemark <jdenemar@redhat.com>

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agox86: fix next->vcpu_dirty_cpumask checking in context_switch()
Keir Fraser [Mon, 20 Apr 2009 13:56:31 +0000 (14:56 +0100)]
x86: fix next->vcpu_dirty_cpumask checking in context_switch()

There was a timing window where flush_tlb_mask() could be called with
an empty mask (triggering a WARN_ON() in send_IPI_mask_flat() along
with APIC errors) because rather than using the already taken snapshot
of next's vcpu_dirty_cpumask struct vcpu's field was used directly,
which can get its only bit cleared by remote CPUs.

Replacing the structure field's use by the local variable then made
the inner cpus_empty() check completely redundant with the one in the
surrounding if()'s condition.

Signed-off-by: Jan Beulich <jbeulich@novell.com>
16 years agoxend: The method union_update on a set was renamed to update in Python 2.3
Keir Fraser [Mon, 20 Apr 2009 12:48:31 +0000 (13:48 +0100)]
xend: The method union_update on a set was renamed to update in Python 2.3

From: Ruben Kerkhof <ruben@rubenkerkhof.com>
Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agoTXT: update Makefile to latest tboot tarfile
Keir Fraser [Fri, 17 Apr 2009 16:39:05 +0000 (17:39 +0100)]
TXT: update Makefile to latest tboot tarfile

This patch updates the Makefile to download the latest version of
tboot, which supports the interface changes made recently.  This
should go into 3.4, since 3.4 supports the new tboot interface.

Signed-off-by: Joseph Cihula <joseph.cihula@intel.com>
16 years agoAMD IOMMU: Fix ioapic interrupt remapping
Keir Fraser [Fri, 17 Apr 2009 12:16:39 +0000 (13:16 +0100)]
AMD IOMMU: Fix ioapic interrupt remapping

A few ioapic redirection entries are initialized by hypervisor before
enabling iommu hardware. This patch copies those entries from ioapic
redirection table into interrupt remapping table after interrupt
remapping table has been allocated.

Signed-off-by: Wei Wang <wei.wang2@amd.com>
16 years agocpuidle: Add support for Always Running APIC timer, CPUID_0x6_EAX_Bit2.
Keir Fraser [Fri, 17 Apr 2009 12:14:01 +0000 (13:14 +0100)]
cpuidle: Add support for Always Running APIC timer, CPUID_0x6_EAX_Bit2.

This bit means the APIC timer continues to run even when CPU is
in deep C-states.

The advantage is that we can use LAPIC timer on these CPUs
always, and there is no need for "slow to read and program"
external timers (HPET/PIT) and the timer broadcast logic
and related code in C-state entry and exit.

Refer to the latest Intel SDM Vol 2A
(http://www.intel.com/products/processor/manuals/index.htm)

Signed-off-by: Wei Gang <gang.wei@intel.com>
16 years agox86: avoid EPT scanning errors when splitting superpages during live migration
Keir Fraser [Fri, 17 Apr 2009 12:07:06 +0000 (13:07 +0100)]
x86: avoid EPT scanning errors when splitting superpages during live migration

Since Xen did not lock the p2m table for p2m table reading, when
splitting the large page during live migration, we should make sure
the path of EPT entries be modified are always there while other CPUs
may access the super entries at the same time.

Signed-off-by: Xin, Xiaohui <xiaohui.xin@intel.com>
16 years agoxend: clean up qemu-dm related items on domain destroy
Keir Fraser [Fri, 17 Apr 2009 12:04:47 +0000 (13:04 +0100)]
xend: clean up qemu-dm related items on domain destroy

Some qemu-dm related stuffs might be left behind after the domain is
destroyed.
 - xenstore entry, /local/domain/0/device-model/<domid>
 - named pipes, /var/run/tap/qemu-{read,write}-<domid>

Signed-off-by: Kouya Shimura <kouya@jp.fujitsu.com>
16 years agoUpdate Xen version for 3.4.0-rc3-pre
Keir Fraser [Thu, 16 Apr 2009 10:54:06 +0000 (11:54 +0100)]
Update Xen version for 3.4.0-rc3-pre

16 years agoAdded tag 3.4.0-rc2 for changeset 087854cf3ed9
Keir Fraser [Thu, 16 Apr 2009 10:53:46 +0000 (11:53 +0100)]
Added tag 3.4.0-rc2 for changeset 087854cf3ed9

16 years agoUpdate Xen version for 3.4.0-rc2
Keir Fraser [Thu, 16 Apr 2009 10:53:38 +0000 (11:53 +0100)]
Update Xen version for 3.4.0-rc2

16 years agocpuidle: Fix PV RTC I/O port access hook
Keir Fraser [Thu, 16 Apr 2009 10:52:08 +0000 (11:52 +0100)]
cpuidle: Fix PV RTC I/O port access hook

Call hook before writing to I/O ports. Only hook single-byte writes.

Signed-off-by: Keir Fraser <keir.fraser@citrix.com>
16 years agocpuidle: Enable hpet broadcast by default
Keir Fraser [Wed, 15 Apr 2009 14:47:25 +0000 (15:47 +0100)]
cpuidle: Enable hpet broadcast by default

And stop legacy hpet broadcast and limit max C-state to shallower
state if RTC interrupts are enabled.

Signed-off-by: Wei Gang <gang.wei@intel.com>
Signed-off-by: Tian Kevin <kevin.tian@intel.com>