xenbits.xensource.com Git - libvirt.git/log
11 years ago  Introduce new OOM testing support
Daniel P. Berrange [Mon, 23 Sep 2013 13:21:52 +0000 (14:21 +0100)]
Introduce new OOM testing support

The previous OOM testing support would re-run the entire "main"
method each iteration, failing a different malloc each time.
When a test suite has 'n' allocations, the number of repeats
required is (n * (n + 1)) / 2. This gets very large, very
quickly.

This new OOM testing support instead integrates at the
virtTestRun level, so each individual test case gets repeated,
instead of the entire test suite. This means the values of
'n' are orders of magnitude smaller.
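
As a rough illustration of how quickly the whole-suite count grows, the
formula quoted above can be evaluated for a few values of 'n'. This is a
minimal Python sketch, not libvirt code: the function name repeats_for is
made up for the example, and the values of 'n' other than 36 (the nalloc
figure reported in this commit's usage example) are assumptions.

```python
def repeats_for(n: int) -> int:
    """Evaluate the (n * (n + 1)) / 2 formula quoted in the commit message."""
    return n * (n + 1) // 2


# Hypothetical allocation counts: one small test case versus larger totals.
for n in (36, 1000, 100000):
    print(f"n={n:>6}  repeats={repeats_for(n)}")
```

The growth is quadratic in 'n', which is why keeping 'n' at the size of a
single test case matters so much.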

The simple usage is

   $ VIR_TEST_OOM=1 ./qemuxml2argvtest
   ...
   29) QEMU XML-2-ARGV clock-utc                                         ... OK
       Test OOM for nalloc=36 .................................... OK
   30) QEMU XML-2-ARGV clock-localtime                                   ... OK
       Test OOM for nalloc=36 .................................... OK
   31) QEMU XML-2-ARGV clock-france                                      ... OK
       Test OOM for nalloc=38 ...................................... OK
   ...

The second line of each pair reports how many mallocs have to be failed, and thus
how many repeats of the test will be run.

If it crashes, then running under valgrind will often show the problem

  $ VIR_TEST_OOM=1 ../run valgrind ./qemuxml2argvtest

When debugging problems it is also helpful to select an individual
test case

  $ VIR_TEST_RANGE=30 VIR_TEST_OOM=1 ../run valgrind ./qemuxml2argvtest

When things get really tricky, it is possible to request that just
specific allocs are failed, e.g. to fail allocs 5 through 12, use

  $ VIR_TEST_RANGE=30 VIR_TEST_OOM=1:5-12 ../run valgrind ./qemuxml2argvtest
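
To make the two value forms shown in this message concrete ("1" and
"1:START-END"), here is a small Python sketch of how such a value could be
split into a range. It is only an illustration under stated assumptions:
parse_oom_spec is a hypothetical helper, not libvirt's parser, and any form
other than the two shown here is rejected rather than guessed at.

```python
import re

# Matches only the two forms shown in the commit message:
#   "1"            -> fail every allocation in turn
#   "1:START-END"  -> fail only allocations START..END (inclusive)
_SPEC = re.compile(r"1(?::(\d+)-(\d+))?")


def parse_oom_spec(value: str):
    """Return None for "1", or a (start, end) tuple for "1:START-END"."""
    match = _SPEC.fullmatch(value)
    if match is None:
        raise ValueError(f"unsupported VIR_TEST_OOM value: {value!r}")
    if match.group(1) is None:
        return None
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError(f"empty range in VIR_TEST_OOM value: {value!r}")
    return (start, end)
```

With this sketch, "1:5-12" yields the range (5, 12) and "1:5-5" selects the
single allocation 5.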

In the worst case, you might want to know the stack trace of the
alloc which was failed; for that, VIR_TEST_OOM_TRACE can be set. If it
is set to 1 then it will only print if it thinks a mistake happened.
This is often not reliable, so setting it to 2 will make it print
the stack trace for every alloc that is failed.

  $ VIR_TEST_OOM_TRACE=2 VIR_TEST_RANGE=30 VIR_TEST_OOM=1:5-5 ../run valgrind ./qemuxml2argvtest
  30) QEMU XML-2-ARGV clock-localtime                                   ... OK
      Test OOM for nalloc=36 !virAllocN
  /home/berrange/src/virt/libvirt/src/util/viralloc.c:180
  virHashCreateFull
  /home/berrange/src/virt/libvirt/src/util/virhash.c:144
  virDomainDefParseXML
  /home/berrange/src/virt/libvirt/src/conf/domain_conf.c:11745
  virDomainDefParseNode
  /home/berrange/src/virt/libvirt/src/conf/domain_conf.c:12646
  virDomainDefParse
  /home/berrange/src/virt/libvirt/src/conf/domain_conf.c:12590
  testCompareXMLToArgvFiles
  /home/berrange/src/virt/libvirt/tests/qemuxml2argvtest.c:106
  virtTestRun
  /home/berrange/src/virt/libvirt/tests/testutils.c:250
  mymain
  /home/berrange/src/virt/libvirt/tests/qemuxml2argvtest.c:418 (discriminator 2)
  virtTestMain
  /home/berrange/src/virt/libvirt/tests/testutils.c:750
  ??
  ??:0
  _start
  ??:?
   FAILED

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Fix multiple bugs in LXC domainMemoryStats driver
Daniel P. Berrange [Thu, 20 Feb 2014 15:32:49 +0000 (15:32 +0000)]
Fix multiple bugs in LXC domainMemoryStats driver

The virCgroupXXX APIs' return value must be checked for
being less than 0, not equal to 0.

A VIR_ERR_OPERATION_INVALID error must also be raised
when the VM is not running to prevent a crash on a NULL
priv->cgroup field.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Add unit test for virCgroupGetPercpuStats.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:08 +0000 (18:49 +0100)]
Add unit test for virCgroupGetPercpuStats.

11 years ago  Fix misspelled cpuacct.usage_percpu in cgroup mock.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:07 +0000 (18:49 +0100)]
Fix misspelled cpuacct.usage_percpu in cgroup mock.

11 years ago  Add unit test for virCgroupGetMemoryUsage.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:06 +0000 (18:49 +0100)]
Add unit test for virCgroupGetMemoryUsage.

11 years ago  Add unit test for virCgroupGetBlkioIo*Serviced
Thorsten Behrens [Fri, 14 Feb 2014 17:49:05 +0000 (18:49 +0100)]
Add unit test for virCgroupGetBlkioIo*Serviced

11 years ago  Widening API change - accept empty path for virDomainBlockStats
Thorsten Behrens [Fri, 14 Feb 2014 17:49:04 +0000 (18:49 +0100)]
Widening API change - accept empty path for virDomainBlockStats

And provide domain summary stat in that case, for lxc backend.
Use case is a container inheriting all devices from the host,
e.g. when doing application containerization.

11 years ago  Implement lxcDomainBlockStats* for lxc driver
Thorsten Behrens [Fri, 14 Feb 2014 17:49:03 +0000 (18:49 +0100)]
Implement lxcDomainBlockStats* for lxc driver

Adds lxcDomainBlockStatsFlags and lxcDomainBlockStats functions.

11 years ago  Implement domainGetCPUStats for lxc driver.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:02 +0000 (18:49 +0100)]
Implement domainGetCPUStats for lxc driver.

11 years ago  Make qemuGetDomainTotalCPUStats a virCgroup function.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:01 +0000 (18:49 +0100)]
Make qemuGetDomainTotalCPUStats a virCgroup function.

To reuse this from other drivers, like lxc.

11 years ago  Implement domainMemoryStats API slot for LXC driver.
Thorsten Behrens [Fri, 14 Feb 2014 17:49:00 +0000 (18:49 +0100)]
Implement domainMemoryStats API slot for LXC driver.

11 years ago  Add util virCgroupGetBlkioIo*Serviced methods.
Thorsten Behrens [Fri, 14 Feb 2014 17:48:59 +0000 (18:48 +0100)]
Add util virCgroupGetBlkioIo*Serviced methods.

This reads blkio stats from blkio.throttle.io_service_bytes and
blkio.throttle.io_serviced.

11 years ago  virsh: fix memleak when starting a guest with invalid fd
Jincheng Miao [Thu, 20 Feb 2014 09:29:15 +0000 (17:29 +0800)]
virsh: fix memleak when starting a guest with invalid fd

When starting a guest with --pass-fd, if the argument of --pass-fd is invalid,
virsh will exit, but does not free the variable 'dom'.

Valgrind reports:
...
==24569== 63 (56 direct, 7 indirect) bytes in 1 blocks are definitely lost in loss record 130 of 234
==24569==    at 0x4C2A1D4: calloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
==24569==    by 0x4E879A4: virAllocVar (viralloc.c:544)
==24569==    by 0x4EBD625: virObjectNew (virobject.c:190)
==24569==    by 0x4F3A18A: virGetDomain (datatypes.c:226)
==24569==    by 0x4F9311F: remoteDomainLookupByName (remote_driver.c:6636)
==24569==    by 0x4F44F20: virDomainLookupByName (libvirt.c:2277)
==24569==    by 0x12F616: vshCommandOptDomainBy (virsh-domain.c:105)
==24569==    by 0x131C79: cmdStart (virsh-domain.c:3330)
==24569==    by 0x12C4AB: vshCommandRun (virsh.c:1752)
==24569==    by 0x127001: main (virsh.c:3218)

https://bugzilla.redhat.com/show_bug.cgi?id=1067338

Signed-off-by: Jincheng Miao <jmiao@redhat.com>
Signed-off-by: Eric Blake <eblake@redhat.com>
11 years ago  lxc: Add destroy support for suspended domains
Richard Weinberger [Fri, 14 Feb 2014 15:42:48 +0000 (16:42 +0100)]
lxc: Add destroy support for suspended domains

Destroying a suspended domain needs special action.
We cannot simply terminate all processes because they are frozen.
To deal with that we send them SIGKILL and thaw them.
Upon wakeup each process sees the pending signal and dies immediately.

Signed-off-by: Richard Weinberger <richard@nod.at>
11 years ago  Fix build of portallocator on mingw
Ján Tomko [Thu, 20 Feb 2014 09:04:30 +0000 (10:04 +0100)]
Fix build of portallocator on mingw

IN6ADDR_ANY_INIT does not seem to be working as expected on MinGW:
error: missing braces around initializer [-Werror=missing-braces]
         .sin6_addr = IN6ADDR_ANY_INIT,

Use the in6addr_any variable instead.

Reported by Daniel P. Berrange.

11 years ago  networkRunHook: Run hook only if possible
Michal Privoznik [Wed, 19 Feb 2014 13:55:23 +0000 (14:55 +0100)]
networkRunHook: Run hook only if possible

Currently, networkRunHook() is called in networkAllocateActualDevice and
friends. These functions, however, don't necessarily work on networks.
For example, if a domain's interface is defined in this fashion:

    <interface type='bridge'>
      <mac address='52:54:00:0b:3b:16'/>
      <source bridge='virbr1'/>
      <model type='rtl8139'/>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x09' function='0x0'/>
    </interface>

The networkAllocateActualDevice function jumps directly to the 'validate'
label as the interface is not of type 'network'. Hence, @network is left
initialized to NULL and networkRunHook(network, ...) is called. One of
the things that the hook function does is dereference @network. Sigh.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  libxl: use job functions in libxlDomainSetSchedulerParametersFlags
Jim Fehlig [Fri, 7 Feb 2014 00:29:09 +0000 (17:29 -0700)]
libxl: use job functions in libxlDomainSetSchedulerParametersFlags

Setting scheduler parameters is a modify operation that needs to wait
in the queue of modify jobs.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in libxlDomainSetAutostart
Jim Fehlig [Fri, 7 Feb 2014 00:24:48 +0000 (17:24 -0700)]
libxl: use job functions in libxlDomainSetAutostart

Setting autostart is a modify operation that needs to wait in the
queue of modify jobs.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in device attach and detach functions
Jim Fehlig [Fri, 7 Feb 2014 00:21:41 +0000 (17:21 -0700)]
libxl: use job functions in device attach and detach functions

These operations aren't necessarily time consuming, but need to
wait in the queue of modify jobs.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in vcpu set and pin functions
Jim Fehlig [Fri, 7 Feb 2014 00:16:14 +0000 (17:16 -0700)]
libxl: use job functions in vcpu set and pin functions

These operations aren't necessarily time consuming, but need to
wait in the queue of modify jobs.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in libxlDomainCoreDump
Jim Fehlig [Thu, 6 Feb 2014 23:54:39 +0000 (16:54 -0700)]
libxl: use job functions in libxlDomainCoreDump

Dumping a domain's core can take considerable time.  Use the
recently added job functions and unlock the virDomainObj while
dumping core.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in domain save operations
Jim Fehlig [Thu, 6 Feb 2014 23:34:58 +0000 (16:34 -0700)]
libxl: use job functions in domain save operations

Saving domain memory and cpu state can take considerable time.
Use the recently added job functions and unlock the virDomainObj
while saving the domain.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions when cleaning up a domain
Jim Fehlig [Wed, 12 Feb 2014 23:06:41 +0000 (16:06 -0700)]
libxl: use job functions when cleaning up a domain

When explicitly destroying a domain (libxlDomainDestroyFlags), or
handling an out-of-band domain shutdown event, clean up the domain
in the context of a job.  Introduce libxlVmCleanupJob to wrap
libxlVmCleanup in a job block.

11 years ago  libxl: use job functions in libxlDomain{Suspend,Resume}
Jim Fehlig [Thu, 6 Feb 2014 23:21:50 +0000 (16:21 -0700)]
libxl: use job functions in libxlDomain{Suspend,Resume}

These operations aren't necessarily time consuming, but need to
wait in the queue of modify jobs.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in libxlDomainSetMemoryFlags
Jim Fehlig [Thu, 6 Feb 2014 23:10:25 +0000 (16:10 -0700)]
libxl: use job functions in libxlDomainSetMemoryFlags

Large balloon operations can be time consuming.  Use the recently
added job functions and unlock the virDomainObj while ballooning.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: use job functions in libxlVmStart
Jim Fehlig [Thu, 6 Feb 2014 22:21:36 +0000 (15:21 -0700)]
libxl: use job functions in libxlVmStart

Creating a large domain could potentially be time consuming.  Use the
recently added job functions and unlock the virDomainObj while
the create operation is in progress.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: Add job support to libxl driver
Jim Fehlig [Thu, 19 Dec 2013 05:54:39 +0000 (13:54 +0800)]
libxl: Add job support to libxl driver

Follows the pattern used in the QEMU driver for managing multiple,
simultaneous jobs within the driver.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: remove libxlVmReap function
Jim Fehlig [Wed, 12 Feb 2014 22:22:18 +0000 (15:22 -0700)]
libxl: remove libxlVmReap function

This function, which only has five call sites, simply calls
libxl_domain_destroy and libxlVmCleanup.  Call those functions
directly at the call sites, allowing more control over how a
domain is destroyed and cleaned up.  This patch maintains the
existing semantics, leaving changes to a subsequent patch.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  libxl: always set vm id to -1 on shutdown
Jim Fehlig [Wed, 12 Feb 2014 21:59:13 +0000 (14:59 -0700)]
libxl: always set vm id to -1 on shutdown

Once a domain has reached the shutdown state, set its ID to -1.

Signed-off-by: Jim Fehlig <jfehlig@suse.com>
11 years ago  qemu: Use virtio network device for aarch64/virt
Oleg Strikov [Fri, 14 Feb 2014 14:09:00 +0000 (18:09 +0400)]
qemu: Use virtio network device for aarch64/virt

This patch changes the network device type used by default from rtl8139
to virtio when the architecture type is aarch64 and the machine type is virt.
QEMU doesn't support any other machine types for aarch64 right now and
we can't make any other aarch64-specific tuning in this function yet.

Signed-off-by: Oleg Strikov <oleg.strikov@canonical.com>
11 years ago  bhyve: add a basic driver
Roman Bogorodskiy [Tue, 18 Feb 2014 10:08:10 +0000 (14:08 +0400)]
bhyve: add a basic driver

At this point it has limited functionality and is highly
experimental. Supported domain operations are:

  * define
  * start
  * destroy
  * dumpxml
  * dominfo

It's only possible to have one disk device and one
network, which should be of type bridge.

11 years ago  Add a default USB keyboard and USB mouse for PPC64
Li Zhang [Mon, 17 Feb 2014 10:17:58 +0000 (18:17 +0800)]
Add a default USB keyboard and USB mouse for PPC64

There is no working keyboard on PPC64, and the PS2 mouse is only
for X86, when graphics are enabled.

Add a USB keyboard and USB mouse for PPC64 when graphics are enabled.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  xen: format xen config for USB keyboard
Li Zhang [Mon, 17 Feb 2014 10:17:57 +0000 (18:17 +0800)]
xen: format xen config for USB keyboard

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  qemu: format qemu command line for USB keyboard
Li Zhang [Mon, 17 Feb 2014 10:17:56 +0000 (18:17 +0800)]
qemu: format qemu command line for USB keyboard

Format qemu command line for USB keyboard
and add test cases for it.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  qemu: Add USB keyboard capability
Li Zhang [Mon, 17 Feb 2014 10:17:55 +0000 (18:17 +0800)]
qemu: Add USB keyboard capability

Add USB keyboard capability probing and test cases.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  conf: Remove the implicit PS2 devices for non-X86 platforms
Li Zhang [Mon, 17 Feb 2014 10:17:54 +0000 (18:17 +0800)]
conf: Remove the implicit PS2 devices for non-X86 platforms

PS2 devices only work on the X86 platform; other platforms may need
USB devices instead. Although it doesn't influence the QEMU command line,
it's not right to add a PS2 mouse/keyboard for non-X86 platforms.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  conf: Add keyboard input device type
Li Zhang [Mon, 17 Feb 2014 10:17:53 +0000 (18:17 +0800)]
conf: Add keyboard input device type

There is no keyboard support currently in libvirt.

For some platforms (PPC64 QEMU) this makes graphics unusable,
since the keyboard is not implicit and it can't be added via libvirt.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  conf: Add one interface to add default input devices
Li Zhang [Mon, 17 Feb 2014 10:17:52 +0000 (18:17 +0800)]
conf: Add one interface to add default input devices

Use it for the default mouse.

Signed-off-by: Li Zhang <zhlcindy@linux.vnet.ibm.com>
Signed-off-by: Ján Tomko <jtomko@redhat.com>
11 years ago  bridge_driver.h: Fix build --without-network
Michal Privoznik [Tue, 18 Feb 2014 17:40:28 +0000 (18:40 +0100)]
bridge_driver.h: Fix build --without-network

The networkNotifyActualDevice function accepts two arguments, not
one:

qemu/qemu_process.c: In function 'qemuProcessNotifyNets':
qemu/qemu_process.c:2776:47: error: macro "networkNotifyActualDevice" passed 2 arguments, but takes just 1
         if (networkNotifyActualDevice(def, net) < 0)
                                               ^

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  Fix conflicting types of virInitctlSetRunLevel
Ján Tomko [Tue, 18 Feb 2014 14:01:32 +0000 (15:01 +0100)]
Fix conflicting types of virInitctlSetRunLevel

aebbcdd didn't change the non-linux definition of the function,
breaking the build on FreeBSD:

../../src/util/virinitctl.c:164: error: conflicting types for
'virInitctlSetRunLevel'
../../src/util/virinitctl.h:40: error: previous declaration of
'virInitctlSetRunLevel' was here

11 years ago  network: Taint networks that are using hook script
Michal Privoznik [Tue, 4 Feb 2014 16:36:54 +0000 (17:36 +0100)]
network: Taint networks that are using hook script

Basically, the idea is copied from the domain code, where tainting
has existed for a while. Currently, only one taint reason exists,
VIR_NETWORK_TAINT_HOOK, to mark those networks which caused a hook
script to be invoked.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  network: Introduce network hooks
Michal Privoznik [Fri, 31 Jan 2014 15:48:06 +0000 (16:48 +0100)]
network: Introduce network hooks

There might be some use cases where a user wants to prepare the host or
its environment prior to starting a network and do some cleanup after
the network has been shut down. Consider all the functionality that
libvirt doesn't currently have as an example of what a hook script can
possibly do.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  network_conf: Expose virNetworkDefFormatInternal
Michal Privoznik [Wed, 12 Feb 2014 16:36:35 +0000 (17:36 +0100)]
network_conf: Expose virNetworkDefFormatInternal

In the next patch I'm going to need the network format function that
takes a virBuffer as an argument. However, a slight change of name is
then more appropriate: virNetworkDefFormatBuf, to match the rest of the
functions that format an object to a buffer.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC hotunplug code
Daniel P. Berrange [Thu, 30 Jan 2014 17:58:36 +0000 (17:58 +0000)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC hotunplug code

Rewrite multiple hotunplug functions to use the
virProcessRunInMountNamespace helper. This avoids
risk of a malicious guest replacing /dev with an absolute
symlink, tricking the driver into changing the host OS
filesystem.
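
The class of problem described here can be reproduced harmlessly with
temporary directories. The following Python sketch is an illustration only
and not libvirt code: the directory names and the function name
demo_symlink_escape are made up, and a throwaway directory stands in for the
host filesystem. It shows that prefixing a guest root onto a path does not
contain the access once the guest has replaced a directory with an absolute
symlink.

```python
import os
import tempfile


def demo_symlink_escape():
    """Return (file_landed_outside_guest, path_stays_inside_guest)."""
    with tempfile.TemporaryDirectory() as tmp:
        host_dir = os.path.join(tmp, "host-target")   # stands in for the host
        guest_root = os.path.join(tmp, "guest-root")  # stands in for the guest root
        os.mkdir(host_dir)
        os.mkdir(guest_root)

        # The "guest" replaces its /dev with an absolute symlink.
        os.symlink(host_dir, os.path.join(guest_root, "dev"))

        # A host-side process naively prefixes the guest root onto the path.
        naive_path = os.path.join(guest_root, "dev", "sda")
        with open(naive_path, "w") as handle:
            handle.write("created via guest path\n")

        landed_outside = os.path.exists(os.path.join(host_dir, "sda"))
        resolved = os.path.realpath(naive_path)
        inside_guest = resolved.startswith(os.path.realpath(guest_root) + os.sep)
        return landed_outside, inside_guest


print(demo_symlink_escape())
```

The file ends up in the directory the symlink points at, outside the guest
root, which is the behaviour the namespace-based helper is meant to avoid.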

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC chardev hostdev hotplug
Daniel P. Berrange [Thu, 30 Jan 2014 17:47:39 +0000 (17:47 +0000)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC chardev hostdev hotplug

Rewrite lxcDomainAttachDeviceHostdevMiscLive function
to use the virProcessRunInMountNamespace helper. This avoids
risk of a malicious guest replacing /dev with an absolute
symlink, tricking the driver into changing the host OS
filesystem.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC block hostdev hotplug
Daniel P. Berrange [Thu, 30 Jan 2014 17:45:08 +0000 (17:45 +0000)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC block hostdev hotplug

Rewrite lxcDomainAttachDeviceHostdevStorageLive function
to use the virProcessRunInMountNamespace helper. This avoids
risk of a malicious guest replacing /dev with an absolute
symlink, tricking the driver into changing the host OS
filesystem.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC USB hotplug
Daniel P. Berrange [Thu, 30 Jan 2014 16:34:19 +0000 (16:34 +0000)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC USB hotplug

Rewrite lxcDomainAttachDeviceHostdevSubsysUSBLive function
to use the virProcessRunInMountNamespace helper. This avoids
risk of a malicious guest replacing /dev with an absolute
symlink, tricking the driver into changing the host OS
filesystem.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC disk hotplug
Daniel P. Berrange [Thu, 30 Jan 2014 15:59:20 +0000 (15:59 +0000)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC disk hotplug

Rewrite lxcDomainAttachDeviceDiskLive function to use the
virProcessRunInMountNamespace helper. This avoids risk of
a malicious guest replacing /dev with an absolute symlink,
tricking the driver into changing the host OS filesystem.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC shutdown/reboot code
Eric Blake [Tue, 24 Dec 2013 05:55:51 +0000 (22:55 -0700)]
CVE-2013-6456: Avoid unsafe use of /proc/$PID/root in LXC shutdown/reboot code

Use helper virProcessRunInMountNamespace in lxcDomainShutdownFlags and
lxcDomainReboot.  Otherwise, a malicious guest could use symlinks
to force the host to manipulate the wrong file in the host's namespace.

Idea by Dan Berrange, based on an initial report by Reco
<recoverym4n@gmail.com> at
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=732394

Signed-off-by: Eric Blake <eblake@redhat.com>
11 years ago  Add helper for running code in separate namespaces
Daniel P. Berrange [Thu, 30 Jan 2014 13:11:23 +0000 (13:11 +0000)]
Add helper for running code in separate namespaces

Implement virProcessRunInMountNamespace, which runs callback of type
virProcessNamespaceCallback in a container namespace. This uses a
child process to run the callback, since you can't change the mount
namespace of a thread. This implies that callbacks have to be careful
about what code they run due to async safety rules.
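
The general shape of "run a callback in a child process and report its
result" can be sketched as follows. This is a simplified Python illustration
under stated assumptions, not the libvirt helper: run_in_child is a
hypothetical name, the step that joins the container's mount namespace is
deliberately omitted, and the callback convention (return 0 for success) is
an assumption made for the example.

```python
import os


def run_in_child(callback) -> int:
    """Run callback() in a forked child and return the child's exit code."""
    pid = os.fork()
    if pid == 0:
        # Child process. A real helper would join the container's mount
        # namespace at this point (omitted here), then run the callback.
        status = 1
        try:
            status = 0 if callback() == 0 else 1
        finally:
            # Leave via _exit so no parent cleanup handlers run in the child.
            os._exit(status)
    # Parent process: wait for the child and translate its wait status.
    _, wait_status = os.waitpid(pid, 0)
    if os.WIFEXITED(wait_status):
        return os.WEXITSTATUS(wait_status)
    return -1
```

Because the work happens in a separate process, only a small status value
comes back to the caller; anything richer would need an explicit channel
such as a pipe.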

Idea by Dan Berrange, based on an initial report by Reco
<recoverym4n@gmail.com> at
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=732394

Signed-off-by: Daniel Berrange <berrange@redhat.com>
Signed-off-by: Eric Blake <eblake@redhat.com>
11 years ago  Add virFileMakeParentPath helper function
Daniel P. Berrange [Thu, 30 Jan 2014 17:06:39 +0000 (17:06 +0000)]
Add virFileMakeParentPath helper function

Add a helper function which takes a file path and ensures
that all directory components leading up to the file exist.
IOW, it strips the filename part of the path and passes
the result to virFileMakePath.
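
The behaviour described above, stripping the filename and creating whatever
directories remain, can be sketched in a few lines. This is a Python analogue
for illustration only, not the C implementation; make_parent_path is a
hypothetical name, and os.makedirs stands in for virFileMakePath.

```python
import os


def make_parent_path(path: str) -> None:
    """Create every directory leading up to 'path', but not 'path' itself."""
    parent = os.path.dirname(path)
    if parent:
        # exist_ok=True makes the call a no-op for directories already present.
        os.makedirs(parent, exist_ok=True)
```

For example, make_parent_path("/tmp/demo/a/b/file.txt") would create
/tmp/demo/a/b and leave file.txt for the caller to create.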

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Move check for cgroup devices ACL upfront in LXC hotplug
Daniel P. Berrange [Wed, 5 Feb 2014 17:48:03 +0000 (17:48 +0000)]
Move check for cgroup devices ACL upfront in LXC hotplug

The check for whether the cgroup devices ACL is available is
done quite late during LXC hotplug - in fact after the device
node is already created in the container in some cases. Better
to do it upfront so we fail immediately.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Disks are always block devices, never character devices
Daniel P. Berrange [Wed, 5 Feb 2014 11:01:09 +0000 (11:01 +0000)]
Disks are always block devices, never character devices

The LXC disk hotplug code was allowing block or character devices
to be given as disk. A disk is always a block device.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Fix reset of cgroup when detaching USB device from LXC guests
Daniel P. Berrange [Tue, 4 Feb 2014 17:41:22 +0000 (17:41 +0000)]
Fix reset of cgroup when detaching USB device from LXC guests

When detaching a USB device from an LXC guest we must remove
the device from the cgroup ACL. Unfortunately we were telling
the cgroup code to use the guest /dev path, not the host /dev
path, and the guest device node had already been unlinked.
This was, however, fortunate since the code passed &priv->cgroup
instead of priv->cgroup, so would have crashed if the device node
were accessible.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Record hotplugged USB device in LXC live guest config
Daniel P. Berrange [Tue, 4 Feb 2014 16:46:28 +0000 (16:46 +0000)]
Record hotplugged USB device in LXC live guest config

After hotplugging a USB device, the LXC driver forgot
to add the device def to the virDomainDefPtr.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Fix path used for USB device attach with LXC
Daniel P. Berrange [Tue, 4 Feb 2014 16:43:18 +0000 (16:43 +0000)]
Fix path used for USB device attach with LXC

The LXC code missed the 'usb' component out of the path
/dev/bus/usb/$BUSNUM/$DEVNUM, so it failed to actually
set up cgroups for the device. This was in fact lucky
because the call to virLXCSetupHostUsbDeviceCgroup
was also mistakenly passing '&priv->cgroup' instead of
just 'priv->cgroup'. So once the path is fixed, libvirtd
would then crash trying to access the bogus virCgroupPtr
pointer. This would have been a security issue, were it
not for the bogus path preventing the pointer reference
being reached.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  Don't block use of USB with containers
Daniel P. Berrange [Tue, 4 Feb 2014 16:21:12 +0000 (16:21 +0000)]
Don't block use of USB with containers

virDomainDefCompatibleDevice blocks use of USB if no USB
controller is present. This is not correct for containers
since devices can be assigned directly regardless of any
controllers.

Signed-off-by: Daniel P. Berrange <berrange@redhat.com>
11 years ago  qemu: Implement VIR_DOMAIN_TAINT_HOOK
Michal Privoznik [Tue, 4 Feb 2014 15:42:13 +0000 (16:42 +0100)]
qemu: Implement VIR_DOMAIN_TAINT_HOOK

Currently, there's just one place where we care if a hook script is
changing the domain XML: the migration hook for incoming migration. In
all other places where a hook script is executed, we don't read the
XML back from the script.

Anyway, the hook script can alter domain XML and hence we should taint
it if the script did.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  virDomainTaintFlags: Introduce VIR_DOMAIN_TAINT_HOOK
Michal Privoznik [Tue, 4 Feb 2014 15:36:37 +0000 (16:36 +0100)]
virDomainTaintFlags: Introduce VIR_DOMAIN_TAINT_HOOK

This new flag is to be used for tainting domains whose
XML definition was altered at runtime by a hook script.

Signed-off-by: Michal Privoznik <mprivozn@redhat.com>
11 years ago  Revert "storage: Introduce internal pool support"
Peter Krempa [Fri, 14 Feb 2014 15:03:22 +0000 (16:03 +0100)]
Revert "storage: Introduce internal pool support"

The internal pools were an idea in one of the first iterations of the
gluster series, which we decided not to use. Somehow the patch still
got pushed. Remove it as the internal flag isn't needed.

This reverts commit 362da8209d760fc1acd3a1c8df5b04aa676492eb.

11 years ago  Add tests for secret XML parsing
Ján Tomko [Fri, 14 Feb 2014 14:44:59 +0000 (15:44 +0100)]
Add tests for secret XML parsing

also validate it against the RNG schema.

11 years ago  docs: remove <auth> from secret XML format
Ján Tomko [Fri, 14 Feb 2014 14:37:06 +0000 (15:37 +0100)]
docs: remove <auth> from secret XML format

This belongs to the pool definition.

11 years ago  Forgot to add lxcconf2xmldata to dist.
Cédric Bosdonnat [Fri, 14 Feb 2014 15:06:55 +0000 (16:06 +0100)]
Forgot to add lxcconf2xmldata to dist.

11 years ago  lxc: Don't shadow global symbol "link"
Peter Krempa [Fri, 14 Feb 2014 12:46:35 +0000 (13:46 +0100)]
lxc: Don't shadow global symbol "link"

Yet another variable name frowned upon by older compilers. Introduced in
commit b73c029d.

11 years ago  Support IPv6 in port allocator
Ján Tomko [Fri, 18 Oct 2013 11:52:03 +0000 (13:52 +0200)]
Support IPv6 in port allocator

Also try to bind on IPv6 to check if the port is occupied.

Change the mocked bind in the test to return EADDRINUSE
for some ports only for the IPv4/IPv6 socket if we're testing
on a host with IPv6 compiled in.

Also mock socket() to make it fail with EAFNOSUPPORT
if LIBVIRT_TEST_IPV4ONLY is set in the environment, to
simulate a host without IPv6 support in the kernel. The
tests are repeated again with this variable set.
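
The idea of probing a port on both address families can be sketched as
below. This is a Python illustration only, not the port allocator's C code:
port_is_free is a hypothetical name, and treating a host without IPv6 as
"skip the IPv6 probe" is an assumption modelled on the IPv4-only case
described above.

```python
import errno
import socket


def port_is_free(port: int) -> bool:
    """Return False if binding the port fails with EADDRINUSE on IPv4 or IPv6."""
    for family, any_addr in ((socket.AF_INET, "0.0.0.0"), (socket.AF_INET6, "::")):
        try:
            sock = socket.socket(family, socket.SOCK_STREAM)
        except OSError as exc:
            if family == socket.AF_INET6 and exc.errno == errno.EAFNOSUPPORT:
                continue  # host without IPv6 support: only IPv4 is probed
            raise
        try:
            if family == socket.AF_INET6:
                # Probe IPv6 on its own rather than as a dual-stack socket.
                sock.setsockopt(socket.IPPROTO_IPV6, socket.IPV6_V6ONLY, 1)
            sock.bind((any_addr, port))
        except OSError as exc:
            if exc.errno == errno.EADDRINUSE:
                return False
            raise
        finally:
            sock.close()
    return True
```

A port only counts as free when neither family reports it as occupied; a
probe like this is inherently racy, since another process can take the port
right after the check.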

https://bugzilla.redhat.com/show_bug.cgi?id=1025407

11 years ago  Split out bind() from virPortAllocatorAcquire
Ján Tomko [Thu, 31 Oct 2013 14:14:15 +0000 (15:14 +0100)]
Split out bind() from virPortAllocatorAcquire

11 years ago  storage: gluster: Don't leak private data when storage file init fails
Peter Krempa [Fri, 14 Feb 2014 12:08:39 +0000 (13:08 +0100)]
storage: gluster: Don't leak private data when storage file init fails

In a44b7b87bcc6681e2939f65a3552fc96f68bc7b6 I've introduced a function
that initializes a storage file wrapper object on gluster based volumes.

The initialization function leaks the private data pointer in case of
failure. This patch fixes it.

Reported by John Ferlan.

11 years ago  storage: Fix build with older compilers after gluster snapshot series
Peter Krempa [Fri, 14 Feb 2014 10:46:37 +0000 (11:46 +0100)]
storage: Fix build with older compilers after gluster snapshot series

In commit e32268184b4fd1611ed5ffd3c758b8f6a34152e6 I accidentally added
a typedef for virStorageFileBackend twice when I moved it between files
across patch iterations. The double declaration breaks the build on older
compilers in RHEL5 and FreeBSD.

Remove the spurious definition.

11 years ago  qemu: snapshot: Add support for external active snapshots on gluster
Peter Krempa [Mon, 25 Nov 2013 17:56:24 +0000 (18:56 +0100)]
qemu: snapshot: Add support for external active snapshots on gluster

Add support for gluster backed images as sources for snapshots in the
qemu driver. This will also simplify adding further network backed
volumes as sources for snapshots, should qemu support them.

11 years ago  qemu: snapshot: Use new APIs to detect presence of existing storage files
Peter Krempa [Tue, 11 Feb 2014 16:18:35 +0000 (17:18 +0100)]
qemu: snapshot: Use new APIs to detect presence of existing storage files

Use the new storage driver based "stat" API to detect existing files just
as we did with local files.

11 years ago  qemu: Switch snapshot deletion to the new API functions
Peter Krempa [Fri, 31 Jan 2014 13:26:32 +0000 (14:26 +0100)]
qemu: Switch snapshot deletion to the new API functions

Use the new storage driver APIs to delete snapshot backing files in case
of failure instead of directly relying on "unlink". This will help us in
the future when we will be adding network based storage without local
representation in the host.

11 years ago  storage: Add storage file backends for gluster
Peter Krempa [Mon, 3 Feb 2014 16:18:24 +0000 (17:18 +0100)]
storage: Add storage file backends for gluster

Implement storage backend functions to deal with gluster volumes and
implement the "stat" and "unlink" backend APIs.

11 years ago  storage: add file functions for local and block files
Peter Krempa [Mon, 3 Feb 2014 15:41:49 +0000 (16:41 +0100)]
storage: add file functions for local and block files

Implement the "stat" and "unlink" functions for "file" volumes and "stat"
for "block" volumes using the regular system calls.
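The split described above can be pictured as a table of function pointers per volume type, where an unsupported operation is left NULL. This is a rough sketch under that assumption; the file_backend type and every function name here are hypothetical and do not reproduce libvirt's actual storage backend API. Only stat(2) and unlink(2) are real system calls.

```c
#include <stddef.h>
#include <sys/stat.h>
#include <unistd.h>

/* Hypothetical backend table; names are illustrative, not libvirt's API. */
typedef struct {
    int (*stat_fn)(const char *path, struct stat *st);
    int (*unlink_fn)(const char *path);   /* NULL when unsupported */
} file_backend;

static int local_stat(const char *path, struct stat *st)
{
    return stat(path, st);                /* regular system call */
}

static int local_unlink(const char *path)
{
    return unlink(path);                  /* regular system call */
}

/* "file" volumes support both operations. */
const file_backend file_volume_backend = { local_stat, local_unlink };

/* "block" volumes only support stat. */
const file_backend block_volume_backend = { local_stat, NULL };

int backend_stat(const file_backend *backend, const char *path,
                 struct stat *st)
{
    return backend->stat_fn(path, st);
}

/* Returns 0 on success, -1 on failure, -2 if the backend lacks unlink. */
int backend_unlink(const file_backend *backend, const char *path)
{
    if (!backend->unlink_fn)
        return -2;
    return backend->unlink_fn(path);
}
```

Callers go through backend_stat() and backend_unlink() rather than the system calls, so a network backend could later supply its own functions without changing the callers.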

Peter Krempa [Mon, 3 Feb 2014 15:12:57 +0000 (16:12 +0100)]
storage: Add file storage APIs in the default storage driver

Add APIs that allow the storage driver to assist in operations on
files, even for remote filesystems without a native representation as
files on the host.

Peter Krempa [Thu, 13 Feb 2014 09:41:01 +0000 (10:41 +0100)]
conf: Move qemuSnapshotDiskGetActualType to virDomainSnapshotDiskGetActualType

All the data for getting the actual type is present in the snapshot
config. There is no need to keep this function private to the qemu
driver, and it will be reused later in other parts of libvirt.

Peter Krempa [Thu, 13 Feb 2014 09:41:01 +0000 (10:41 +0100)]
conf: Move qemuDiskGetActualType to virDomainDiskGetActualType

All the data for getting the actual type is present in the domain
config. There is no need to keep this function private to the qemu
driver, and it will be reused later in other parts of libvirt.

Eric Blake [Wed, 12 Feb 2014 21:33:16 +0000 (14:33 -0700)]
spec: add missing dep of libvirt-daemon-config-nwfilter

Similar to cf76c4b, if modules are used, then nwfilter configuration
requires the nwfilter driver module.

Signed-off-by: Eric Blake <eblake@redhat.com>

Eric Blake [Thu, 13 Feb 2014 13:34:14 +0000 (06:34 -0700)]
Revert "spec: require libvirt-wireshark from libvirt metapackage"

This reverts commit 8d6c3659b8c9b861b00a19b26079d11d56dce680.

After further list discussion, it was decided that pulling in
wireshark as a dependency is a bit too much for the base 'libvirt'
package.  Remember that 'libvirt-devel' is also not pulled in
by the base 'libvirt' - the metapackage exists for full
functionality of libvirtd, rather than to pull in all subpackages.

Cédric Bosdonnat [Thu, 13 Feb 2014 12:45:44 +0000 (13:45 +0100)]
lxc from native: removed now remaining useless line

Philipp Hahn [Thu, 13 Feb 2014 08:41:54 +0000 (09:41 +0100)]
Fix stream related spelling mistakes

Remove double "is".
Consistent spelling of all-uppercase I/O.

Signed-off-by: Philipp Hahn <hahn@univention.de>

Eric Blake [Wed, 12 Feb 2014 20:27:38 +0000 (13:27 -0700)]
spec: require libvirt-wireshark from libvirt metapackage

In general, the 'libvirt' metapackage should pull in all subpackages.
Fix this for the wireshark subpackage created in commit f9ada9f.

* libvirt.spec.in (Requires): Add dependency.

Signed-off-by: Eric Blake <eblake@redhat.com>

Thierry Parmentelat [Tue, 11 Feb 2014 10:35:20 +0000 (11:35 +0100)]
spec: add missing dep of libvirt-daemon-config-network

When building modules, libvirt-daemon-config-network requires
libvirt-daemon-driver-network to ensure the 'default' network
is set up properly.

Signed-off-by: Eric Blake <eblake@redhat.com>

Thierry Parmentelat [Mon, 10 Feb 2014 09:54:30 +0000 (10:54 +0100)]
spec: require libvirt-daemon-driver-interface only when built

Signed-off-by: Eric Blake <eblake@redhat.com>

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:17 +0000 (15:10 +0100)]
LXC from native: convert blkio throttle config

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:16 +0000 (15:10 +0100)]
LXC: added some doc on domxml-from-native with mention of limitations

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:15 +0000 (15:10 +0100)]
LXC from native: map vlan network type

The problem with VLAN is that the user still has to manually create the
vlan interface on the host. The generated configuration then uses it as
a network hostdev device, so the configurations generated from the
following two fragments are equivalent (see rhbz#1059637).

lxc.network.type = phys
lxc.network.link = eth0.5

lxc.network.type = vlan
lxc.network.link = eth0
lxc.network.vlan.id = 5
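
The equivalence of the two fragments above amounts to deriving the host interface name from the link and the VLAN id. A minimal sketch, assuming the user created the vlan interface with the `<link>.<id>` naming shown in the first fragment; vlan_ifname() is a hypothetical helper, not a function from the libvirt sources.

```c
#include <stdio.h>

/*
 * Hypothetical helper: build the host VLAN interface name ("eth0.5")
 * from the values of lxc.network.link and lxc.network.vlan.id.
 * Returns 0 on success, -1 on invalid input or if the buffer is too small.
 */
int vlan_ifname(const char *link, int vlan_id, char *out, size_t outlen)
{
    int n;

    if (!link || !out || vlan_id < 0)
        return -1;

    n = snprintf(out, outlen, "%s.%d", link, vlan_id);
    return (n < 0 || (size_t)n >= outlen) ? -1 : 0;
}
```

With link "eth0" and id 5 this yields "eth0.5", the same device name the phys fragment refers to directly.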

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:14 +0000 (15:10 +0100)]
LXC from native: map block filesystems

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:13 +0000 (15:10 +0100)]
LXC from native: map lxc.arch to /domain/os/type@arch

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:12 +0000 (15:10 +0100)]
LXC from native: add lxc.cgroup.blkio.* mapping

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:11 +0000 (15:10 +0100)]
LXC from native: map lxc.cgroup.cpuset.*

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:10 +0000 (15:10 +0100)]
LXC from native: map lxc.cgroup.cpu.*

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:09 +0000 (15:10 +0100)]
LXC from native: migrate memory tuning

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:08 +0000 (15:10 +0100)]
LXC from native: convert lxc.id_map into <idmap>

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:07 +0000 (15:10 +0100)]
LXC from native: convert macvlan network configuration

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:06 +0000 (15:10 +0100)]
LXC from native: convert lxc.tty to console devices

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:05 +0000 (15:10 +0100)]
LXC from native: convert phys network types to net hostdev devices

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:04 +0000 (15:10 +0100)]
LXC from native: migrate veth network configuration

Some of the LXC configuration properties aren't migrated since they
would only cause problems in libvirt-lxc:
  * lxc.network.ipv[46]: the LXC driver doesn't set up IP addresses of
    guests, see rhbz#1059624
  * lxc.network.name, see rhbz#1059630

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:03 +0000 (15:10 +0100)]
LXC from native: implement no network conversion

If no network configuration is provided, LXC only provides the loopback
interface. To match this, we need to use the privnet feature. LXC will
also define a 'none' network type in its 1.0.0 version that matches the
libvirt LXC driver's default.

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:02 +0000 (15:10 +0100)]
LXC from native: migrate fstab and lxc.mount.entry

Relative tmpfs sizes and the default size of 50% aren't supported, as
the available memory is not known at conversion time.
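
To illustrate the limitation, here is a rough sketch of a size parser that accepts absolute tmpfs sizes and rejects percentages. parse_tmpfs_size() is a hypothetical helper written for this note, not the converter's actual code; it assumes the k/m/g suffixes accepted by the tmpfs "size=" mount option.

```c
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper: parse the value of a tmpfs "size=" option into bytes.
 * Returns 0 and stores the size on success. Returns -1 for invalid input
 * and for percentages, which cannot be resolved without knowing the
 * memory available to the container.
 */
int parse_tmpfs_size(const char *value, unsigned long long *bytes)
{
    char *end = NULL;
    unsigned long long n;

    if (!value || !bytes || value[0] < '0' || value[0] > '9')
        return -1;
    if (strchr(value, '%'))
        return -1;                        /* relative size: unsupported */

    n = strtoull(value, &end, 10);

    switch (*end) {
    case '\0': break;
    case 'k': case 'K': n <<= 10; end++; break;
    case 'm': case 'M': n <<= 20; end++; break;
    case 'g': case 'G': n <<= 30; end++; break;
    default: return -1;
    }
    if (*end != '\0')
        return -1;                        /* trailing garbage */

    *bytes = n;
    return 0;
}
```

A mount entry with no size option at all falls under the same restriction, since tmpfs would then default to 50% of memory; a caller of this sketch would have to treat a missing option as unsupported too.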

Cédric Bosdonnat [Wed, 5 Feb 2014 14:10:01 +0000 (15:10 +0100)]
LXC from native: import rootfs

An LXC rootfs can be a directory, a block device, or an image file.
The first two types have been implemented, but the image file is
still to be done, since LXC auto-guesses the file format at mount time
and the LXC driver doesn't support the 'auto' format.
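
One way to tell the three rootfs kinds apart is to inspect the file mode returned by stat(2). The sketch below assumes that approach; the commit message does not say how the converter decides, and rootfs_kind and classify_rootfs() are hypothetical names.

```c
#include <sys/stat.h>

typedef enum {
    ROOTFS_DIRECTORY,
    ROOTFS_BLOCK,
    ROOTFS_IMAGE_FILE,   /* not converted: the format would need guessing */
    ROOTFS_UNKNOWN
} rootfs_kind;

/* Hypothetical helper: classify an lxc.rootfs path by its file type. */
rootfs_kind classify_rootfs(const char *path)
{
    struct stat st;

    if (!path || stat(path, &st) < 0)
        return ROOTFS_UNKNOWN;
    if (S_ISDIR(st.st_mode))
        return ROOTFS_DIRECTORY;
    if (S_ISBLK(st.st_mode))
        return ROOTFS_BLOCK;
    if (S_ISREG(st.st_mode))
        return ROOTFS_IMAGE_FILE;
    return ROOTFS_UNKNOWN;
}
```

Under this scheme only ROOTFS_DIRECTORY and ROOTFS_BLOCK would be converted, mirroring the two types the commit implements.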