cloudstack

mirror of https://github.com/apache/cloudstack.git synced 2025-10-26 08:42:29 +01:00

Author	SHA1	Message	Date
Rene Glover	6ee6603359	Updates to HPE-Primera and Pure FlashArray Drivers to use Host-based VLUN Assignments (#8889) * Updates to change Pure and Primera to host-centric vlun assignments; various small bug fixes * update to add timestamp when deleting pure volumes to avoid future conflicts * update to migrate to properly check disk offering is valid for the target storage pool * improve error handling when copying volumes to add precision to which step failed * rename pure volume before delete to avoid conflicts if the same name is used before it's expunged on the array * remove dead code in AdaptiveDataStoreLifeCycleImpl.java * Fix issues found in PR checks * fix session refresh TTL logic * updates from PR comments * logic to delete by path ONLY on supported OUI * fix to StorageSystemDataMotionStrategy compile error * change noisy debug message to trace message * fix double callback call in handleVolumeMigrationFromNonManagedStorageToManagedStorage * fix for flash array delete error * fix typo in StorageSystemDataMotionStrategy * change copyVolume to use writeback to speed up copy ops * remove returning PrimaryStorageDownloadAnswer when connectPhysicalDisk returns false during KVMStorageProcessor template copy * remove change to only set UUID on snapshot if it is a vmSnapshot * reverting change to UserVmManagerImpl.configureCustomRootDiskSize * add error checking/simplification per comments from @slavkap * Update engine/storage/datamotion/src/main/java/org/apache/cloudstack/storage/motion/StorageSystemDataMotionStrategy.java Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com> * address PR comments from @sureshanaparti --------- Co-authored-by: GLOVER RENE <rg9975@cs419-mgmtserver.rg9975nprd.app.ecp.att.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2024-06-25 10:35:39 +05:30
slavkap	8b07b66f14	Fix volume snapshot of encrypted NFS/StorPool volume (#8873 ) * Fix volume snapshot of encrypted NFS/StorPool volume * remove comments * removed invoking the real qemu convert command * fix UnsatisfiedLink error in unit tests * addressed comments extracted method	2024-06-24 13:09:21 +05:30
Suresh Kumar Anaparti	c17aa0d9ad	Import Remote KVM VM logging improvements (#9284 )	2024-06-24 11:34:37 +05:30
Rene Peinthor	f4612c51ec	libvirtstorage: Make sure netfs storage was really mounted (#8887 )	2024-06-23 19:41:02 +05:30
Suresh Kumar Anaparti	5ab23cd9c9	Timeout config to copy the disks of remote KVM instance while importing the instance from an external host (#9213 ) * Added timeout config to copy the disks of remote KVM instance while importing the instance from an external host * Updated copy config units to mins * Cleanup remote converted file and local file when copy failed	2024-06-21 10:28:18 +05:30
Vishesh	74f5e52e6e	Fix unit test failure (#9238 )	2024-06-13 16:06:35 +05:30
Wei Zhou	b2ef53b8a2	kvm: replace ISO path in vm XML configuration during vm migration (#9212 ) * kvm: replace ISO path in vm XML configuration during vm migration * Update 9212: address comments * kvm: fix vm migration if there are multiple image stores	2024-06-12 16:01:23 +02:00
Suresh Kumar Anaparti	4ec0f823cf	ScaleIO volume live migration - use usable bytes from source disk to format the destination disk (#9174 )	2024-06-12 13:58:10 +05:30
Suresh Kumar Anaparti	2e3f76ec03	Improve error messaging / logs when listing VMs on the remote KVM host (for import) (#9204 )	2024-06-11 14:48:21 +02:00
Harikrishna	acae5c5b9e	kvm: Update the java doc for the method disconnectPhysicalDiskByPath (#9210) This PR addresses issue #8789. The original issue is that the disconnectPhysicalDiskByPath() implementation in FibreChannelAdaptor always returns true regardless of whether the operation succeeded. This was already fixed in PR #8889. Ideally this method should be called after choosing the right adapter based on the storage pool type of the volume path, but currently it is just called in a loop. `05b9b6e2e7/plugins/hypervisors/kvm/src/main/java/com/cloud/hypervisor/kvm/storage/KVMStoragePoolManager.java (L200-L212)` I tried to avoid looping over all adapters by passing the storage pool type to the caller's cleanup() method, but that change touches code all over (which I fear would create other regressions). Instead, I feel we can keep the current approach, since the Fibre Channel adapter has already been fixed. In this PR I've added the java doc explaining the method and the situation.	2024-06-11 14:44:46 +05:30
Abhishek Kumar	10f4de0318	kvm: consider provisioning type for local data volumes (#9141 ) Fixes #8644 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-06-10 11:38:31 +03:00
Rohit Yadav	3de1f8b4ba	Merge remote-tracking branch 'origin/4.18' into 4.19 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-29 13:44:34 +05:30
João Jandre	cec6ade257	change live migration API used on kvm (#8952 )	2024-04-25 09:35:25 +02:00
Wei Zhou	0b857def68	New feature: Import/Unmanage DATA volume from storage pool (#8808)	2024-04-23 16:05:59 +02:00
Rohit Yadav	0fa71f5696	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-04-23 15:21:44 +05:30
Rohit Yadav	5a52ca78ae	kvm: export sysinfo for arm64 domains for cloud-init to work (#8940 ) This fixes a limitation for arm64/aarch64 KVM hosts to correctly export the product name via sysconfig attribute. Without this `cloud-init` doesn't function correctly on arm64 platforms. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-19 21:23:49 +02:00
João Jandre	8a101fbbc1	Updating pom.xml version numbers for release 4.18.3.0-SNAPSHOT Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-04-17 11:11:57 -03:00
João Jandre	154566f914	Updating pom.xml version numbers for release 4.18.2.0 Signed-off-by: João Jandre <48719461+JoaoJandre@users.noreply.github.com>	2024-04-12 08:25:04 -03:00
Suresh Kumar Anaparti	d3e020a545	Mark libvirt events experimental, add properties flag (#8825 ) * Mark libvirt events experimental, add properties flag * unit test fixes --------- Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-04-11 17:06:33 +05:30
Abhishek Kumar	ff3e9bd821	engine-storage: control download redirection Add a global setting to control whether redirection is allowed while downloading templates and volumes Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-04-04 14:11:05 +05:30
Wei Zhou	939d0b9011	engine-storage: control download redirection Add a global setting to control whether redirection is allowed while downloading templates and volumes core: some changes on SimpleHttpMultiFileDownloader similar as HttpTemplateDownloader Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> (cherry picked from commit b1642bc3bf58ccde9f56f632b5a9fe46a3eb5356) Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-04-04 11:19:20 +05:30
Wei Zhou	a7ec8738a2	kvm: fix NPE while import KVM VMs from other hosts (#8720 )	2024-03-04 09:46:28 +01:00
Harikrishna	c462be1412	New API "checkVolume" to check and repair any leaks or issues reported by qemu-img check (#8577 ) * Introduced a new API checkVolumeAndRepair that allows users or admins to check and repair if any leaks observed. Currently this is supported only for KVM * some fixes * Added unit tests * addressed review comments * add repair volume while granting access * Changed repair parameter to accept both leaks/all * Introduced new global setting volume.check.and.repair.before.use to do volume check and repair before VM start or volume attach operations * Added volume check and repair changes only during VM start and volume attach operations * Refactored the names to look similar across the code * Some code fixes * remove unused code * Renamed repair values * Fixed unit tests * changed version * Address review comments * Code refactored * used volume name in logs * Changed the API to Async and the setting scope to storage pool * Fixed exit value handling with check volume command * Fixed storage scope to the setting * Fix volume format issues * Refactored the log messages * Fix formatting	2024-02-29 14:41:49 +05:30
Vishesh	a8028eecbd	Merge remote-tracking branch 'origin/4.18' into 4.19	2024-02-13 11:44:20 +05:30
dahn	672206c312	kvm: ITCO watchdog added (#8282 ) * ITCO watchdog added * add inject-nmi action * Update plugins/hypervisors/kvm/src/main/java/com/cloud/hypervisor/kvm/resource/LibvirtVMDef.java Co-authored-by: Wei Zhou <weizhou@apache.org> --------- Co-authored-by: Wei Zhou <weizhou@apache.org>	2024-02-12 08:54:39 +01:00
Wei Zhou	b8904f75dd	Merge remote-tracking branch 'apache/4.18' into 4.19	2024-02-05 10:08:31 +01:00
Marcus Sorensen	9f1b34aeb2	Fix libvirt domain event listener by properly processing events (#8437 ) * Fix libvirt domain event listener by properly processing events * Add javadoc for setupEventListener --------- Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-02-05 13:30:10 +05:30
Abhishek Kumar	a7b97ff3b0	Updating pom.xml version numbers for release 4.19.1.0-SNAPSHOT Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-02-02 18:06:04 +05:30
Lucas Martins	1c98b5a4e5	Change Cryptsetup validation (#8482 ) Co-authored-by: lucas.martins.scclouds <lucas.martins@scclouds.com.br>	2024-02-01 09:43:28 +01:00
Abhishek Kumar	2746225b99	Updating pom.xml version numbers for release 4.19.0.0 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-01-29 10:21:52 +05:30
Vishesh	fedcf66de0	Externalise a few timeouts & fix timeout for hostSupportsUefi in libvirt ready command wrapper (#8547) This PR fixes a bug introduced in #8502. The timeout for script execution was set to 60 ms instead of 60 s, which resulted in the host not getting UEFI enabled. This is a blocker for the 4.19 release. We do this by introducing a new agent parameter `agent.script.timeout` (default - 60 seconds) to use as a timeout for the script checking the host's UEFI status. We also externalize the timeout for the ReadyCommand by introducing a new global setting `ready.command.wait` (default - 60 seconds). For ModifyStoragePoolCommand, we don't externalize the timeout, to avoid confusion for the user, since the required timeout can vary depending on the provider in use and we are only setting the wait for the default host listener for now. Instead, we reuse the global `wait` setting by dividing it by `5`, making the default value 6 minutes (1800/5 = 360s) for ModifyStoragePoolCommand. Note: the actual time the MS waits is twice the wait set for a Command. Check the reference code below. `19250403e6/engine/orchestration/src/main/java/com/cloud/agent/manager/AgentAttache.java (L406-L442)`	2024-01-27 23:36:13 +05:30
Vishesh	c3b77cb7b8	Fix host stuck in connecting state (#8502 ) There are a lot of test failures due to test_vm_life_cycle.py in multiple PRs due to host not available for migration of VMs. #8438 (comment) #8433 (comment) #7344 (comment) While debugging I noticed that the hosts get stuck in Connecting state because MS is waiting for a response of the ReadyCommand from the agent. Since we take a lock on connection and disconnection, restarting the agent doesn't work. To fix this, we have to restart the MS or wait for ~1 hour (default timeout). On the agent side, it gets stuck waiting for a response from the Script execution. To reproduce, run smoke/test_vm_life_cycle.py (TestSecuredVmMigration test class to be specific). Once the tests are complete, you will notice that some hosts are stuck in Connecting state. And restarting the agent fails due to the named lock. Locks on DB can be checked using the below query. SELECT * FROM performance_schema.metadata_locks INNER JOIN performance_schema.threads ON THREAD_ID = OWNER_THREAD_ID WHERE PROCESSLIST_ID <> CONNECTION_ID() \G; This PR adds a wait for the ready command and a timeout to the Script execution to ensure that the thread doesn't get stuck and the named lock from database is released.	2024-01-15 13:56:34 +05:30
Nicolas Vazquez	a3a4833c3e	Fixes for KVM unmanaged instances import on advanced network and VNC password (#8492 ) This PR fixes a regression caused by #8465 on advanced zones, import fails with: 2024-01-10 12:13:33,234 DEBUG [o.a.c.e.o.NetworkOrchestrator] (API-Job-Executor-3:ctx-991bbe9f job-128 ctx-f49517d4) (logid:d7b8e716) Allocating nic for vm 142272e8-9e2e-407b-9d7e-e9a03b81653c in network Network {"id": 204, "name": "Isolated", "uuid": "9679fac5-e3ac-4694-a57b-beb635340f39", "networkofferingid": 10} during import 2024-01-10 12:13:33,239 ERROR [o.a.c.v.UnmanagedVMsManagerImpl] (API-Job-Executor-3:ctx-991bbe9f job-128 ctx-f49517d4) (logid:d7b8e716) Failed to import NICs while importing vm: i-2-31-VM com.cloud.exception.InsufficientVirtualNetworkCapacityException: Unable to acquire Guest IP address for network Network {"id": 204, "name": "Isolated", "uuid": "9679fac5-e3ac-4694-a57b-beb635340f39", "networkofferingid": 10}Scope=interface com.cloud.dc.DataCenter; id=1 at org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.importNic(NetworkOrchestrator.java:4582) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importNic(UnmanagedVMsManagerImpl.java:859) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importVirtualMachineInternal(UnmanagedVMsManagerImpl.java:1198) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importUnmanagedInstanceFromHypervisor(UnmanagedVMsManagerImpl.java:1511) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.baseImportInstance(UnmanagedVMsManagerImpl.java:1342) at org.apache.cloudstack.vm.UnmanagedVMsManagerImpl.importUnmanagedInstance(UnmanagedVMsManagerImpl.java:1282) at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method) Also, addresses the VNC password field set instead of a fixed string	2024-01-12 14:14:01 +05:30
Nicolas Vazquez	59e78cbc45	Fix KVM unmanage disks path (#8483 ) This PR fixes the volumes path on KVM import unmanaged instances Fixes: #8479	2024-01-11 14:45:57 +05:30
slavkap	c569fe9119	Fix KVM import and list unmanaged VMs (#8445 ) VM import fixes 1 - Fix of VM insert for VMs with StorPool volumes 2 - Fix of list/insert unmanaged VMs with RBD volumes	2024-01-10 13:12:07 +05:30
kishankavala	ab20b1220f	KVM Ingestion - Import Instance (#7976 ) This PR adds new functionality to import KVM instances from an external host or from disk images in local or shared storage. Doc PR: https://github.com/apache/cloudstack-documentation/pull/356	2023-12-14 13:08:56 +05:30
Abhishek Kumar	82f7abddb3	Merge remote-tracking branch 'apache/4.18'	2023-12-13 11:24:15 +05:30
Bryan Lima	3bb318bab9	kvm: Add support for cgroupv2 (#8252) 1. Problem description In Apache CloudStack (ACS), when a VM is deployed in a host with the KVM hypervisor, an XML file is created in the assigned host, which has a property shares that defines the weight of the VM to access the host CPU. The value of this property has no unit, and it is a relative measure to calculate how much CPU a given VM will have in the host. However, this value has a limit, which depends on the version of cgroup utilized by the host's kernel. The problem lies in the range of shares values, which differs between the two versions: [2, 262144] for cgroups version 1; and [1, 10000] for cgroups version 2. Currently, ACS calculates the value of shares using Equation 1, presented below, where CPU is the number of cores and speed is the CPU frequency; both specified in the VM's compute offering. Therefore, if a compute offering has, for example, 6 cores at 2 GHz, the shares value will be 12000 and an exception will be thrown by libvirt if the host utilizes cgroup v2. The second version is becoming the default one in current Linux distributions; thus, it is necessary to address this limitation. Equation 1: shares = CPU * speed. Fixes: #6744 2. Proposed changes To address the problem described, we propose to apply a scale conversion considering the max shares of the host. Using the same formula currently utilized by ACS, it is possible to calculate the maximum shares of a VM for a given host. In other words, using the number of cores and the nominal speed of the host's CPU as the upper limit of shares allowed to a VM. Then, this value will be scaled to the allowed interval of [1, 10000] of cgroup v2 by using a linear scale conversion. The VM shares would be calculated as Equation 2, presented below, where VM requested shares is the requested shares value calculated using Equation 1, cgroup upper limit is fixed with a value of 10000 (cgroups v2 upper limit), and host max shares is the maximum shares value of the host, calculated using Equation 1. Using Equation 2, the only case where a VM passes the cgroup v2 limit is when the user requests more resources than the host has, which is not possible with the current implementation of ACS. Equation 2: shares = (VM requested shares * cgroup upper limit) / host max shares. To implement the proposal, the following APIs will be updated: deployVirtualMachine, migrateVirtualMachine and scaleVirtualMachine. When a VM is being deployed, a new verification will be added to find a suitable host. The max shares of each host will be calculated, and the VM's calculated shares will be checked to ensure they do not surpass the host's value. Likewise, the migration of VMs will have a similar new verification. Lastly, the scale of VMs will also have the same verification for the VM's host. To determine the max shares of a given host, we will use the same equation currently used in ACS for calculating the shares of VMs, presented in Section 1. When Equation 1 is used to determine the maximum shares of a host, CPU is the number of cores of the host, and speed is the nominal CPU speed, i.e., considering the CPU's base frequency. It is important to note that these changes are only for hosts with the KVM hypervisor using cgroup v2 for now.	2023-12-13 10:51:24 +05:30
Rene Glover	1031c31e6a	FiberChannel Multipath for KVM + Pure Flash Array and HPE-Primera Support (#7889) This PR provides a new primary storage volume type called "FiberChannel" that allows access to volumes connected to hosts over fiber channel connections. It requires Multipath to provide path discovery and failover. Second, the PR adds an AdaptivePrimaryDatastoreProvider that abstracts how volumes are managed/orchestrated from the connector to communicate with the primary storage provider, using a ProviderAdapter interface, allowing the code interacting with the primary storage provider APIs to be simpler and have no direct dependencies on CloudStack code. Lastly, the PR provides an implementation of the ProviderAdapter classes for the HP Enterprise Primera line of storage solutions and the Pure Flash Array line of storage solutions.	2023-12-09 11:31:33 +05:30
Abhishek Kumar	c599011ef5	Merge remote-tracking branch 'apache/4.18'	2023-12-08 18:06:15 +05:30
Wei Zhou	7ea068c4dc	kvm: fix error 'Failed to find passphrase for keystore: cloud.jks' when enable SSL for kvm agent (#7923 )	2023-12-07 09:10:11 +01:00
Nicolas Vazquez	371ad9f55b	New Feature: Import VMware VMs into KVM (#7881) This PR adds the capability in CloudStack to convert VMware Instances disk(s) to KVM using virt-v2v and import them as CloudStack instances. It enables CloudStack operators to import VMware instances from vSphere into a KVM cluster managed by CloudStack. The vSphere/VMware setup might be managed by CloudStack or be a standalone setup. CloudStack will let the administrator select a VM from an existing VMware vCenter in the CloudStack environment or from an external vCenter, requesting the vCenter IP, Datacenter name and credentials. The migrated VM will be imported as a KVM instance. The migration is done through virt-v2v: https://access.redhat.com/articles/1351473, https://www.ovirt.org/develop/release-management/features/virt/virt-v2v-integration.html The migration process timeout can be set by the setting convert.instance.process.timeout. Before attempting the virt-v2v migration, CloudStack will create a clone of the source VM on VMware. The clone VM will be removed after the registration process finishes. CloudStack will delegate the migration action to a KVM host and the host will attempt to migrate the VM by invoking virt-v2v. If the guest OS is not supported, CloudStack will handle the error as a failed operation. The migration process using virt-v2v may not be fast. CloudStack will not perform any check of the guest OS compatibility for the virt-v2v library as indicated on: https://access.redhat.com/articles/1351473.	2023-12-07 12:59:56 +05:30
sato03	fdfbb4fad1	Prioritize hypervisor.uri configuration (#8254 ) Co-authored-by: Henrique Sato <henrique.sato@scclouds.com.br>	2023-12-06 16:43:04 -03:00
Daan Hoogland	14376ce298	Merge release branch 4.18 to main * 4.18: kvm: fix ide controller for rocky/alma vms (#8247)	2023-12-06 16:06:09 +01:00
Wei Zhou	db6dd52f44	kvm: fix ide controller for rocky/alma vms (#8247 )	2023-12-06 15:05:49 +01:00
Stephan Krug	267a457efc	Externalize KVM HA heartbeat frequency (#6892 ) Co-authored-by: Stephan Krug <stephan.krug@scclouds.com.br> Co-authored-by: GaOrtiga <49285692+GaOrtiga@users.noreply.github.com> Co-authored-by: dahn <daan.hoogland@gmail.com>	2023-11-16 09:17:17 +01:00
Daan Hoogland	05b9b6e2e7	Merge branch '4.18' into main	2023-11-13 11:36:51 +01:00
Abhishek Kumar	d0f3233fda	edge-zone,kvm,iso,cks: allow k8s deployment with direct-download iso (#8142 ) Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2023-11-10 13:56:05 +01:00
slavkap	2bb182c3e1	KVM Host HA enhancement for StorPool storage (#8045) Extends the current functionality of KVM Host HA to the StorPool storage plugin and adds an option for easy integration so that the rest of the storage plugins can support Host HA. This extension works like the current NFS storage implementation. It allows Host HA to be used simultaneously with NFS and StorPool storage, or only with StorPool primary storage. If it is used with different primary storages like NFS and StorPool, and one of the storage health checks fails, there is an option to report the failure to the management server with the global config kvm.ha.fence.on.storage.heartbeat.failure. By default this option is disabled; when enabled, the Host HA service will continue with the checks on the host and eventually will fence the host.	2023-11-04 12:35:37 +05:30
Daan Hoogland	a15cb81c85	Merge remote-tracking branch 'apache/4.18' into main	2023-11-03 11:55:26 +01:00
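
Note on commit 3bb318bab9 (kvm: Add support for cgroupv2): the two equations in that commit message can be worked through in a few lines. The following is a minimal sketch in Python, not the CloudStack implementation. The function names, the integer rounding, the clamp to a minimum of 1, and the 32-core 2500 MHz host are assumptions made for illustration; only the two formulas, the upper limit of 10000, and the 6-core 2 GHz offering come from the commit message.

```python
# Upper bound of the cgroup v2 shares range quoted in the commit message.
CGROUP_V2_UPPER_LIMIT = 10000


def equation_1(cores: int, speed_mhz: int) -> int:
    """Equation 1: shares = CPU * speed (used for both VMs and hosts)."""
    return cores * speed_mhz


def equation_2(vm_requested_shares: int, host_max_shares: int,
               upper_limit: int = CGROUP_V2_UPPER_LIMIT) -> int:
    """Equation 2: linear scaling of the requested shares into the cgroup v2 range.

    Integer division and the clamp to 1 are assumptions of this sketch;
    the commit message does not specify how rounding is handled.
    """
    if host_max_shares <= 0:
        raise ValueError("host_max_shares must be positive")
    scaled = (vm_requested_shares * upper_limit) // host_max_shares
    return max(1, scaled)


# Offering from the commit message: 6 cores at 2 GHz (2000 MHz).
vm_shares = equation_1(6, 2000)       # 12000, above the cgroup v2 limit
# Hypothetical host: 32 cores at a nominal 2500 MHz.
host_shares = equation_1(32, 2500)    # 80000
print(equation_2(vm_shares, host_shares))
```

With these numbers the unscaled value of 12000 would be rejected on a cgroup v2 host, while the scaled value stays inside [1, 10000] as long as the VM does not request more than the host offers.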

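Note on commit fedcf66de0 (Externalise a few timeouts): the timeout arithmetic in that commit message can be checked with a short sketch. This is an illustration in Python, not the Java code from the commit. The function names are hypothetical; the values (60 ms versus 60 s, the global `wait` of 1800 divided by 5, and the management server waiting twice the command wait) are taken from the commit message.

```python
from datetime import timedelta

# The bug described in the commit: 60 was interpreted as milliseconds, not seconds.
buggy_timeout = timedelta(milliseconds=60)
intended_timeout = timedelta(seconds=60)


def modify_storage_pool_wait(global_wait_s: int = 1800) -> int:
    """Wait for ModifyStoragePoolCommand: the global `wait` setting divided by 5."""
    return global_wait_s // 5


def effective_ms_wait(command_wait_s: int) -> int:
    """Per the commit message, the MS actually waits twice the wait set on a Command."""
    return 2 * command_wait_s


command_wait = modify_storage_pool_wait()     # 360 seconds, i.e. 6 minutes
print(command_wait, effective_ms_wait(command_wait))
print(intended_timeout / buggy_timeout)       # how much shorter the buggy timeout was
```

The sketch makes the unit error concrete: the script checking UEFI support was given a timeout a thousand times shorter than intended.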

1153 Commits