cloudstack

mirror of https://github.com/apache/cloudstack.git synced 2025-11-04 20:29:27 +01:00

Author	SHA1	Message	Date
Abhishek Kumar	5e98405b38	Merge remote-tracking branch 'apple/apple-base418' into scalability-improvements	2024-07-22 16:12:19 +05:30
Vishesh	c2de75744e	kvm: Add support for cgroupv2 (#8252 ) (#459 ) * kvm: Add support for cgroupv2 (#8252) 1. Problem description In Apache CloudStack (ACS), when a VM is deployed in a host with the KVM hypervisor, an XML file is created in the assigned host, which has a property shares that defines the weight of the VM to access the host CPU. The value of this property has no unit, and it is a relative measure to calculate how much CPU a given VM will have in the host. However, this value has a limit, which depends on the version of cgroup utilized by the host's kernel. The problem lies in the range of shares, which differs between the two versions: [2, 262144] for cgroups version 1; and [1, 10000] for cgroups version 2. Currently, ACS calculates the value of shares using Equation 1, presented below, where CPU is the number of cores and speed is the CPU frequency; both specified in the VM's compute offering. Therefore, if a compute offering has, for example, 6 cores at 2 GHz, the shares value will be 12000 and an exception will be thrown by libvirt if the host utilizes cgroup v2. The second version is becoming the default one in current Linux distributions; thus, it is necessary to address this limitation. Equation 1 shares = CPU * speed Fixes: #6744 2. Proposed changes To address the problem described, we propose to apply a scale conversion considering the max shares of the host. Using the same formula currently utilized by ACS, it is possible to calculate the maximum shares of a VM for a given host. In other words, using the number of cores and the nominal speed of the host's CPU as the upper limit of shares allowed to a VM. Then, this value will be scaled to the allowed interval of [1, 10000] of cgroup v2 by using a linear scale conversion.
The VM shares would be calculated as Equation 2, presented below, where VM requested shares is the requested shares value calculated using Equation 1, cgroup upper limit is fixed with a value of 10000 (cgroups v2 upper limit), and host max shares is the maximum shares value of the host, calculated using Equation 1. Using Equation 2, the only case where a VM passes the cgroup v2 limit is when the user requests more resources than the host has, which is not possible with the current implementation of ACS. Equation 2 shares = (VM requested shares * cgroup upper limit)/host max shares To implement the proposal, the following APIs will be updated: deployVirtualMachine, migrateVirtualMachine and scaleVirtualMachine. When a VM is being deployed, a new verification will be added to find a suitable host. The max shares of each host will be calculated, and the VM calculated shares will be verified if it does not surpass the host's value. Likewise, the migration of VMs will have a similar new verification. Lastly, the scale of VMs will also have the same verification for the VM's host. To determine the max shares of a given host, we will use the same equation currently used in ACS for calculating the shares of VMs, presented in Section 1. When Equation 1 is used to determine the maximum shares of a host, CPU is the number of cores of the host, and speed is the nominal CPU speed, i.e., considering the CPU's base frequency. It is important to note that these changes are only for hosts with the KVM hypervisor using cgroup v2 for now. * Update overcommit ratio during live VM migration * minor refactoring --------- Co-authored-by: Bryan Lima <42067040+BryanMLima@users.noreply.github.com>	2024-06-27 12:22:17 +05:30
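The two equations in the commit message above can be sketched as follows. This is a minimal illustration, not the code from the commit: the function names, the use of MHz for speed, the integer division, and the floor of 1 are assumptions; only the formulas and the limit of 10000 come from the message.

```python
CGROUP_V2_UPPER_LIMIT = 10000  # upper bound of the cgroup v2 range quoted in the commit message


def requested_shares(cores: int, speed_mhz: int) -> int:
    """Equation 1: shares = CPU * speed (speed assumed to be in MHz)."""
    return cores * speed_mhz


def scaled_shares(vm_cores: int, vm_speed_mhz: int,
                  host_cores: int, host_speed_mhz: int) -> int:
    """Equation 2: (VM requested shares * cgroup upper limit) / host max shares."""
    vm = requested_shares(vm_cores, vm_speed_mhz)
    host_max = requested_shares(host_cores, host_speed_mhz)
    if vm > host_max:
        # The message says a VM cannot request more than the host has.
        raise ValueError("VM requests more CPU than the host provides")
    # Integer division and the lower bound of 1 are assumptions of this sketch.
    return max(1, vm * CGROUP_V2_UPPER_LIMIT // host_max)


# The 6-core, 2 GHz offering from the message, on a hypothetical 32-core, 2 GHz host.
print(requested_shares(6, 2000))         # 12000, above the cgroup v2 limit
print(scaled_shares(6, 2000, 32, 2000))  # 1875, inside [1, 10000]
```

With this scaling, a VM that consumes the whole host maps exactly to the upper limit of 10000.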
Abhishek Kumar	8f88103a29	FR72 - api,server: purge expunged resources (#405 ) This PR introduces the functionality of purging removed DB entries for CloudStack entities (currently only for VirtualMachine). There would be three mechanisms for purging removed resources: - Background task - CloudStack will run a background task which runs at a defined interval. Other parameters for this task can be controlled with new global settings. - API - New API `purgeExpungedResources`. It will allow passing the following parameters - resourcetype, batchsize, startdate, enddate - Config for service offering. Service offerings can be created with purgeresources parameter which would allow purging resources immediately on expunge. Following new global settings have been added: - `expunged.resources.purge.enabled`: Default: false. Whether to run a background task to purge the DB records of the expunged resources. - `expunged.resources.purge.resources`: Default: (empty). A comma-separated list of resource types that will be considered by the background task to purge the DB records of the expunged resources. Currently only VirtualMachine is supported. An empty value will result in considering all resource types for purging. - `expunged.resources.purge.interval`: Default: 86400. Interval (in seconds) for the background task to purge the DB records of the expunged resources. - `expunged.resources.purge.delay`: Default: 300. Initial delay (in seconds) to start the background task to purge the DB records of the expunged resources task. - `expunged.resources.purge.batch.size`: Default: 50. Batch size to be used during purging of the DB records of the expunged resources. - `expunged.resources.purge.start.time`: Default: (empty). Start time to be used by the background task to purge the DB records of the expunged resources. Use format `yyyy-MM-dd` or `yyyy-MM-dd HH:mm:ss`. - `expunged.resources.purge.keep.past.days`: Default: 30. 
The number of days in the past from the execution time of the background task to purge the DB records of the expunged resources for which the expunged resources must not be purged. To enable purging DB records of the expunged resource till the execution of the background task, set the value to zero. - `expunged.resource.purge.job.delay`: Default: 180. Delay (in seconds) to execute the purging of the DB records of an expunged resource initiated by the configuration in the offering. Minimum value should be 180 seconds and if a lower value is set then the minimum value will be used. Upstream PRs: https://github.com/apache/cloudstack/pull/8999 https://github.com/apache/cloudstack-documentation/pull/397 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com>	2024-06-19 12:59:50 +05:30
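Two of the settings described in the commit message above reduce to simple arithmetic, sketched here. The helper names are hypothetical, and how the background task actually derives its cutoff is an assumption; the 180-second minimum and the "keep past days" semantics are taken from the message.

```python
from datetime import datetime, timedelta

MIN_JOB_DELAY_SECONDS = 180  # minimum stated for expunged.resource.purge.job.delay


def effective_job_delay(configured_seconds: int) -> int:
    """A value below the minimum falls back to the minimum."""
    return max(configured_seconds, MIN_JOB_DELAY_SECONDS)


def purge_cutoff(run_time: datetime, keep_past_days: int) -> datetime:
    """Assumed reading of expunged.resources.purge.keep.past.days:
    resources expunged after the returned time are not purged."""
    return run_time - timedelta(days=keep_past_days)


print(effective_job_delay(60))                     # 180
print(purge_cutoff(datetime(2024, 6, 19), 30))     # 2024-05-20 00:00:00
```

Setting `keep_past_days` to zero makes the cutoff equal to the execution time, which matches the message's note about purging everything up to the run of the background task.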
Rohit Yadav	b03d1382e6	fix unit tests failures Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2024-05-23 10:23:32 +05:30
Vishesh	c21b6d8b52	Update volume's passphrase to null if diskOffering doesn't support encryption (#428 )	2024-04-23 09:46:20 -06:00
Marcus Sorensen	f896586925	Update version to 4.18.1.1 (#417 ) * Update version to 4.18.1.1 * Update changelog * Update changelog * Update changelog --------- Co-authored-by: Marcus Sorensen <mls@apple.com>	2024-04-08 09:27:57 -06:00
Abhishek Kumar	996ae9a959	engine-storage: control download redirection Add a global setting to control whether redirection is allowed while downloading templates and volumes Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2024-04-01 09:23:17 +05:30
Harikrishna	747d1101c1	New API "checkVolume" to check and repair any leaks or repair all issues (#362 ) * Introduced a new API "checkVolumeAndRepair" that allows users or admins to check and repair if any leaks observed. Currently this is supported only for KVM * some fixes * Added unit tests * addressed review comments * add repair volume while granting access * Changed repair parameter to accept both leaks/all values * Introduced new global setting volume.check.and.repair.before.use to do volume check and repair before VM start or volume attach operations * Added volume check and repair changes only during VM start and volume attach operations * Refactored the names to look similar across the code * Some code fixes * remove unused code * Renamed repair values * Addressed review comments * code refactored * used volume name in logs * Changed the API to Async and the setting scope to storage pool * Fixed exit value handling with check volume command * Fixed storage scope to the setting * Fixed volume format issues * Refactored the log messages * Fix formatting	2024-02-29 14:40:40 +05:30
Vishesh	f30e07b312	Fix host stuck in connecting state (#375 ) * Fix host stuck in connecting state (#8502) There are a lot of test failures due to test_vm_life_cycle.py in multiple PRs due to host not available for migration of VMs. #8438 (comment) #8433 (comment) #7344 (comment) While debugging I noticed that the hosts get stuck in Connecting state because MS is waiting for a response of the ReadyCommand from the agent. Since we take a lock on connection and disconnection, restarting the agent doesn't work. To fix this, we have to restart the MS or wait for ~1 hour (default timeout). On the agent side, it gets stuck waiting for a response from the Script execution. To reproduce, run smoke/test_vm_life_cycle.py (TestSecuredVmMigration test class to be specific). Once the tests are complete, you will notice that some hosts are stuck in Connecting state. And restarting the agent fails due to the named lock. Locks on DB can be checked using the below query. SELECT * FROM performance_schema.metadata_locks INNER JOIN performance_schema.threads ON THREAD_ID = OWNER_THREAD_ID WHERE PROCESSLIST_ID <> CONNECTION_ID() \G; This PR adds a wait for the ready command and a timeout to the Script execution to ensure that the thread doesn't get stuck and the named lock from database is released. * Externalise a few timeouts & fix timeout for hostSupportsUefi in libvirt ready command wrapper (#8547) This PR fixes bug introduced in #8502. Timeout for script execution was set to 60 ms instead of 60s which resulted in host not getting UEFI enabled. This is a blocker for 4.19 release. We do this by introducing a new agent parameter `agent.script.timeout` (default - 60 seconds) to use as a timeout for the script checking host's UEFI status. We also externalize the timeout for the ReadyCommand by introducing a new global setting `ready.command.wait` (default - 60 seconds). For ModifyStoragePoolCommand, we don't externalize the timeout to avoid confusion for the user. 
This is because the required timeout can vary depending on the provider in use, and we are only setting the wait for the default host listener for now. Instead, we reuse the global `wait` setting by dividing it by `5`, making the default value 6 minutes (1800/5 = 360s) for ModifyStoragePoolCommand. Note: the actual time the MS waits is twice the wait set for a Command. Check reference code below. `19250403e6/engine/orchestration/src/main/java/com/cloud/agent/manager/AgentAttache.java (L406-L442)` * fixup	2024-02-21 13:44:53 +05:30
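The timeout arithmetic in the commit message above can be checked with a short sketch. The function names are hypothetical; the division by 5, the 1800-second figure, and the doubling by the management server are the values stated in the message.

```python
GLOBAL_WAIT_SECONDS = 1800  # value used in the message's "1800/5 = 360s" example


def modify_storage_pool_wait(global_wait: int = GLOBAL_WAIT_SECONDS) -> int:
    """Wait for ModifyStoragePoolCommand: the global `wait` divided by 5."""
    return global_wait // 5


def management_server_wait(command_wait: int) -> int:
    """Per the message, the MS actually waits twice the wait set on a command."""
    return 2 * command_wait


wait = modify_storage_pool_wait()
print(wait)                          # 360 seconds, i.e. 6 minutes
print(management_server_wait(wait))  # 720 seconds of actual waiting
print(management_server_wait(60))    # 120 seconds for the 60-second ready.command.wait default
```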
Suresh Kumar Anaparti	89f93746ac	Storage plugin support to check if volume on datastore requires access for migration (#380 ) * Check if volume on datastore requires access for migration, and grant/revoke volume access if requires * Updated default implementation for requiresAccessForMigration method in PrimaryDataStoreDriver	2024-02-20 11:32:32 -07:00
Abhishek Kumar	6a9cdedda4	api,server,ui: tagged resource limits (#327 ) Introduces the concept of tagged resource limits. Limits can be enforced on accounts and domains for the deployment of entities for a tagged resource. Currently, tagged resource limits can be used for the following resource types: Host limits (user_vm, cpu, memory) and Storage limits (volume, primary_storage). The following global settings can be used to specify the tags for which limits need to be enforced: Host: resource.limit.host.tags; Storage: resource.limit.storage.tags. Options for specifying tagged resource limits and viewing tagged resource usage are made available in the UI. Enhances use of templatetag for VM deployment and template creation. Adds an option to list disk offerings with a suitability flag for a virtual machine. A new parameter named virtualmachineid has been added to the listDiskOfferings API which, when passed, returns the suitableforvirtualmachine param in the response.	2024-02-07 17:35:15 +05:30
Marcus Sorensen	40dd867198	Apple base418 storagepooltype as class (#351 ) * StoragePoolType as a class * Fix agent side StoragePoolType enum to class * Handle StoragePoolType for StoragePoolJoinVO * Since StoragePoolType is a class, it cannot be converted by @Enumerated annotation. Implemented converter class and logic to utilize @Convert annotation. * Fix UserVMJoinVO for StoragePoolType * fixed missing imports * Since StoragePoolType is a class, it cannot be converted by @Enumerated annotation. Implemented converter class and logic to utilize @Convert annotation. * Fixed equals for the enum. * removed not needed try/catch for prepareAttribute * Added license to the file. * Implemented "supportsPhysicalDiskCopy" for storage adaptor. (#352) Co-authored-by: mprokopchuk <mprokopchuk@apple.com> * Add javadoc to StoragePoolType class * Add unit test for StoragePoolType comparisons * StoragePoolType "==" and ".equals()" fix. * Fix for abstract storage adaptor set up issue * review comments --------- Co-authored-by: Marcus Sorensen <mls@apple.com> Co-authored-by: mprokopchuk <mprokopchuk@apple.com> Co-authored-by: mprokopchuk <mprokopchuk@gmail.com> Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com>	2024-01-25 14:58:44 +05:30
Suresh Kumar Anaparti	e704b6e492	Fix reorder/list pools when cluster details are not set (#358 ) * Fix reorder/list pools when cluster details are not set * minor code improvements * added unit tests	2024-01-18 15:29:00 +05:30
kishankavala	99939d22a7	CleanUp Async Jobs after mgmt server maintenance (#356 ) * Cleanup Volume AsyncJob after mgmt server stop * Clean Up Vm async job resources during mgmt server stop * Use State.isTransitional method to identify transition states * Add cleanup for Network Async Job * Add license * Added RevertSnapshotting to volume transition state. Fixed spacing code style * Added transitional flag in Volume state * Updated network event for failed job, (re)added cleanup for volumes created from snapshots, and some code improvements * Added java doc for volume state constructor * Fixed cleanup SNAPSHOT_ID entry in volume details for failed volumes created from snapshots --------- Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com>	2024-01-09 17:54:26 +05:30
Harikrishna	28be74e0b9	Add lock mechanism considering template id, pool id, host id (#345 ) * Add lock mechanism considering template id, pool id, host id * Added missing lock	2023-12-06 13:51:25 +05:30
slavkap	a768c96a6d	Create snapshot from VM snapshot without memory for NFS/Local storage (#8117 ) (cherry picked from commit 6ae3b73ca2d7bf12acdd03333749b7a0a8cc94ee) Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2023-10-26 12:18:56 +05:30
Rohit Yadav	18e7276df0	storage: allow VM snapshots without memory for KVM when global setting allows (#8062 ) This removes the conditional logic whose comment notes that it should be removed after PR #5297 is merged, which applies to ACS 4.18+. Only when the global setting is enabled and memory isn't selected can a VM snapshot be allowed for VMs on KVM that have qemu-guest-agent running. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com> (cherry picked from commit 8350ce5aa40241d683749c1257813fa649e998be) Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2023-10-11 21:01:31 +05:30
Nicolas Vazquez	4dfa38aae9	plugins: Add Custom hypervisor minimal changes (#7692 ) Design document: https://cwiki.apache.org/confluence/display/CLOUDSTACK/%5BDRAFT%5D+Minimal+changes+to+allow+new+dynamic+hypervisor+type%3A+Custom+Hypervisor This PR introduces the minimal changes to add a new hypervisor type (internally named Custom in the codebase, with a configurable display name), making it possible to write an external hypervisor plugin as a Custom Hypervisor for CloudStack. The custom hypervisor name is set by the setting 'hypervisor.custom.display.name'. The new hypervisor type does not affect the behaviour of any CloudStack operation; it simply introduces a new hypervisor type into the system. CloudStack does not have any means to dynamically add new hypervisor types. The hypervisor types are internally preset by an enum defined within the CloudStack codebase, and unless a new version supports a new hypervisor it is not possible to add a host of a hypervisor that is not part of the enum. It is possible to implement minimal changes in CloudStack to support a new hypervisor plugin that may be developed privately. This PR is an initial work on allowing new dynamic hypervisor types (adds a new element to the HypervisorType enum, but allows variable display name for the hypervisor). Replace the HypervisorType from a fixed enum to an extensible registry mechanism, registered from the hypervisor plugin - The new hypervisor type is internally named 'Custom' to the CloudStack services (management server and agent services, database records). - A new global setting ‘hypervisor.custom.display.name’ allows administrators to set the display name of the hypervisor type. The display name will be shown in the CloudStack UI and API. - In case the ‘hypervisor.list’ setting contains the display name of the new hypervisor type, the setting value is automatically updated after the ‘hypervisor.custom.display.name’ setting is updated.
- The new Custom hypervisor type supports: - Direct downloads (the ability to download templates into primary storage from the hypervisor hosts without using secondary storage) - Local storage (use hypervisor hosts local storage as primary storage) - Template format: RAW format (the templates to be registered on the new hypervisor type must be in RAW format) - The UI is also extended to display the new hypervisor type and the supported features listed above. - The above are the minimal changes for CloudStack to support the new hypervisor type, which can be tested by integrating the plugin codebase with this feature. This PR allows the cloud administrators to test custom hypervisor plugins implementations in CloudStack and easily integrate it into CloudStack as a new hypervisor type ("Custom"), reducing the implementation to only the hypervisor supported specific storage/networking and the hypervisor resource to communicate with the management server. - CloudStack admin should be able to create a zone for the new custom hypervisor and add clusters, hosts into the zone with normal operations - CloudStack users should be able to execute normal VMs/volumes/network/storage operations on VMs/volumes running on the custom hypervisor hosts (cherry picked from commit 8b5ba13b817e7d53beb0e0012c27d065e96a708c) Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2023-09-27 16:19:42 +05:30
Wei Zhou	4bdff06acd	Updating pom.xml version numbers for release 4.18.1.0 Signed-off-by: Wei Zhou <weizhou@apache.org>	2023-09-07 08:50:50 +02:00
Vishesh	594c70dde0	Sync precommit config from main (#7732 ) Co-authored-by: John Bampton <jbampton@users.noreply.github.com> Co-authored-by: dahn <daan@onecht.net>	2023-07-07 11:18:16 +02:00
Harikrishna	40cc10a73d	Allow volume migrations in ScaleIO within and across ScaleIO storage clusters (#7408 ) * Live storage migration of volume in scaleIO within same storage scaleio cluster * Added migrate command * Recent changes of migration across clusters * Fixed uuid * recent changes * Pivot changes * working blockcopy api in libvirt * Checking block copy status * Formatting code * Fixed failures * code refactoring and some changes * Removed unused methods * removed unused imports * Unit tests to check if volume belongs to same or different storage scaleio cluster * Unit tests for volume livemigration in ScaleIOPrimaryDataStoreDriver * Fixed offline volume migration case and allowed encrypted volume migration * Added more integration tests * Support for migration of encrypted volumes across different scaleio clusters * Fix UI notifications for migrate volume * Data volume offline migration: save encryption details to destination volume entry * Offline storage migration for scaleio encrypted volumes * Allow multiple Volumes to be migrated with migrateVirtualMachineWithVolume API * Removed unused unittests * Removed duplicate keys in migrate volume vue file * Fix Unit tests * Add volume secrets if they do not exist during volume migrations. Secrets are getting cleared on package upgrades. * Fix secret UUID for encrypted volume migration * Added a null check for secret before removing * Added more unit tests * Fixed passphrase check * Add image options to the encrypted volume conversion	2023-06-21 11:57:05 +05:30
Abhishek Kumar	32caf9057e	engine-storage: fix errored template becomes active (#7485 ) * engine-storage: fix errored template becomes active Fixes #7342 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * test Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> --------- Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2023-05-12 15:23:16 +02:00
Marcus Sorensen	ec0f8bddf6	Support local storage live migration for direct download templates (#7453 ) Co-authored-by: Marcus Sorensen <mls@apple.com>	2023-05-04 17:37:58 -03:00
Nicolas Vazquez	2dc016adde	Fix for direct download templates with multiple bypassed references (#7400 ) This PR fixes an issue observed on multiple zones and direct download templates on KVM, in which a template gets multiple records on the template_store_ref table. When this happens, the template cannot be used as direct download. In case of a system VM template using direct download, system VM deployments fail	2023-04-13 12:48:29 +05:30
Harikrishna	b774ee5d11	vmware: Datastore cluster synchronization should check if the child datastores are in UP state or not (#7385 ) This fix ensures that when a datastore cluster in VMware is added as a primary storage pool in CloudStack, all the child datastores (which already exist in CS) are in Up state. For example: 1. Datastore Cluster DS has two child datastores A and B in vCenter. (B is already added as a storage pool in CloudStack) 2. Now try to add datastore cluster DS into CloudStack as a primary storage pool 3. CloudStack tries to add child datastores A and B in CloudStack; since B is already there in CloudStack, it will reuse the existing storage pool entry and keep it under the parent storage pool DS. During Step 3 we now check whether B is in Up state or not.	2023-04-11 22:23:12 +05:30
Daan Hoogland	05cda2729f	Updating pom.xml version numbers for release 4.18.1.0-SNAPSHOT Signed-off-by: Daan Hoogland <daan@onecht.net>	2023-03-15 19:38:14 +01:00
Daan Hoogland	0574087284	Updating pom.xml version numbers for release 4.18.0.0 Signed-off-by: Daan Hoogland <daan@onecht.net>	2023-03-11 09:35:41 +01:00
Suresh Kumar Anaparti	d8c7e34b38	Improve global settings UI to be more intuitive/logical (#5797 ) Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com> Co-authored-by: nvazquez <nicovazquez90@gmail.com> Co-authored-by: davidjumani <dj.davidjumani1994@gmail.com> Co-authored-by: dahn <daan.hoogland@gmail.com> Co-authored-by: dahn <daan@onecht.net>	2023-01-31 11:23:43 +01:00
João Jandre	61a722548f	Create API to reassign volume (#6938 )	2023-01-27 11:10:56 +01:00
John Bampton	d74f64a2e1	Use lowercase HTTP header field names so we are compatible with HTTP/2 (#7006 )	2023-01-23 11:17:54 +01:00
John Bampton	e65c22d883	Fix spelling (#6860 )	2022-11-13 10:56:15 +01:00
João Jandre	14937e1adb	Fixed NPE on volume creation from snapshot (#6839 ) Co-authored-by: João Jandre <joao@scclouds.com.br>	2022-10-26 08:44:01 +02:00
dahn	4a06363749	Ova download fix (#6758 )	2022-10-21 14:31:19 +02:00
Daniel Augusto Veronezi Salvador	2ca164ac96	Quota custom tariffs (#5909 ) Co-authored-by: GutoVeronezi <daniel@scclouds.com.br> Co-authored-by: dahn <daan.hoogland@gmail.com>	2022-10-17 10:03:50 +02:00
Daniel Augusto Veronezi Salvador	7936eb04e9	server: Fix delete parent snapshot (#6630 ) ACS + Xenserver works with differential snapshots. ACS takes a volume full snapshot and the next ones are referenced as a child of the previous snapshot until the chain reaches the limit defined in the global setting snapshot.delta.max; then, a new full snapshot is taken. PR #5297 introduced disk-only snapshots for KVM volumes. Among the changes, the delete process was also refactored. Before the changes, when one was removing a snapshot with children, ACS was marking it as Destroyed and it was keeping the Image entry on the table cloud.snapshot_store_ref as Ready. When ACS was rotating the snapshots (the max delta was reached) and all the children were already marked as removed; then, ACS would start removing the whole hierarchy, completing the differential snapshot cycle. After the changes, the snapshots with children stopped being marked as removed and the differential snapshot cycle was not being completed. This PR intends to honor again the differential snapshot cycle for XenServer, making the snapshots to be marked as removed when deleted while having children and following the differential snapshot cycle. Also, when one takes a volume snapshot and ACS backs it up to the secondary storage, ACS inserts 2 entries on table cloud.snapshot_store_ref (Primary and Image). When one deletes a volume snapshot, ACS first tries to remove the snapshot from the secondary storage and mark the entry Image as removed; then, it tries to remove the snapshot from the primary storage and mark the entry Primary as removed. If ACS cannot remove the snapshot from the primary storage, it will keep the snapshot as BackedUp; however, If it does not exist in the secondary storage and without the entry SNAPSHOT.DELETE on cloud.usage_event. In the end, after the garbage collector flow, the snapshot will be marked as BackedUp, with a value in the field removed and still being rated. 
This PR also addresses the correction for this situation. Co-authored-by: GutoVeronezi <daniel@scclouds.com.br>	2022-10-13 12:31:11 +05:30
Daniel Augusto Veronezi Salvador	f7b29856d1	Refactor SnapshotDataStoreDaoImpl (#6751 ) * Refactor SnapshotDataStoreDaoImpl and add unit tests * Create constants for duplicated literals * Refactor search builders Co-authored-by: GutoVeronezi <daniel@scclouds.com.br> Co-authored-by: dahn <daan.hoogland@gmail.com>	2022-10-11 13:53:02 +02:00
Codegass	69e158d77d	Refactor TestHttp.testHttpclient to avoid the Exception Suppression (#6733 ) * Refactor TestHttp.testHttpclient to avoid the Exception Suppression * Remove the unnecessary import	2022-10-09 15:22:32 +05:30
Harikrishna	713a236843	UserData as first class resource (#6202 ) This PR introduces a new feature to make userdata as a first class resource much like existing SSH keys. Detailed feature specification document: https://cwiki.apache.org/confluence/display/CLOUDSTACK/Userdata+as+a+first+class+resource	2022-10-05 17:34:59 +05:30
Marcus Sorensen	697e12f8f7	kvm: volume encryption feature (#6522 ) This PR introduces a feature designed to allow CloudStack to manage a generic volume encryption setting. The encryption is handled transparently to the guest OS, and is intended to handle VM guest data encryption at rest and possibly over the wire, though the actual encryption implementation is up to the primary storage driver. In some cases cloud customers may still prefer to maintain their own guest-level volume encryption, if they don't trust the cloud provider. However, for private cloud cases this greatly simplifies the guest OS experience in terms of running volume encryption for guests without the user having to manage keys, deal with key servers and guest booting being dependent on network connectivity to them (i.e. Tang), etc, especially in cases where users are attaching/detaching data disks and moving them between VMs occasionally. The feature can be thought of as having two parts - the API/control plane (which includes scheduling aspects), and the storage driver implementation. This initial PR adds the encryption setting to disk offerings and service offerings (for root volume), and implements encryption support for KVM SharedMountPoint, NFS, Local, and ScaleIO storage pools. NOTE: While not required, operations can be significantly sped up by ensuring that hosts have the `rng-tools` package and service installed and running on the management server and hypervisors. For EL hosts the service is `rngd` and for Debian it is `rng-tools`. In particular, the use of SecureRandom for generating volume passphrases can be slow if there isn't a good source of entropy. This could affect testing and build environments, and otherwise would only affect users who actually use the encryption feature. If you find tests or volume creates blocking on encryption, check this first. 
### Management Server ##### API * createDiskOffering now has an 'encrypt' Boolean * createServiceOffering now has an 'encryptroot' Boolean. The 'root' suffix is added here in case there is ever any other need to encrypt something related to the guest configuration, like the RAM of a VM. This has been refactored to deal with the new separation of service offering from disk offering internally. * listDiskOfferings shows encryption support on each offering, and has an encrypt boolean to choose to list only offerings that do or do not support encryption * listServiceOfferings shows encryption support on each offering, and has an encrypt boolean to choose to list only offerings that do or do not support encryption * listHosts now shows encryption support of each hypervisor host via `encryptionsupported` * Volumes themselves don't show encryption on/off, rather the offering should be referenced. This follows the same pattern as other disk offering based settings such as the IOPS of the volume. ##### Volume functions A decent effort has been made to ensure that the most common volume functions have either been cleanly supported or blocked. However, for the first release it is advised to mark this feature as experimental, as the code base is complex and there are certainly edge cases to be found. Many of these features could eventually be supported over time, such as creating templates from encrypted volumes, but the effort and size of the change is already overwhelming. Supported functions: * Data Volume create * VM root volume create * VM root volume reinstall * Offline volume snapshot/restore * Migration of VM with storage (e.g. 
local storage VM migration) * Resize volume * Detach/attach volume Blocked functions: * Online volume snapshot * VM snapshot w/memory * Scheduled snapshots (would fail when VM is running) * Disk offering migration to offerings that don't have matching encryption * Creating template from encrypted volume * Creating volume from encrypted volume * Volume extraction (would we decrypt it first, or expose the key? Probably the former). ##### Primary Storage Support For storage developers, adding encryption support involves: 1. Updating the `StoragePoolType` for your primary storage to advertise encryption support. This is used during allocation of storage to match storage types that support encryption to storage that supports it. 2. Implementing encryption feature when your `PrimaryDataStoreDriver` is called to perform volume lifecycle functions on volumes that are requesting encryption. You are free to do what your storage supports - this could be as simple as calling a storage API with the right flag when creating a volume. Or (as is the case with the KVM storage types), as complex as managing volume details directly at the hypervisor host. The data objects passed to the storage driver will contain volume passphrases, if encryption is requested. ##### Scheduling For the KVM implementations specified above, we are dependent on the KVM hosts having support for volume encryption tools. As such, the hosts `StartupRoutingCommand` has been modified to advertise whether the host supports encryption. This is done via a probe during agent startup to look for functioning `cryptsetup` and support in `qemu-img`. This is also visible via the listHosts API and the host details in the UI. This was patterned after other features that require hypervisor support such as UEFI. The `EndPointSelector` interface and `DefaultEndpointSelector` have had new methods added, which allow the caller to ask for endpoints that support encryption. 
This can be used by storage drivers to find the proper hosts to send storage commands that involve encryption. Not all volume activities will require a host to support encryption (for example a snapshot backup is a simple file copy), and this is the reason why the interface has been modified to allow for the storage driver to decide, rather than just passing the data objects to the EndpointSelector and letting the implementation decide. VM scheduling has also been modified. When a VM start is requested, if any volume that requires encryption is attached, it will filter out hosts that don't support encryption. ##### DB Changes A volume whose disk offering enables encryption will get a passphrase generated for it before its first use. This is stored in the new 'passphrase' table, and is encrypted using the CloudStack installation's standard configured DB encryption. A field has been added to the volumes table, referencing this passphrase, and a foreign key added to ensure passphrases that are referenced can't be removed from the database. The volumes table now also contains an encryption format field, which is set by the implementer of the encryption and used as it sees fit. #### KVM Agent For the KVM storage pool types supported, the encryption has been implemented at Qemu itself, using the built-in LUKS storage support. This means that the storage remains encrypted all the way to the VM process, and decrypted before the block device is visible to the guest. This may not be necessary in order to implement encryption for /your/ storage pool type, maybe you have a kernel driver that decrypts before the block device on the system, or something like that. However, it seemed like the simplest, common place to terminate the encryption, and provides the lowest surface area for decrypted guest data. For qcow2 based storage, `qemu-img` is used to set up a qcow2 file with LUKS encryption. 
For block-based storage (currently just ScaleIO), the `cryptsetup` utility is used to format the block device as LUKS for data disks, but `qemu-img` and its LUKS support are used for template copy. Any volume that requires encryption will contain a passphrase ID as a byte array when handed down to the KVM agent. Care has been taken to ensure this doesn't get logged, and it is cleared after use in an attempt to avoid exposing it before garbage collection occurs. On the agent side, this passphrase is used in two ways: 1. In cases where the volume experiences some libvirt interaction, it is loaded into libvirt as an ephemeral, private secret and then referenced by secret UUID in any libvirt XML. This applies to things like VM startup, migration preparation, etc. 2. In cases where `qemu-img` needs to use this passphrase for volume operations, it is written to a `KeyFile` on the cloudstack agent's configured tmpfs and passed along. The `KeyFile` is a `Closeable`, and it is deleted when it is closed. This allows us to wrap any volume operation in try-with-resources and have the `KeyFile` removed whether or not the operation succeeds. In order to support the advanced syntax required to handle encryption and passphrases with `qemu-img`, the `QemuImg` utility has been modified to support the new `--object` and `--image-opts` flags. These are modeled as `QemuObject` and `QemuImageOptions`. These `qemu-img` flags have been designed to supersede some of the existing, older flags being used today (such as choosing file formats and paths), and an effort could be made to switch over to these wholesale. However, for now we have instead opted to keep existing functions and do some wrapping to ensure backward compatibility, so callers of `QemuImg` can choose to use either way. It should be noted that there are also a few different Enums that represent the encryption format for various purposes. While these are analogous in principle, they represent different things and should not be confused. 
For example, the supported encryption format for the `cryptsetup` utility is `LuksType.LUKS`, while `QemuImg` has a `QemuImg.PhysicalDiskFormat.LUKS`. Some additional effort could potentially be made to support advanced encryption configurations, such as choosing between LUKS1 and LUKS2 or changing cipher details. These may require changes all the way up through the control plane. However, in practice Libvirt and Qemu currently support only LUKS1. Additionally, the cipher details aren't required in order to use an encrypted volume: they're stored in the LUKS header on the volume, so there is no need to store them elsewhere. As such, we need only set the one encryption format upon volume creation, which is persisted in the volumes table and then available later as needed. In the future, when LUKS2 is standard and fully supported, we could move to it as the default; old volumes will still reference LUKS1 and have the headers on-disk to ensure they remain usable. We could also possibly support an automatic upgrade of the headers down the road, or a volume migration mechanism. Every version of cryptsetup and qemu-img tested on variants of EL7 and Ubuntu that support encryption uses the XTS-AES 256 cipher, which is the leading industry standard and a widely used cipher today (e.g. BitLocker and FileVault). Signed-off-by: Marcus Sorensen <mls@apple.com> Co-authored-by: Marcus Sorensen <mls@apple.com>	2022-09-27 10:20:59 +05:30
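The `KeyFile` handling described in the volume encryption commit above (write the passphrase to a temporary file, use it, and have it deleted on close) can be sketched roughly as follows. This is a minimal illustration and not the CloudStack implementation: the class `KeyFileSketch`, the constructor signature, and `getPath()` are hypothetical names, and the JVM default temp directory stands in for the agent's configured tmpfs.

```java
import java.io.Closeable;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

public class KeyFileSketch {

    /** Hypothetical stand-in for the agent's KeyFile: keeps a passphrase on disk until closed. */
    static final class KeyFile implements Closeable {
        private final Path path;

        KeyFile(byte[] passphrase) throws IOException {
            // Assumption: the default temp directory replaces the agent's configured tmpfs.
            this.path = Files.createTempFile("keyfile", ".tmp");
            Files.write(this.path, passphrase);
        }

        Path getPath() {
            return path;
        }

        @Override
        public void close() throws IOException {
            // Closing the KeyFile deletes it, so try-with-resources cleans up on any exit path.
            Files.deleteIfExists(path);
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] passphrase = "example-passphrase".getBytes(StandardCharsets.UTF_8);
        Path usedPath;
        try (KeyFile keyFile = new KeyFile(passphrase)) {
            usedPath = keyFile.getPath();
            // A real caller would pass keyFile.getPath() to a qemu-img invocation here.
            System.out.println("key file present during use: " + Files.exists(usedPath));
        } finally {
            // Clear the in-memory copy after use, mirroring the clearing described in the commit.
            Arrays.fill(passphrase, (byte) 0);
        }
        System.out.println("key file present after close: " + Files.exists(usedPath));
    }
}
```

Tying deletion to `close()` means the caller cannot forget the cleanup step, even when the volume operation inside the `try` block throws.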
João Jandre	efbf74ee06	Added new logs to volume creation (#6689 ) Co-authored-by: João Paraquetti <joao@scclouds.com.br>	2022-09-26 19:11:14 -03:00
Abhishek Kumar	e720b72e15	Merge remote-tracking branch 'apache/4.17' into main	2022-08-31 17:38:30 +05:30
Abhishek Kumar	a21efe75df	vmware: fix vm snapshot with datastore cluster, drs (#6643 ) Fixes #6595 Sync volume datastore, path and chaininfo info while calculating snapshot chain size after snapshot operation is complete from vCenter.	2022-08-31 16:00:14 +05:30
Suresh Kumar Anaparti	75da982d73	Updated resource counter to include correct size after volume creation/resize and other improvements (#6587 ) * Updated resource counter to include correct size after volume creation/resize and other improvements - Recalculate resource counters for root domain in the periodic task - Update correct size in the primary_storage resource counter after volume creation/resize - Some code improvements * review and sonarcloud issues Co-authored-by: Suresh Kumar Anaparti <suresh.anaparti@shapeblue.com> Co-authored-by: Daan Hoogland <daan@onecht.net>	2022-08-16 10:41:42 +02:00
Paula Oliveira	9717ed9af2	Improve log messages on VolumeOrchestrator class (#6408 ) Co-authored-by: Paula Zomignani Oliveira <paula@scclouds.com.br>	2022-08-12 09:17:06 +02:00
John Bampton	f9347ecf2c	Fix spelling (#6597 )	2022-08-03 15:43:47 +05:30
Rohit Yadav	661956cc60	Merge remote-tracking branch 'origin/4.17'	2022-07-20 11:52:26 +05:30
Harikrishna	2c05b63495	kvm: Fix for Revert volume snapshot (#6527 ) This PR fixes issue #6209, where the snapshot revert operation fails after certain volume operations such as migrating a VM with its volume, migrating a volume, or reinstalling a VM. The root cause is that the primary storage entry for the volume is deleted after these operations. The fix retrieves the primary datastore entry for the volume and continues the operation.	2022-07-20 11:34:02 +05:30
dahn	731a83babf	add global setting to allow parallel execution on vmware (#6413 ) * add global setting to allow parallel execution on vmware * cleanup setting distribution for vmware.create.full.clone * query setting in vmware guru * don't touch other hypervisor's commands * guru hierarchy cleanup	2022-07-15 10:01:35 +02:00
nvazquez	84eed6db72	Merge branch '4.17'	2022-06-10 08:28:41 -03:00
dahn	90a0ee0b6c	fix pseudo random behaviour in pool selection (#6307 ) * refactor and log trace * tracelogs * shuffle pools with real randomiser * single retrieval of async job context * some review comments addressed * Apply suggestions from code review Co-authored-by: Daniel Augusto Veronezi Salvador <38945620+GutoVeronezi@users.noreply.github.com> * log formatting * integration test for distribution of volumes over storages * move test to smoke tests * imports * sonarcloud issue # AYCOmVntKzsfKlhz0HDh * spellos * review comments * review comments * sonarcloud issues * unittest * import * Update AbstractStoragePoolAllocatorTest.java Co-authored-by: Daan Hoogland <dahn@onecht.net> Co-authored-by: Daniel Augusto Veronezi Salvador <38945620+GutoVeronezi@users.noreply.github.com>	2022-06-10 08:06:23 -03:00


1098 Commits