cloudstack

mirror of https://github.com/apache/cloudstack.git synced 2025-10-26 08:42:29 +01:00

Author	SHA1	Message	Date
Daan Hoogland	650b5ec3da	Merge branch '4.20'	2025-05-27 18:18:39 +02:00
Pearl Dsilva	16fc2cd1f0	Merge branch '4.19' of https://github.com/apache/cloudstack into 4.20	2025-05-27 19:27:33 +05:30
dahn	bb79f0b727	engine/schema: create default network offering for vpc tier with conserve_mode=1 for fresh installation (#10744 ) (#10843 ) Co-authored-by: Wei Zhou <weizhou@apache.org>	2025-05-27 08:17:49 +02:00
Wei Zhou	842b2f8c24	Merge remote-tracking branch 'apache/4.20'	2025-05-19 21:25:37 +02:00
Wei Zhou	5444261902	test: fix several simulator CI failures (#10890 ) * test: fix several simulator CI failures * Inject dataStoreProviderManager	2025-05-19 18:33:14 +02:00
Harikrishna	b17808bfba	Introducing Storage Access Groups for better management for host and storage connections (#10381 ) * Introducing Storage Access Groups to define the host and storage pool connections In CloudStack, when a primary storage is added at the Zone or Cluster scope, it is by default connected to all hosts within that scope. This default behavior can be refined using storage access groups, which allow operators to control and limit which hosts can access specific storage pools. Storage access groups can be assigned to hosts, clusters, pods, zones, and primary storage pools. When a storage access group is set on a cluster/pod/zone, all hosts within that scope inherit the group. Connectivity between a host and a storage pool is then governed by whether they share the same storage access group. A storage pool with a storage access group will connect only to hosts that have the same storage access group. A storage pool without a storage access group will connect to all hosts, including those with or without a storage access group.	2025-05-19 11:33:29 +05:30
Daan Hoogland	8f8c685d17	Merge branch '4.19' into 4.20	2025-05-16 15:51:37 +02:00
Manoj Kumar	d5ba23c848	Introduce volume allocation algorithm global configuration (#10696 )	2025-05-16 14:06:42 +02:00
slavkap	c183fc9859	Prevent data corruption for StorPool volumes (#10799 )	2025-05-16 10:02:33 +02:00
Suresh Kumar Anaparti	95489b8bdd	Direct agents rebalance improvements with multiple management server nodes (#10674 ) Sometimes hypervisor hosts (direct agents) stuck with Disconnect state during agent rebalancing activity across multiple management server nodes. This issue was noticed during frequent restart of the management server nodes in the cluster. When there are multiple management server nodes in a cluster, if one or more nodes are shutdown/start/restart, CloudStack will rebalance the hosts among the remaining nodes or move the nodes to the newly joined management server nodes. During the rebalancing period multiple operations could happen including: - DirectAgentScan at interval of configured direct.agent.scan.interval - AgentRebalanceScan to identify and schedule rebalance agents - TransferAgentScan to transfer the host from original owner to future owner Current Rebalance behavior 1. For hosts that have AgentAttache && not forForward but in Disconnect state, CloudStack simply ignore these hosts without trying to ping again or update the status of the host. 2. For hosts that have AgentAttache && forForward, CloudStack removes the agent but still try to loadDirectlyConnectedHost. Improved Rebalance behavior During DirectAgentScan: scanDirectAgentToLoad(), identify hosts that for self-managed hosts that are in Disconnect state (disconnected after pingtimeout). 1. For hosts that have AgentAttache and is forForward, CloudStack should remove the agent 2. For hosts that have AgentAttache and is not forForward but in Disconnect state, CloudStack should try to investigate and update the status to Up if host is pingable. 3. For hosts that don't have AgentAttache, CloudStack should try to loadDirectlyConnectedHost.	2025-05-13 17:47:46 +05:30
João Jandre	6fdaf51ddc	KVM incremental snapshot feature (#9270 ) * KVM incremental snapshot feature * fix log * fix merge issues * fix creation of folder * fix snapshot update * Check for hypervisor type during parent search * fix some small bugs * fix tests * Address reviews * do not remove storPool snapshots * add support for downloading diff snaps * Add multiple zones support * make copied snapshots have normal names * address reviews * Fix in progress * continue fix * Fix bulk delete * change log to trace * Start fix on multiple secondary storages for a single zone * Fix multiple secondary storages for a single zone * Fix tests * fix log * remove bitmaps when deleting snapshots * minor fixes * update sql to new file * Fix merge issues * Create new snap chain when changing configuration * add verification * Fix snapshot operation selector * fix bitmap removal * fix chain on different storages * address reviews * fix small issue * fix test --------- Co-authored-by: João Jandre <joao@scclouds.com.br>	2025-05-12 10:50:30 -03:00
Pearl Dsilva	1e5d133033	Merge branch '4.20' of https://github.com/apache/cloudstack	2025-05-12 13:12:09 +05:30
Pearl Dsilva	a21f912be3	Merge branch '4.19' of https://github.com/apache/cloudstack into 4.20	2025-05-12 12:41:34 +05:30
Wei Zhou	7e2aa0efe4	engine/schema: create default network offering for vpc tier with conserve_mode=1 for fresh installation (#10744 )	2025-05-09 13:51:43 +05:30
Abhishek Kumar	919c9797cc	server: prevent duplicate HA works and alerts (#10624 ) Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2025-05-06 10:42:30 +02:00
Wei Zhou	fd74895ad0	New feature: Reconcile commands (CopyCommand, MigrateCommand, MigrateVolumeCommand) (#10514 )	2025-05-02 09:15:03 +02:00
Daan Hoogland	d7d9d131b2	Merge branch '4.20'	2025-05-01 15:44:09 +02:00
Suresh Kumar Anaparti	9f229600e6	Add new config (non-dynamic) for agent connections monitor thread, and keep timeunit to secs (in sync with the earlier Wait config) (#10525 )	2025-04-28 15:32:03 +02:00
Pearl Dsilva	2df1ac5106	Merge branch '4.20' of https://github.com/apache/cloudstack	2025-04-28 12:15:48 +05:30
Pearl Dsilva	0785ba046e	Merge branch '4.19' of https://github.com/apache/cloudstack into 4.20	2025-04-28 11:10:08 +05:30
Fabricio Duarte	9d263cd71b	Network Usage event model adjustments (#10755 )	2025-04-26 17:35:28 +02:00
Abhishek Kumar	12c077d704	api,ui: multi arch improvements (#10289 )	2025-04-25 11:02:27 +02:00
Daan Hoogland	3c75d9363b	Merge branch '4.20'	2025-04-17 15:59:41 +02:00
Daan Hoogland	d7765343ef	Merge branch '4.19' into 4.20	2025-04-17 15:40:10 +02:00
Wei Zhou	7b68615bd9	HA: set correct hostId of HA work for vm migration (#10591 )	2025-04-17 10:02:46 +02:00
Fabricio Duarte	ac6b1b382c	Migrate public templates that have URLs on data migration across secondary storages (#10364 ) Co-authored-by: Fabricio Duarte <fabricio.duarte@scclouds.com.br>	2025-04-15 13:48:45 +02:00
Suresh Kumar Anaparti	9dceae4614	MS maintenance improvements (#10417 ) * Update last agents during ms maintenance, and some code improvements * Send 503 (Service Unavailable) response status when maintenance or shutdown is initiated [Any load balancer in the clustered environment can avoid routing requests to this MS node] * Migrate systemvm agents before routing host agents, and some code improvements * Added events for ms maintenance and shutdown operations * Added the following ms maintenance and shutdown improvements - block new agent connections during prepare for maintenance of ms - maintain avoids ms list - propagate updated management servers list and lb algorithm in host and indirect.agent.lb.algorithm settings respectively, to systemvm (non-routing) agents - updated setup ms list and migrate agent connections to executor service - migrate agent connection through executor, and send the answer to the ms host that initiated the migration - re-initialize ssl handshake executor if it is shutdown - don't allow prepare for maintenance or shutdown when other management server nodes are in preparing states - don't allow trigger shutdown when management server is up and other management server nodes are in preparing states - stop agent connections monitor on ms maintenance - update avoid ms list in ready command - updated connected host from the client connection - update last agents in ms metrics from the database - updated some agent config descriptions - update last management server in the hosts during shutdown - added agents and lastagents in management server response - updated management server maintenance & shutdown unit tests - some code improvements * refactored code / addressed comments * removed shutdown testcase (maybe, calling System.exit) * Revert "removed shutdown testcase (maybe, calling System.exit)" This reverts commit e14b0717152ef6c8be102d61c80f42803a53172e. * avoid system.exit during shutdown test * code improvements * testcase fix * Fix cutoff time in agent connections monitor thread	2025-03-19 14:18:05 +05:30
Abhishek Kumar	1c1dad977e	Merge remote-tracking branch 'apache/4.20'	2025-03-06 09:55:27 +05:30
Pearl Dsilva	3aabedd447	UI: Proper explanation for the global setting to avoid ambiguity (#10042 )	2025-03-04 15:07:43 +01:00
Pearl Dsilva	bdae23ed53	Fix listing disk offerings for newly created VMs that haven't yet been started (#10476 )	2025-02-28 10:24:23 -05:00
Pearl Dsilva	3a28a87483	Merge branch '4.20' of https://github.com/apache/cloudstack	2025-02-27 11:20:25 -05:00
Abhishek Kumar	e8ac477e9f	engine/orchestration: fix missing vm powerstate update vm state (#10407 ) * engine/orchestration: fix missing vm powerstate update vm state Fixes #10406 VMs were not moving to Stopped state when PowerReportMissing is processed. Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add unit tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add license Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add lenient Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> --------- Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2025-02-25 15:50:27 +05:30
Daan Hoogland	4a3686297d	Updating pom.xml version numbers for release 4.19.3.0-SNAPSHOT Signed-off-by: Daan Hoogland <daan@onecht.net>	2025-02-25 10:43:11 +01:00
Daan Hoogland	24b7c66251	Merge branch '4.20'	2025-02-24 14:33:12 +01:00
Nicole Schmidt	c80b8860e4	Fix hostId verification on unsuccessful expunge operation (#10418 )	2025-02-20 09:11:53 -05:00
Daan Hoogland	4e321d4356	Updating pom.xml version numbers for release 4.19.2.0 Signed-off-by: Daan Hoogland <daan@onecht.net>	2025-02-20 09:32:07 +01:00
Daan Hoogland	0dcb8da03a	Merge branch '4.20'	2025-02-12 16:54:05 +01:00
Daan Hoogland	4f3e8e8c5a	Merge branch '4.19' into 4.20	2025-02-12 15:00:51 +01:00
Rene Glover	3337f425ff	Primera pure patches & various small fixes (#10132 ) Co-authored-by: GLOVER RENE <rg9975@cs419-mgmtserver.rg9975nprd.app.ecp.att.com> Co-authored-by: Suresh Kumar Anaparti <sureshkumar.anaparti@gmail.com>	2025-02-07 13:19:34 +01:00
Daan Hoogland	2654890e86	Merge branch '4.20'	2025-02-01 21:20:08 +01:00
Daan Hoogland	085bd3bda5	Merge branch '4.19' into 4.20	2025-02-01 17:51:50 +01:00
Abhishek Kumar	0b5a5e8043	api,agent,server,engine-schema: scalability improvements (#9840 ) * api,agent,server,engine-schema: scalability improvements Following changes and improvements have been added: - Improvements in handling of PingRoutingCommand 1. Added global config - `vm.sync.power.state.transitioning`, default value: true, to control syncing of power states for transitioning VMs. This can be set to false to prevent computation of transitioning state VMs. 2. Improved VirtualMachinePowerStateSync to allow power state sync for host VMs in a batch 3. Optimized scanning stalled VMs - Added option to set worker threads for capacity calculation using config - `capacity.calculate.workers` - Added caching framework based on Caffeine in-memory caching library, https://github.com/ben-manes/caffeine - Added caching for account/use role API access with expiration after write can be configured using config - `dynamic.apichecker.cache.period`. If set to zero then there will be no caching. Default is 0. - Added caching for account/use role API access with expiration after write set to 60 seconds. - Added caching for some recurring DB retrievals 1. CapacityManager - listing service offerings - beneficial in host capacity calculation 2. LibvirtServerDiscoverer existing host for the cluster - beneficial for host joins 3. DownloadListener - hypervisors for zone - beneficial for host joins 5. VirtualMachineManagerImpl - VMs in progress- beneficial for processing stalled VMs during PingRoutingCommands - Optimized MS list retrieval for agent connect - Optimize finding ready systemvm template for zone - Database retrieval optimisations - fix and refactor for cases where only IDs or counts are used mainly for hosts and other infra entities. Also similar cases for VMs and other entities related to host concerning background tasks - Changes in agent-agentmanager connection with NIO client-server classes 1. Optimized the use of the executor service 2. Refactore Agent class to better handle connections. 3. Do SSL handshakes within worker threads 5. Added global configs to control the behaviour depending on the infra. SSL handshake could be a bottleneck during agent connections. Configs - `agent.ssl.handshake.min.workers` and `agent.ssl.handshake.max.workers` can be used to control number of new connections management server handles at a time. `agent.ssl.handshake.timeout` can be used to set number of seconds after which SSL handshake times out at MS end. 6. On agent side backoff and sslhandshake timeout can be controlled by agent properties. `backoff.seconds` and `ssl.handshake.timeout` properties can be used. - Improvements in StatsCollection - minimize DB retrievals. - Improvements in DeploymentPlanner allow for the retrieval of only desired host fields and fewer retrievals. - Improvements in hosts connection for a storage pool. Added config - `storage.pool.host.connect.workers` to control the number of worker threads that can be used to connect hosts to a storage pool. Worker thread approach is followed currently only for NFS and ScaleIO pools. - Minor improvements in resource limit calculations wrt DB retrievals Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Rohit Yadav <rohit.yadav@shapeblue.com> * test1, domaindetails, capacitymanager fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * test2 - agent tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * capacitymanagertest fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * change Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix missing changes Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address comments Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * revert marvin/setup.py Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix indent Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * use space in sql Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address duplicate Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * update host logs Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * revert e36c6a5d07 Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix npe in capacity calculation Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * move schema changes to 4.20.1 upgrade Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * build fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * address comments Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * fix build Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add some more tests Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * checkstyle fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * remove unnecessary mocks Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * build fix Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * replace statics Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * engine/orchestration,utils: limit number of concurrent new agent connections Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * refactor - remove unused Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * unregister closed connections, monitor & cleanup Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * add check for outdated vm filter in power sync Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> * agent: synchronize sendRequest wait Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> --------- Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com> Co-authored-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2025-02-01 12:28:41 +05:30
Wei Zhou	fbb1ff78d6	Static Routes: fix check on wrong global configuration (#10066 )	2025-01-31 11:04:13 +01:00
Suresh Kumar Anaparti	3b108b968f	Support for Management Server Maintenance Mode (#9854 ) * Support for Management Server Maintenance - New APIs: prepareForMaintenance and cancelMaintenance, with required parameter - managementserverid. - New management server states for maintenance: PreparingForMaintenance, Maintenance. - listHosts API with optional parameter – managementserverid, to list the hosts connected to the management server. - Support management server maintenance when more than one active management servers available. - Triggers transfer agents to other available management servers for maintenance, new agent command MigrateAgentConnectionCommand to initiate transfer of indirect agents. - New global config 'management.server.maintenance.timeout', to set the timeout (in mins) for the management server maintenance window, default: 60 mins. - UI changes: Prepare and Cancel Maintenance in Management Server section, Connected Agents tab, New fields for hosts and management servers. * Updated pending jobs check timer task with ScheduledExecutorService * keep maintenance state on trigger shutdown call when ms is in maintenance * add pending jobs count to ms response * during ms heartbeat, update state to up only when it's down * allow vm work jobs of async job created before prepare for maintenance * Revert "keep maintenance state on trigger shutdown call when ms is in maintenance" This reverts commit 607e13364679eac897f4d146bb3325ea7a61ba17. * skip maintenance test when multiple management servers are not available, and not configured in host setting for kvm	2025-01-29 13:31:15 +05:30
Daan Hoogland	048649d351	Merge release branch 4.20 to main * 4.20: server: investigate pending HA work when executing in new MS session (#10167) extra null guard (#10264)	2025-01-28 14:34:19 +01:00
Daan Hoogland	717ce981d4	Merge release branch 4.19 to 4.20 * 4.19: extra null guard (#10264)	2025-01-28 14:33:49 +01:00
Abhishek Kumar	33a37da9ec	server: investigate pending HA work when executing in new MS session (#10167 ) For HA work items that are created for host state change, checks must be done when execution is called in a new management server session. A new column, reason, has been added in cloud.op_ha_work table to track the reason for HA work. When HighAvailabilityManager starts it finds and puts all pending HA work items in Investigating state. During execution of the HA work if it is found in investigating state, checks are done to verify if the work is still valid. If the jobs is found to be invalid it is cancelled. Signed-off-by: Abhishek Kumar <abhishek.mrt22@gmail.com>	2025-01-28 14:39:31 +05:30
dahn	f652ad0d98	extra null guard (#10264 )	2025-01-27 14:14:31 +01:00
Daan Hoogland	98f5663954	Merge branch '4.20'	2025-01-24 17:10:43 +01:00
Daan Hoogland	34d2a3bc86	Merge branch '4.19' into 4.20	2025-01-24 17:01:42 +01:00

1 2 3 4 5 ...

1188 Commits