cloudstack

mirror of https://github.com/apache/cloudstack.git synced 2025-12-21 13:03:10 +01:00

Author	SHA1	Message	Date
Rohit Yadav	212e5ccfa7	CLOUDSTACK-9782: Host HA and KVM HA provider Host-HA offers investigation, fencing and recovery mechanisms for hosts that for any reason are malfunctioning. It uses Activity and Health checks to determine current host state based on which it may degrade a host or try to recover it. On failing to recover it, it may try to fence the host. The core feature is implemented in a hypervisor agnostic way, with two separate implementations of the driver/provider for Simulator and KVM hypervisors. The framework also allows for implementation of other hypervisor specific providers in future. The Host-HA provider implementation for KVM hypervisor uses the out-of-band management sub-system to issue IPMI calls to reset (recover) or poweroff (fence) a host. The Host-HA provider implementation for Simulator provides a means of testing and validating the core framework implementation. Signed-off-by: Abhinandan Prateek <abhinandan.prateek@shapeblue.com> Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2017-08-30 18:06:48 +02:00
Daan Hoogland	70ef0788c9	CLOUDSTACK-9408: Fix download urls in sql and scripts This fixes the agreed upon url on download.cloudstack.org in various sql files and misc scripts. Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2017-04-20 12:33:33 +05:30
Sverrir Berg	751d3552dc	patchviasocket improve error handling more detailed error if host file not found or cannot be opened using mkstemp and mkdtemp for improved security improve resource cleanup in error conditions in unit test	2016-05-20 15:42:34 +00:00
Sverrir A. Berg	0acd3c12a2	Convert patchviasocket to python (removes perl dependency for KVM agent) As requested here: https://github.com/apache/cloudstack/pull/1495 No scripts are using perl so that install requirement can be removed. The new scripts are using standard python packages only. Includes extensive unit test.	2016-05-20 15:42:34 +00:00
Remi Bergsma	87fdb521f0	CLOUDSTACK-8443: don't try to fix co-mounted cgroups This setting works on CentOS 6 / RHEL 6 but does nothing, as "cpu" cgroup is not mounted. On CentOS 7 / RHEL 7 systemd does mount cgroups and "cpu" is co-mounted with "cpuacct". Hence, if we specify "cpu" then this results in an error because it can only use them both, or none. By removing the setting, we rely on the default of qemu, which is: cgroup_controllers = ["cpu", "devices", "memory", "blkio", "cpuacct", "net_cls"] Only if they are really mounted, they will be used. So, this will work on both version 6 and 7. The 'fix script' didn't work well, as after a reboot you'd still have qemu throwing errors. Now we can handle the co-mounted cgroups.	2015-08-24 15:49:40 +02:00
Remi Bergsma	d1cb4c7d50	RHEL 7 and CentOS 7 need the same fix	2015-08-19 16:30:24 +02:00
Remi Bergsma	14013d5d1b	fixing white space and formatting	2015-08-19 16:24:44 +02:00
Remi Bergsma	7bce656b40	make sure sync cannot block reboot The recent discussed improvement has the risk that if 'sync' hangs, the reboot may be delayed in the same way as the 'reboot' command would do. To work around, we're adding a 5 second timeout. If it cannot sync in 5 seconds, it will not succeed anyway and we should proceed the reset. @snuf: Could we use your OVM3 heartbeat script for other hypervisors as well? One way to do it seems like a nice idea :-)	2015-04-09 12:18:21 +02:00
Remi Bergsma	c59308b0ee	write logfile just before rebooting the host As discussed with @wido @pyr and @nuxro added an extra log line. Tested it and it logs fine (tested to local disk) when syncing first: Apr 3 15:31:23 mcctest2 heartbeat: kvmheartbeat.sh system because it was unable to write the heartbeat to the storage By the way, it did also log to the agent.log but this extra log has the benefit of ending up in the system log so you'll probably find it easier there. Existing logs: 2015-04-03 15:27:23,943 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout, retry: 0 2015-04-03 15:28:23,944 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout, retry: 1 2015-04-03 15:29:23,946 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout, retry: 2 2015-04-03 15:30:23,948 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout, retry: 3 2015-04-03 15:31:23,950 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout, retry: 4 2015-04-03 15:31:23,950 WARN [kvm.resource.KVMHAMonitor] (Thread-24:null) write heartbeat failed: timeout; reboot the host This closes #145 Signed-off-by: Rohit Yadav <rohit.yadav@shapeblue.com>	2015-04-04 14:17:37 +05:30
Remi Bergsma	2b41f98346	reboot much faster in case of storage failure When storage cannot be reached, it does not make sense to reboot as it will try to flush buffers, umount NFS mounts, etc. This will not work and thus cause a long delay. With this change, the box will reboot immediately (like pressing the reset button).	2015-04-01 19:45:16 +02:00
Kishan Kavala	4f3de024de	Add script to ensure cgroups are not co-mounted in rhel7/lxc. If required, script will unmount co-mounted cgroups and remount them separately	2014-09-11 14:34:40 +05:30
tuna	c7dab82dc4	move cloudstack_pluginlib	2013-12-09 23:33:15 +07:00
tuna	3df8b912fc	add kvm support & LB service	2013-12-09 23:33:14 +07:00
Sheng Yang	83c13fcf27	CLOUDSTACK-2614: Fix the permission of patchviasocket.pl It's non-executable now, which cause trouble on deb package.	2013-05-29 14:24:49 -07:00
Marcus Sorensen	f66b9b570f	Send only \n rather than \r\n to agent socket when sending cmdline to system VMS BUG-ID: CLOUDSTACK-1732 Signed-off-by: Marcus Sorensen <marcus@betterservers.com> 1365622030 -0600	2013-04-10 13:27:10 -06:00
Marcus Sorensen	9ad54a082c	Summary: KVM - use virtio socket to communicate config to system vms Detail: This gets rid of the patchdisk method of passing cmdline and authorized_keys to KVM system VMs. It instead passes them to a virtio socket, which the KVM guest reads from the character device /dev/vport0p1 during cloud-early-config. Tested to work on CentOS 6.3 and Ubuntu 12.04. Should work with even older versions of libvirt. Signed-off-by: Marcus Sorensen <marcus@betterservers.com> 1362691685 -0700	2013-03-07 14:28:05 -07:00
Marcus Sorensen	ae2b673e03	Summary: fix cloud-agent KVM calls to cloudstack-agent for centos6.3 BUG-ID: CLOUDSTACK-1234 Bugfix-for: 4.1, master Reported-by: Rayees Namathponnan Signed-off-by: Marcus Sorensen <marcus@betterservers.com> 1360736064 -0700	2013-02-12 23:16:22 -07:00
Edison Su	189c20b665	umount loop device. If patch disk created from loop, then need to umount it in loop device, otherwise, you can only create 7 routers on one kvm host	2013-01-18 14:27:40 -08:00
Edison Su	f497c7c031	Bug: HA takes a lot of time to migrate VMs (trigger HA) to another KVM host if there are multiple storage pools in a cluster. The issue is as follows: 1. When CloudStack detects that a host is not responding to ping requests it'll send a fence command for this host to another host in the cluster. 2. The agent takes a long time to respond to this check if the storage is fenced. This is because the agent checks if the first host is writing to its heartbeat file on all pools in the cluster. It is doing this in a sequential manner on all storage pool. Making a fix to get rid of sleep, wait during HA. The behavior is now similar to Xenserver. RB: https://reviews.apache.org/r/6133/ Send-by:devdeep.singh@citrix.com	2012-07-25 10:17:09 -07:00
David Nalley	d630fa8697	license header changes for scripts folder from Chip Childers	2012-06-23 00:58:00 -04:00
frank	2f634c0913	Switch to Apache license	2012-04-03 04:50:05 -07:00
Sheng Yang	edcf95a6f0	Change KVM's delimiter from "," to "%" In order not to confuse with commandline parameters	2012-03-13 15:59:09 -07:00
frank	52610ffcb3	add copyright header to shell scripts	2012-01-11 18:41:53 -08:00
Edison Su	6841e265d3	bug 10330: finally merge Rommer's CLVM patch status 10330: resolved fixed	2011-11-02 18:03:58 -07:00
alena	8a7feb8ec1	Merge branch '2.2.y' Conflicts: agent/src/com/cloud/agent/resource/computing/LibvirtComputingResource.java api/src/com/cloud/agent/api/routing/LoadBalancerConfigCommand.java api/src/com/cloud/agent/api/to/FirewallRuleTO.java api/src/com/cloud/agent/api/to/IpAddressTO.java api/src/com/cloud/agent/api/to/PortForwardingRuleTO.java api/src/com/cloud/api/ApiConstants.java api/src/com/cloud/api/BaseCmd.java api/src/com/cloud/api/ResponseGenerator.java api/src/com/cloud/api/commands/CreateFirewallRuleCmd.java api/src/com/cloud/api/commands/CreateIpForwardingRuleCmd.java api/src/com/cloud/api/commands/CreateLoadBalancerRuleCmd.java api/src/com/cloud/api/commands/CreatePortForwardingRuleCmd.java api/src/com/cloud/api/commands/DeleteLoadBalancerRuleCmd.java api/src/com/cloud/api/commands/ListCapabilitiesCmd.java api/src/com/cloud/api/commands/UpdateNetworkCmd.java api/src/com/cloud/api/response/CapabilitiesResponse.java api/src/com/cloud/network/Network.java api/src/com/cloud/network/NetworkService.java api/src/com/cloud/network/firewall/FirewallService.java api/src/com/cloud/network/lb/LoadBalancingRule.java api/src/com/cloud/network/lb/LoadBalancingRulesService.java api/src/com/cloud/network/rules/FirewallRule.java api/src/com/cloud/network/rules/RulesService.java api/src/com/cloud/offering/NetworkOffering.java client/tomcatconf/commands.properties.in cloud.spec core/src/com/cloud/agent/resource/virtualnetwork/VirtualRoutingResource.java core/src/com/cloud/hypervisor/xen/resource/CitrixHelper.java core/src/com/cloud/hypervisor/xen/resource/CitrixResourceBase.java core/src/com/cloud/storage/template/DownloadManagerImpl.java core/src/com/cloud/vm/DomainRouterVO.java debian/cloud-deps.install patches/systemvm/debian/config/etc/init.d/cloud-early-config patches/systemvm/debian/config/root/ipassoc.sh patches/systemvm/debian/config/root/loadbalancer.sh scripts/vm/hypervisor/kvm/rundomrpre.sh scripts/vm/hypervisor/xenserver/vmops 
server/src/com/cloud/agent/manager/AgentAttache.java server/src/com/cloud/agent/manager/AgentManagerImpl.java server/src/com/cloud/agent/manager/AgentMonitor.java server/src/com/cloud/agent/manager/ClusteredAgentManagerImpl.java server/src/com/cloud/alert/ClusterAlertAdapter.java server/src/com/cloud/api/ApiResponseHelper.java server/src/com/cloud/api/ApiServer.java server/src/com/cloud/cluster/ClusterManagerImpl.java server/src/com/cloud/configuration/Config.java server/src/com/cloud/configuration/ConfigurationManager.java server/src/com/cloud/configuration/ConfigurationManagerImpl.java server/src/com/cloud/configuration/DefaultComponentLibrary.java server/src/com/cloud/deploy/FirstFitPlanner.java server/src/com/cloud/ha/HighAvailabilityManagerImpl.java server/src/com/cloud/host/dao/HostDaoImpl.java server/src/com/cloud/hypervisor/xen/discoverer/XcpServerDiscoverer.java server/src/com/cloud/network/LoadBalancerVO.java server/src/com/cloud/network/NetworkManager.java server/src/com/cloud/network/NetworkManagerImpl.java server/src/com/cloud/network/dao/FirewallRulesDao.java server/src/com/cloud/network/dao/FirewallRulesDaoImpl.java server/src/com/cloud/network/element/DhcpElement.java server/src/com/cloud/network/element/VirtualRouterElement.java server/src/com/cloud/network/firewall/FirewallManagerImpl.java server/src/com/cloud/network/lb/LoadBalancingRulesManagerImpl.java server/src/com/cloud/network/router/VirtualNetworkApplianceManager.java server/src/com/cloud/network/router/VirtualNetworkApplianceManagerImpl.java server/src/com/cloud/network/rules/FirewallManager.java server/src/com/cloud/network/rules/FirewallRuleVO.java server/src/com/cloud/network/rules/PortForwardingRuleVO.java server/src/com/cloud/network/rules/RulesManagerImpl.java server/src/com/cloud/network/rules/StaticNatRuleImpl.java server/src/com/cloud/network/security/SecurityGroupListener.java server/src/com/cloud/network/security/SecurityGroupManagerImpl.java 
server/src/com/cloud/offerings/NetworkOfferingVO.java server/src/com/cloud/server/ConfigurationServerImpl.java server/src/com/cloud/server/ManagementServerImpl.java server/src/com/cloud/storage/StorageManager.java server/src/com/cloud/storage/StorageManagerImpl.java server/src/com/cloud/storage/dao/VMTemplateHostDaoImpl.java server/src/com/cloud/storage/download/DownloadMonitorImpl.java server/src/com/cloud/upgrade/DatabaseUpgradeChecker.java server/src/com/cloud/upgrade/dao/Upgrade228to229.java server/src/com/cloud/upgrade/dao/Upgrade229to2210.java server/src/com/cloud/user/AccountManagerImpl.java server/src/com/cloud/vm/UserVmManagerImpl.java server/src/com/cloud/vm/VirtualMachineManagerImpl.java server/src/com/cloud/vm/dao/DomainRouterDao.java server/src/com/cloud/vm/dao/DomainRouterDaoImpl.java setup/db/create-index-fk.sql setup/db/create-schema.sql setup/db/db/schema-222to224.sql setup/db/db/schema-227to228.sql setup/db/db/schema-228to229.sql setup/db/db/schema-229to2210.sql tools/testClient/README ui/scripts/cloud.core.instance.js utils/src/com/cloud/utils/SerialVersionUID.java utils/src/com/cloud/utils/db/ConnectionConcierge.java utils/src/com/cloud/utils/db/Merovingian2.java utils/src/com/cloud/utils/db/Transaction.java utils/src/com/cloud/utils/nio/Link.java utils/src/com/cloud/utils/nio/NioConnection.java utils/src/com/cloud/utils/time/InaccurateClock.java	2011-08-22 20:28:30 -07:00
Edison Su	0bc6026d6b	fix description of addcluster api, and a typo in one script	2011-08-01 14:28:39 -07:00
Edison Su	31549b337e	bug 10622: use /root/.ssh/id_rsa.cloud instead of ~/.ssh/id_rsa.cloud status 10622: resolved fixed	2011-07-29 17:06:44 -07:00
Edison Su	a3da10e9b8	bug 10622: use /root/.ssh/id_rsa.cloud instead of ~/.ssh/id_rsa.cloud status 10622: resolved fixed	2011-07-13 17:49:43 -07:00
Edison Su	47380dc20e	fix add host	2011-05-12 15:03:15 -04:00
David Nalley	74fcb0d06e	bug:9702 - fixing ssh key path	2011-05-03 14:21:44 -04:00
Edison Su	d8ee7d9fc3	if storage network disconnected, reboot the host	2011-04-14 17:46:54 -04:00
Frank	92155522f2	Add license header to files	2011-04-14 11:23:14 -07:00
edison	5c73e46113	bug 7296: add kvm.public.network.device and kvm.private.network.device from UI status 7296: resolved fixed	2010-12-04 20:01:10 -08:00
edison	007783f6cf	add more logs when taking heartbeat, and make ha enabled even in oss	2010-11-10 09:49:03 -08:00
edison	4bc63e5c32	Enable KVM HA on nfs storage	2010-11-09 22:03:22 -08:00
edison	139ff273ff	fix systemvm template for kvm, using chiradeep's latest debian template	2010-10-05 20:41:39 -07:00
edison	d933f19d35	bug 5800: add cluster for KVM TODO: need to make sure the host cpu is from the same vendor in a cluster	2010-09-10 16:00:36 -07:00
edison	867b49edb6	Don't install console proxy agent on agent	2010-08-25 15:26:33 -07:00
edison	53df26daaa	"Add host" for kvm: The sequence: 1. add host in UI 2. scp setup_agent.sh to agent host, and execute it. This script receives hostip, zoneid, podid and guid, then runs "cloud-setup-agent" and "cloud-setup-console-proxy". Here, we assume that network/hostname and cloud-agent are already configured and installed. 3. Write a dummy kvm resource into the database, then wait for the agent to connect to the server, by polling the database every minute. If it finds the agent is in UP state in database, then return, or wait for at least 10 minutes.	2010-08-13 23:19:36 -07:00
edison	dc005b9b9c	add setup_agent.sh, the first script running during add host Make cloud-setup-agent/console-proxy in unattended mode	2010-08-11 19:31:07 -07:00
Manuel Amador (Rudd-O)	05c020e1f6	Source code committed	2010-08-11 09:13:29 -07:00

41 Commits