Idhal virtual cluster prototype
From WikiOAR
Contents |
Mechanism
Secured update procedure
At the very first boot, a virtual node gets the server GPG public key and never get it again. Then, periodically (at boot and from cron), the virtual node makes an http query to get an update script from the server. The server signs it and the node checks the signature of the script before executing it locally:
The update script
The script looks like:
UPDATES="/var/lib/idhal-updates" # Update #000 if [ ! -f $UPDATES/000 ] then # # do update 000 # touch $UPDATES/000 fi # Update #001 if [ ! -f $UPDATES/001 ] then # # do update 001 # touch $UPDATES/001 fi # and so on... -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) iD8DBQFL6V1yWciA6gGguMwRAqU+AJ93IOh+lxAsuYFUUVlrsM6HGorUJACff4ko s/bE76z4S4MzZZ1EkN+UyGk= =wfbT -----END PGP SIGNATURE-----
The key is that an update (a scriptlet) is executed only once.
The first update sets up the tunnel
Update
if [ ! -f $UPDATES/000 ] then wget -q -O /tmp/gen_certif http://129.88.70.253:8080/cgi-bin/gen_certif?idhaltype=vnode || exit 1 chmod 700 /tmp/gen_certif /tmp/gen_certif || exit 1 rm -f /tmp/gen_certif /etc/init.d/openvpn stop /etc/init.d/openvpn start sleep 10 touch $UPDATES/000 fi
gen_certif cgi-bin script
This script generates a unique ID and a certificate for the virtual node. Then, it outputs a bash script that makes an Openvpn configuration and copies the certificate.
#!/usr/bin/ruby
require 'fileutils'
require "cgi"
cgi = CGI.new
idhaltype = cgi['idhaltype']
EASY_RSA_DIR="/etc/openvpn/easy-rsa"
def get_next_id
id=0
keys_dir=Dir.new(EASY_RSA_DIR + "/keys")
keys_dir.each do |file|
if client_id=file.scan(/client(\d+)\.crt/)[0]
if client_id[0].to_i > id
id=client_id[0].to_i
end
end
end
return id+1
end
Dir::chdir(EASY_RSA_DIR)
id=get_next_id
system(". ./vars > /dev/null && ./pkitool client#{id} > /dev/null 2>&1")
f = File.new("#{EASY_RSA_DIR}/keys/client#{id}.type", "w")
f.puts idhaltype
f.close
puts "Content-type: text/plain"
puts
puts "#!/bin/bash"
puts "if [ -d /etc/openvpn ] ; then"
puts "cat > /etc/openvpn/vpns.conf << EOF
client
dev tun
proto tcp
remote 129.88.70.253 4242
ca ca.crt
cert client.crt
key client.key
keepalive 10 60
cipher none
up /etc/openvpn/update-resolv-conf
down /etc/openvpn/update-resolv-conf
EOF"
puts "cat > /etc/openvpn/client.crt << EOF"
IO.foreach("#{EASY_RSA_DIR}/keys/client#{id}.crt") { |line| puts line }
puts "EOF"
puts "cat > /etc/openvpn/client.key << EOF"
IO.foreach("#{EASY_RSA_DIR}/keys/client#{id}.key") { |line| puts line }
puts "EOF"
puts "chmod 600 /etc/openvpn/client.key"
puts "fi"
The VPN server side configuration
Openvpn
port 4242 proto tcp dev tun0 ca ca.crt cert server.crt key server.key dh dh1024.pem server 10.134.0.0 255.255.0.0 #ifconfig-pool-persist ipp.txt keepalive 10 120 max-clients 1000 persist-key persist-tun status /var/log/openvpn-status.log # Script where we create/activate OAR nodes client-connect /etc/openvpn/client-up.bash client-disconnect /etc/openvpn/client-down.bash # G5K Routes pushing push "route 129.88.70.0 255.255.255.192" [...] # DNS pushing push "dhcp-option DNS 129.88.70.61" push "dhcp-option DNS grenoble.grid5000.fr"
DNS
Every virtual ip address is statically declared into DNS like this example:
vnode-7-38 A 10.134.7.38
OAR server
Resources are added automatically when new tunnels come up. Resources are disabled (but not removed) when tunel goes down. This is made inside the client-up/down openvpn scripts:
- /etc/openvpn/client-up.bash:
#!/bin/bash
set -e
NODE_NAME=`host -t A $ifconfig_pool_remote_ip |awk -F": " '{if ($1=="Name") print $2}'`
CLIENT_NAME=`host -t A $trusted_ip |awk -F": " '{if ($1=="Name") print $2}'`
NODE=`oarnodes --sql "ip='$ifconfig_pool_remote_ip'"`
if [ "$NODE" = "" ]
then
oarnodesetting -a -h "$NODE_NAME" -p "ip=$ifconfig_pool_remote_ip"
fi
HIDHAL_TYPE=`cat /etc/openvpn/easy-rsa/keys/$common_name.type 2>/dev/null || true`
if [ "$HIDHAL_TYPE" != "" ]
then
oarnodesetting -p "idhaltype=$HIDHAL_TYPE" \
-p "cluster=$HIDHAL_TYPE" -h "$NODE_NAME"
fi
oarnodesetting -p "idhalcn=$common_name" \
-p "idhalclientip=$trusted_ip" \
-p "idhalconnectedsince=`date`" \
-p "idhalclientname=$CLIENT_NAME" \
-h $NODE_NAME
echo "/usr/local/sbin/send_root_key.sh $NODE_NAME" | at now + 1 minute
oarnodesetting -s Alive -h "$NODE_NAME"
- /etc/openvpn/client-down.bash
#!/bin/bash
set -e
NODE_NAME=`host -t A $ifconfig_pool_remote_ip |awk -F": " '{if ($1=="Name") print $2}'` >> /tmp/down
oarnodesetting -s Absent -h "$NODE_NAME" >> /tmp/down
echo "$NODE_NAME" >> /tmp/down
Here is a sample resource:
168
network_address : vnode-2-98.grenoble.grid5000.fr
properties : besteffort=YES,cluster=vnode,cpuset=0,deploy=NO,desktop_computing=NO,idhalclientip=152.77.57.112,idhalclientname=browalle.ujf-grenoble.fr,idhalcn=client317,idhalconnectedsince=Thu Jul 3 20:32:35 CEST 2008,idhaltype=vnode,ip=10.134.2.98,network_address=vnode-2-98.grenoble.grid5000.fr,type=default
state : Absent
Caveats
The main problem with this solution is that every communication between the nodes is made through the VPN. Actually, we want that the nodes that are on a same LAN or routed network, use their local interface to communicate directly. The solution might be on DNS side...

