Monday, July 9, 2007

Entry 7

e3@e3-desktop:~$
e3@e3-desktop:~$ ure_condor\\\
>
bash: ure_condor\: command not found
e3@e3-desktop:~$ ssh-l condor gf1.ucs.indiana.edu
bash: ssh-l: command not found
e3@e3-desktop:~$ ssh -l condor gf1.ucs.indiana.edu
The authenticity of host 'gf1.ucs.indiana.edu (156.56.104.81)' can't be established.
RSA key fingerprint is ee:a3:09:37:27:88:c5:df:ea:e8:c3:ae:0f:7c:08:6d.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added 'gf1.ucs.indiana.edu,156.56.104.81' (RSA) to the list of known hosts.
Connection closed by 156.56.104.81
e3@e3-desktop:~$ ure_condor
bash: ure_condor: command not found
e3@e3-desktop:~$ ssh -l condor gf1.ucs.indiana.edu
condor@gf1.ucs.indiana.edu's password:
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$ pico condor_ illianathomas.submit
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$ cp condor_test.sh condor_illianathomas.sh
cp: cannot stat `condor_test.sh': No such file or directory
[condor@gridfarm001 condor]$ pwd
/home/condor
[condor@gridfarm001 condor]$ ls
bleck condor_config cp.worked
condor condor_config.old cp.worked.also
condor-6.7.19 condor-local dead.letter
condor-6.8.3 condor-test junkerrelli.err
condor-6.8.3-linux-x86-rhel3-dynamic.tar.gz condor-test.save mbox
condor_amandabland.submitn cp.sub test
[condor@gridfarm001 condor]$ pico condor_illianathomas.sh
[condor@gridfarm001 condor]$ chmod +x condor_illianathomas.sh
[condor@gridfarm001 condor]$ ./condor_illianathomas.sh
total 111604
-rw-r--r-- 1 condor condor 800 May 24 2006 bleck
drwxr-xr-x 10 condor condor 4096 Jan 11 16:36 condor
drwxr-xr-x 3 condor condor 4096 May 10 2006 condor-6.7.19
drwxr-xr-x 3 condor condor 4096 Jan 4 2007 condor-6.8.3
-rw-r--r-- 1 condor condor 114063659 Jan 10 15:02 condor-6.8.3-linux-x86-rhel3-dynamic.tar.gz
-rw-rw-r-- 1 condor condor 32 Jul 9 14:56 condor_amandabland.submit
-rw-rw-r-- 1 condor condor 1 Jul 9 14:53 condor_amandabland.submitn
lrwxrwxrwx 1 condor condor 37 Jan 11 16:40 condor_config -> /home/condor/condor/etc/condor_config
lrwxrwxrwx 1 condor condor 35 May 23 2006 condor_config.old -> /usr/local/condor/etc/condor_config
-rwxrwxr-x 1 condor condor 34 Jul 9 14:56 condor_illianathomas.sh
-rwxrwxr-x 1 condor condor 63 Jul 9 14:57 condor_jeaime.submit
drwxr-xr-x 5 condor condor 4096 Jan 11 16:39 condor-local
drwxrwxr-x 2 condor condor 4096 Jul 9 14:45 condor-test
-rw------- 1 condor condor 47 Jul 9 14:51 condor-test.save
-rw------- 1 condor condor 3305 May 23 2006 cp.sub
-rw------- 1 condor condor 3305 May 23 2006 cp.worked
-rw------- 1 condor condor 3305 May 23 2006 cp.worked.also
-rw------- 1 condor condor 7345 May 31 2006 dead.letter
-rw-rw-r-- 1 condor condor 0 May 24 2006 junkerrelli.err
-rw------- 1 condor condor 32644 May 24 2006 mbox
-rw-rw-r-- 1 condor condor 0 May 24 2006 test
gridfarm001.ucs.indiana.edu
[condor@gridfarm001 condor]$ pico condor_illianathomas.classad
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$ more condor_illianathomas.classad
Universe = vanilla
Executable = condor_illianathomas.sh
Log = illianathomas.log
Output = illianathomas.out
Error = illianathomas.error
Queue

[condor@gridfarm001 condor]$ condor_submit condor_illianathomas.classad
Submitting job(s)
ERROR: Failed to connect to local queue manager
AUTHENTICATE:1003:Failed to authenticate with any method
AUTHENTICATE:1004:Failed to authenticate using GSI
GSI:5003:Failed to authenticate. Globus is reporting error (851968:41). There is probably a problem with your credentials. (Did you run grid-proxy-init?)
AUTHENTICATE:1004:Failed to authenticate using KERBEROS
AUTHENTICATE:1004:Failed to authenticate using FS
[condor@gridfarm001 condor]$ echo $CONDOR_CONFIG
/home/condor/condor/etc/condor_config
[condor@gridfarm001 condor]$ condor queue
ERROR: unknown command condor_queue
Usage: condor [command] [general-options] [targets]
where [general-options] can be zero or more of:
-help gives this usage information
-version prints the version
-pool hostname use the given central manager to find daemons
where [targets] can be zero or more of:
-all all hosts in your pool (overrides other targets)
hostname given host
given "sinful string"
(for compatibility with other Condor tools, you can also use:)
-name hostname given host
-addr given "sinful string"
(if no targets are specified, the local host is used)

Valid commands are:
off, on, restart, reconfig, reschedule, vacate, checkpoint

Use "condor [command] -help" for more information on a given command.

[condor@gridfarm001 condor]$ condor_queue
-bash: condor_queue: command not found
[condor@gridfarm001 condor]$ condor_q


-- Submitter: gridfarm001.ucs.indiana.edu : <156.56.104.81:2006> : gridfarm001.ucs.indiana.edu
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
19.0 nobody 5/23 18:35 0+00:00:02 X 0 9.8 cp cp.sub cp.worke
20.0 condor 5/23 18:35 0+00:00:01 X 0 9.8 cp cp.sub cp.worke
21.0 condor 5/23 18:37 0+00:00:01 X 0 9.8 cp cp.sub cp.worke
22.0 condor 5/23 18:41 0+00:00:02 X 0 9.8 cp cp.sub cp.worke
23.0 condor 5/23 18:49 0+00:00:01 X 0 9.8 cp /tmp/cp.sub /tm
24.0 condor 5/23 18:50 0+00:00:02 X 0 9.8 cp /home/condor/cp
25.0 condor 5/23 21:44 0+00:00:01 C 0 9.8 ls
27.0 condor 5/23 22:11 0+00:00:01 C 0 9.8 ls
29.0 condor 5/23 22:39 0+00:00:01 C 0 9.8 ls
30.0 condor 5/23 22:51 0+00:00:01 C 0 9.8 ls
31.0 condor 5/23 23:00 0+00:00:01 C 0 9.8 cp /home/condor/cp
32.0 condor 5/23 23:20 0+00:00:02 C 0 9.8 ls /home/condor/cp
33.0 condor 5/24 09:25 0+00:00:01 C 0 9.8 ls
36.0 condor 5/24 09:46 0+00:00:01 C 0 9.8 ls
37.0 condor 5/24 10:23 0+00:00:01 C 0 9.8 ls
39.0 condor 5/24 10:32 0+00:00:01 C 0 9.8 ls /tmp
40.0 condor 5/24 10:41 0+00:00:01 C 0 9.8 ls /tmp
41.0 condor 5/24 10:57 0+00:00:01 C 0 9.8 ls /tmp
42.0 condor 5/24 11:22 0+00:00:01 C 0 9.8 ls /tmp
43.0 gateway 5/24 11:37 0+00:00:01 C 0 9.8 ls /tmp
44.0 lkejrlekj 5/24 11:44 0+00:00:02 C 0 9.8 ls /tmp
45.0 lkejrlekj 5/24 14:48 0+00:00:01 C 0 9.8 ls /tmp
46.0 lkejrlekj 5/24 14:52 0+00:00:01 C 0 9.8 ls /tmp
47.0 lkejrlekj 5/24 15:09 0+00:00:01 C 0 9.8 ls /tmp
48.0 root 5/24 20:45 0+00:00:00 X 0 9.8 ls
49.0 root 5/24 20:47 0+00:00:00 X 0 9.8 ls
50.0 root 5/24 20:48 0+00:00:00 X 0 9.8 ls /tmp/junk1 /tmp
51.0 root 5/24 20:50 0+00:00:00 X 0 9.8 cp /tmp/junk1 /tmp
54.0 root 5/25 09:43 0+00:00:00 X 0 9.8 ls
55.0 root 5/25 09:46 0+00:00:00 X 0 9.8 ls
56.0 root 5/25 09:59 0+00:00:01 X 0 9.8 ls -tlr
57.0 root 5/25 09:59 0+00:00:00 X 0 9.8 ls -tlr
58.0 root 5/25 10:03 0+00:00:00 X 0 9.8 ls
59.0 root 5/25 10:06 0+00:00:00 X 0 9.8 ls
60.0 root 5/25 12:43 0+00:00:00 X 0 9.8 ls
61.0 root 5/25 12:49 0+00:00:00 X 0 9.8 ls
62.0 root 5/25 12:49 0+00:00:01 C 0 9.8 ls
63.0 root 5/25 12:53 0+00:00:01 C 0 9.8 ls
64.0 root 5/31 17:27 0+00:00:01 X 0 9.8 ls -ltr
65.0 root 5/31 17:31 0+00:00:00 X 0 9.8 ls -ltr
66.0 root 5/31 17:43 0+00:12:50 X 0 9.8 ls -ltr
67.0 root 6/9 09:52 0+00:00:00 X 0 9.8 ls
69.0 root 6/9 15:49 0+01:13:16 I 0 9.8 ls
70.0 root 6/9 15:57 0+00:00:00 X 0 9.8 ls
71.0 root 6/9 15:57 0+00:00:00 X 0 9.8 ls
75.0 root 6/9 17:52 0+00:00:01 X 0 9.8 ls
76.0 root 6/9 17:56 0+00:00:01 X 0 9.8 ls
77.0 root 6/9 18:05 0+00:00:01 C 0 9.8 ls
78.0 root 6/21 16:08 0+21:50:35 X 0 9.8 ls
79.0 condor 7/9 15:39 0+00:00:00 I 0 9.8 condor_illianathom
80.0 condor 7/9 15:44 0+00:00:00 I 0 9.8 condor_jeaime.subm
81.0 condor 7/9 15:45 0+00:00:00 I 0 9.8 condor_andrea.subm

4 jobs; 4 idle, 0 running, 0 held
[condor@gridfarm001 condor]$ condor_submit condor_ illianathomas.classad

ERROR: Failed to open command file (No such file or directory)
You have new mail in /var/spool/mail/condor
[condor@gridfarm001 condor]$ condorq
-bash: condorq: command not found
[condor@gridfarm001 condor]$ condor_q


-- Submitter: gridfarm001.ucs.indiana.edu : <156.56.104.81:2006> : gridfarm001.ucs.indiana.edu
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
83.0 condor 7/9 15:56 0+00:00:00 I 0 9.8 condor_andrea.subm

1 jobs; 1 idle, 0 running, 0 held
[condor@gridfarm001 condor]$ ls *illianathomas*
condor_illianathomas.classad condor_illianathomas.sh illianathomas.error illianathomas.log illianathomas.out
[condor@gridfarm001 condor]$ more illianathomas.error
[condor@gridfarm001 condor]$ more illianathomas.out
total 111660
-rw-rw-r-- 1 condor condor 0 Jul 9 15:45 andrea.error
-rw-rw-r-- 1 condor condor 181 Jul 9 15:54 andrea.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:45 andrea.out
-rw-r--r-- 1 condor condor 800 May 24 2006 bleck
drwxr-xr-x 10 condor condor 4096 Jan 11 16:36 condor
drwxr-xr-x 3 condor condor 4096 May 10 2006 condor-6.7.19
drwxr-xr-x 3 condor condor 4096 Jan 4 2007 condor-6.8.3
-rw-r--r-- 1 condor condor 114063659 Jan 10 15:02 condor-6.8.3-linux-x86-rhel3-dynamic.tar.gz
drwxr-xr-x 5 condor condor 4096 Jan 11 16:39 condor-local
drwxrwxr-x 2 condor condor 4096 Jul 9 14:45 condor-test
-rw------- 1 condor condor 47 Jul 9 14:51 condor-test.save
-rw-rw-r-- 1 condor condor 33 Jul 9 14:59 condor-tyrone.submit
-rw-rw-r-- 1 condor condor 133 Jul 9 15:06 condor_amandabland.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 15:01 condor_amandabland.sh
-rw-rw-r-- 1 condor condor 1 Jul 9 14:53 condor_amandabland.submitn
-rw-rw-r-- 1 condor condor 119 Jul 9 15:14 condor_andrea.classad
-rwxrwxr-x 1 condor condor 16 Jul 9 14:58 condor_andrea.submmit
-rw-rw-r-- 1 condor condor 135 Jul 9 15:07 condor_camden.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 14:58 condor_camden.submit
lrwxrwxrwx 1 condor condor 37 Jan 11 16:40 condor_config -> /home/condor/condor/etc/condor_config
lrwxrwxrwx 1 condor condor 35 May 23 2006 condor_config.old -> /usr/local/condor/etc/condor_config
-rw-rw-r-- 1 condor condor 142 Jul 9 15:06 condor_illianathomas.classad
-rwxrwxr-x 1 condor condor 34 Jul 9 14:56 condor_illianathomas.sh
-rw-rw-r-- 1 condor condor 156 Jul 9 15:10 condor_jeaime.classadd
-rwxrwxr-x 1 condor condor 50 Jul 9 15:09 condor_jeaime.submit
-rw-rw-r-- 1 condor condor 0 Jul 9 15:44 condor_jeaime_error.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:44 condor_jeiame_out.out
-rw-rw-r-- 1 condor condor 125 Jul 9 15:09 condor_tyrone.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 15:01 condor_tyrone.submit
-rw------- 1 condor condor 3305 May 23 2006 cp.sub
-rw------- 1 condor condor 3305 May 23 2006 cp.worked
-rw------- 1 condor condor 3305 May 23 2006 cp.worked.also
-rw------- 1 condor condor 7345 May 31 2006 dead.letter
-rw-rw-r-- 1 condor condor 0 Jul 9 15:56 illianathomas.error
-rw-rw-r-- 1 condor condor 346 Jul 9 15:56 illianathomas.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:56 illianathomas.out
-rw-rw-r-- 1 condor condor 181 Jul 9 15:54 jeaime_log.log
-rw-rw-r-- 1 condor condor 0 May 24 2006 junkerrelli.err
-rw------- 1 condor condor 32644 May 24 2006 mbox
-rw-rw-r-- 1 condor condor 1794 Jul 9 15:00 mylist.txt
-rw-rw-r-- 1 condor condor 0 May 24 2006 test
gridfarm001.ucs.indiana.edu
[condor@gridfarm001 condor]$

Entry 6

Condor is running on a cluster of machines. Not just ClassAd, but instead many jobs

Basic principles: there are a bunch of command lines ( i.e.- condor_q, condor_submit)

you can get and idea on how it is running

create a job property lines (i.e.- classad)

to take this further: there is an incremental job id with a .0 at the end. the first number is the cluster number. to do things in a sequence you can express that in your job file with multiple queue statements with different .1, .2, .3

why do this? there are parameter sweep studies (problems in computational science) if you want to know all the values no an x axis you can use condor to figure this out.

END OF CONDOR FOR TODAY

next we will upload Linux and Fedora (other 3 cpus) (Open Source Red Hat)

Entry 5

the way condors numbering scheme is set up we have a number system like 83.0....83 is the job NUMBER and .1 or .2 would be a job that falls under it.

Entry 4

ls means LIST---duh illiana lol




OUR JOBS FINALLY WORKED!!!!

4 jobs; 4 idle, 0 running, 0 held
[condor@gridfarm001 condor]$ condor_submit condor_ illianathomas.classad

ERROR: Failed to open command file (No such file or directory)
You have new mail in /var/spool/mail/condor
[condor@gridfarm001 condor]$ condorq
-bash: condorq: command not found
[condor@gridfarm001 condor]$ condor_q


-- Submitter: gridfarm001.ucs.indiana.edu : <156.56.104.81:2006> : gridfarm001.ucs.indiana.edu
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
83.0 condor 7/9 15:56 0+00:00:00 I 0 9.8 condor_andrea.subm

1 jobs; 1 idle, 0 running, 0 held
[condor@gridfarm001 condor]$ ls *illianathomas*
condor_illianathomas.classad condor_illianathomas.sh illianathomas.error illianathomas.log illianathomas.out
[condor@gridfarm001 condor]$ more illianathomas.error
[condor@gridfarm001 condor]$ more illianathomas.out
total 111660
-rw-rw-r-- 1 condor condor 0 Jul 9 15:45 andrea.error
-rw-rw-r-- 1 condor condor 181 Jul 9 15:54 andrea.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:45 andrea.out
-rw-r--r-- 1 condor condor 800 May 24 2006 bleck
drwxr-xr-x 10 condor condor 4096 Jan 11 16:36 condor
drwxr-xr-x 3 condor condor 4096 May 10 2006 condor-6.7.19
drwxr-xr-x 3 condor condor 4096 Jan 4 2007 condor-6.8.3
-rw-r--r-- 1 condor condor 114063659 Jan 10 15:02 condor-6.8.3-linux-x86-rhel3-dynamic.tar.gz
drwxr-xr-x 5 condor condor 4096 Jan 11 16:39 condor-local
drwxrwxr-x 2 condor condor 4096 Jul 9 14:45 condor-test
-rw------- 1 condor condor 47 Jul 9 14:51 condor-test.save
-rw-rw-r-- 1 condor condor 33 Jul 9 14:59 condor-tyrone.submit
-rw-rw-r-- 1 condor condor 133 Jul 9 15:06 condor_amandabland.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 15:01 condor_amandabland.sh
-rw-rw-r-- 1 condor condor 1 Jul 9 14:53 condor_amandabland.submitn
-rw-rw-r-- 1 condor condor 119 Jul 9 15:14 condor_andrea.classad
-rwxrwxr-x 1 condor condor 16 Jul 9 14:58 condor_andrea.submmit
-rw-rw-r-- 1 condor condor 135 Jul 9 15:07 condor_camden.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 14:58 condor_camden.submit
lrwxrwxrwx 1 condor condor 37 Jan 11 16:40 condor_config -> /home/condor/condor/etc/condor_config
lrwxrwxrwx 1 condor condor 35 May 23 2006 condor_config.old -> /usr/local/condor/etc/condor_config
-rw-rw-r-- 1 condor condor 142 Jul 9 15:06 condor_illianathomas.classad
-rwxrwxr-x 1 condor condor 34 Jul 9 14:56 condor_illianathomas.sh
-rw-rw-r-- 1 condor condor 156 Jul 9 15:10 condor_jeaime.classadd
-rwxrwxr-x 1 condor condor 50 Jul 9 15:09 condor_jeaime.submit
-rw-rw-r-- 1 condor condor 0 Jul 9 15:44 condor_jeaime_error.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:44 condor_jeiame_out.out
-rw-rw-r-- 1 condor condor 125 Jul 9 15:09 condor_tyrone.classad
-rwxrwxr-x 1 condor condor 33 Jul 9 15:01 condor_tyrone.submit
-rw------- 1 condor condor 3305 May 23 2006 cp.sub
-rw------- 1 condor condor 3305 May 23 2006 cp.worked
-rw------- 1 condor condor 3305 May 23 2006 cp.worked.also
-rw------- 1 condor condor 7345 May 31 2006 dead.letter
-rw-rw-r-- 1 condor condor 0 Jul 9 15:56 illianathomas.error
-rw-rw-r-- 1 condor condor 346 Jul 9 15:56 illianathomas.log
-rw-rw-r-- 1 condor condor 0 Jul 9 15:56 illianathomas.out
-rw-rw-r-- 1 condor condor 181 Jul 9 15:54 jeaime_log.log
-rw-rw-r-- 1 condor condor 0 May 24 2006 junkerrelli.err
-rw------- 1 condor condor 32644 May 24 2006 mbox
-rw-rw-r-- 1 condor condor 1794 Jul 9 15:00 mylist.txt
-rw-rw-r-- 1 condor condor 0 May 24 2006 test
gridfarm001.ucs.indiana.edu
[condor@gridfarm001 condor]$

Entry 3

Marlon Pierce's ---- http://communitygrids.blogspot.com/feeds/posts/default




the first time it failed because we had two incompatible versions of condor running.

[condor@gridfarm001 condor]$ pico condor_illianathomas.classad
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$ more condor_illianathomas.classad
Universe = vanilla
Executable = condor_illianathomas.sh
Log = illianathomas.log
Output = illianathomas.out
Error = illianathomas.error
Queue

[condor@gridfarm001 condor]$ condor_submit condor_illianathomas.classad
Submitting job(s)
ERROR: Failed to connect to local queue manager
AUTHENTICATE:1003:Failed to authenticate with any method
AUTHENTICATE:1004:Failed to authenticate using GSI
GSI:5003:Failed to authenticate. Globus is reporting error (851968:41). There is probably a problem with your credentials. (Did you run grid-proxy-init?)
AUTHENTICATE:1004:Failed to authenticate using KERBEROS
AUTHENTICATE:1004:Failed to authenticate using FS
[condor@gridfarm001 condor]$ echo $CONDOR_CONFIG
/home/condor/condor/etc/condor_config
[condor@gridfarm001 condor]$

Entry 2

3.2 Submitting a Job {http://www.dma.unina.it/~murli/ISSGC06/condor/public_html/submit_first.html}
-Creating a Classad

Now that you have a job, you just have to tell Condor to run it. Put the following text into a file called submit:

[condor@gridfarm001 condor]$ pico condor_illianathomas.classad
[condor@gridfarm001 condor]$ more condor_illianathomas.classad
Universe = vanilla
Executable = condor_illianathomas.sh
Log = illianathomas.log
Output = illianathomas.out
Error = illianathomas.error
Queue


Entry 1

This is the first blog entry

The ssh connected us to the condor at Indiana University. All of our commands are not happening on our computers here at ECSU but instead in a computer in Indiana. Why are we doing this? Because Unbuntu was being a pain and did not want to agree with the condor over here at ECSU. This is why we downloaded another version of Linux. There are many different flavors of Linux that do several different jobs.

Terminal Entry:

e3@e3-desktop:~$
e3@e3-desktop:~$ ure_condor\\\
>
bash: ure_condor\: command not found
e3@e3-desktop:~$ ssh-l condor gf1.ucs.indiana.edu
bash: ssh-l: command not found
e3@e3-desktop:~$ ssh -l condor gf1.ucs.indiana.edu
The authenticity of host 'gf1.ucs.indiana.edu (156.56.104.81)' can't be established.
RSA key fingerprint is ee:a3:09:37:27:88:c5:df:ea:e8:c3:ae:0f:7c:08:6d.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added 'gf1.ucs.indiana.edu,156.56.104.81' (RSA) to the list of known hosts.
Connection closed by 156.56.104.81
e3@e3-desktop:~$ ure_condor
bash: ure_condor: command not found
e3@e3-desktop:~$ ssh -l condor gf1.ucs.indiana.edu
condor@gf1.ucs.indiana.edu's password:
[condor@gridfarm001 condor]$
[condor@gridfarm001 condor]$