<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Pile of Dross &#187; Work</title>
	<atom:link href="http://www.drossy.net/blog/category/work/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.drossy.net/blog</link>
	<description>Semi-literate ramblings of Derek Ross</description>
	<lastBuildDate>Tue, 16 Dec 2008 22:21:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>21/02/06 &#8211; 07/03/06</title>
		<link>http://www.drossy.net/blog/2006/03/07/210206-070306/</link>
		<comments>http://www.drossy.net/blog/2006/03/07/210206-070306/#comments</comments>
		<pubDate>Tue, 07 Mar 2006 23:09:54 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/183</guid>
		<description><![CDATA[7th March T1 &#8211; multiple T2 transfer, GS not around &#8211; only IC and QMUL doing transfers &#8211; initiated transfers to OX, GLA, BHAM and MAN myself at 10am, stopped at noon ESC-Services Network Operations meeting 6th March Monday Morning Ops Meeting SC phone conference 3rd March RC report Tried to get replication status information [...]]]></description>
			<content:encoded><![CDATA[<p>7th March</p>
<p>T1 &#8211; multiple T2 transfer, GS not around &#8211; only IC and QMUL doing transfers &#8211; initiated transfers to OX, GLA, BHAM and MAN myself at 10am, stopped at noon<br />
ESC-Services Network Operations meeting</p>
<p>6th March</p>
<p>Monday Morning Ops Meeting<br />
SC phone conference</p>
<p>3rd March</p>
<p>RC report<br />
Tried to get replication status information out of slony </p>
<p>2nd March</p>
<p>T1 to multiple T2 transfer tests  &#8211; attempting to diagnose network fault previously seen</p>
<p>1st March</p>
<p>GridPP-Storage phone conference<br />
SC phone conference</p>
<p>28th February</p>
<p>T1 to multiple T2 Transfer tests, started at 10am asked to stop by Site Networking at 2:50pm due to affecting other site traffic<br />
Asked DK about APEL publishing- RAS had noticed we&#8217;d stopped publishing<br />
Attended meeting about Lustre<br />
Did some lingering rpm updates from run by ST</p>
<p>27th February</p>
<p>Monday Morning Operations Meeting<br />
Noted RAL-LCG2 failing SFT due to CA certificate update, asked LCG-ROLLOUT when new release via LCG expected &#8211; no reply<br />
Mailed GS about files to be use in transfer test- replicated the intended file over more disk servers. Debugged some transfers failing due to permission problems &#8211; permision changer script hadn&#8217;t started back aup again after restart<br />
Mailed person who had sent e-mail about OPN to CERN outage asking about current status &#8211; still down</p>
<p>24th February</p>
<p>Applied various outstanding updates to systems<br />
RC Report with ST</p>
<p>23rd February</p>
<p>Found 2 CMS transfers, to same pool, through same gridftp server at FNAL at same time -1 failed &#8211; 1 succeeded, reported these to TB, who replied telling me about problems with the stager at FNAL.</p>
<p>22nd February</p>
<p>GridPP-Storage phone conference</p>
<p>21st February </p>
<p>Debugging CMS transfers</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/03/07/210206-070306/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>20/02/2006/</title>
		<link>http://www.drossy.net/blog/2006/02/20/20022006/</link>
		<comments>http://www.drossy.net/blog/2006/02/20/20022006/#comments</comments>
		<pubDate>Mon, 20 Feb 2006 21:46:51 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[RT]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/182</guid>
		<description><![CDATA[Shutdown startup: vacuumed postgres db&#8217;s on 350 and pnfs &#8211; lots of disk space reclaimed, upgraded pnfs to 8.1.3. Installed slony on pnfs and setup replication to 438 &#8211; still needs logrotation and startup scripts to be done, would have taken less time than it did but vi decide to be too clever by half [...]]]></description>
			<content:encoded><![CDATA[<p>Shutdown startup: vacuumed postgres db&#8217;s on 350 and pnfs &#8211; lots of disk space reclaimed, upgraded pnfs to 8.1.3. Installed slony on pnfs and setup replication to 438 &#8211; still needs logrotation and startup scripts to be done, would have taken less time than it did but vi decide to be too clever by half and not show me that the files I was editing were msdos style and not unix. CB was trying to dcap write access working, pointed him towards gsidcap but our system was still in pieces at that point so couldn&#8217;t really help out that much. Altered pools once they came up to correct settings for multiple io mover queues. Checked gftp doors now using new gftp queue.<br />
Deleted 200+ tickets from helpdesk after mailstorm due to batch scheduler wierdness.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/02/20/20022006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>17/02/2006</title>
		<link>http://www.drossy.net/blog/2006/02/18/17022006/</link>
		<comments>http://www.drossy.net/blog/2006/02/18/17022006/#comments</comments>
		<pubDate>Sat, 18 Feb 2006 23:20:07 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/?p=179</guid>
		<description><![CDATA[Mailed TB logs of a failed FNAL-RAL transfer Installed various kernel updates Built postgres 8.1.3 for SL3 Assisted in farm shutdown]]></description>
			<content:encoded><![CDATA[<p>Mailed TB logs of a failed FNAL-RAL transfer<br />
Installed various kernel updates<br />
Built postgres 8.1.3 for SL3<br />
Assisted in farm shutdown</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/02/18/17022006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>26/01/2006 &#8211; 16/02/2006</title>
		<link>http://www.drossy.net/blog/2006/02/16/26012006-16022006/</link>
		<comments>http://www.drossy.net/blog/2006/02/16/26012006-16022006/#comments</comments>
		<pubDate>Thu, 16 Feb 2006 21:40:34 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[RT]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/176</guid>
		<description><![CDATA[16th February Built postgres 8.1.3 for SL3, helped OS with dCache PoolManager Mailed Zeus about 2 zero length files Did various RGMA requests 15th February Continued configuring new SL4 postgres server Built slony 1.5 for postgres 8.1.3 GridPP-Storage phone conf Shadowed ST doing relocateable WN upgrade to LCG 2.7.0 14th February Mailed CERN about link [...]]]></description>
			<content:encoded><![CDATA[<p>16th February</p>
<p>Built postgres 8.1.3 for SL3, helped OS with dCache PoolManager<br />
Mailed Zeus about 2 zero length files<br />
Did various RGMA requests</p>
<p>15th February</p>
<p>Continued configuring new SL4 postgres server<br />
Built slony 1.5 for postgres 8.1.3<br />
GridPP-Storage phone conf<br />
Shadowed ST doing relocateable WN upgrade to LCG 2.7.0</p>
<p>14th February</p>
<p>Mailed CERN about link &#8211; turned out to be CERN configuration issue<br />
Mailed Lancaster about pingable host on their end of UKLight for more monitoring<br />
Installed new SL4 postgres server</p>
<p>13th February</p>
<p>Monday Morning Ops meeting<br />
Tweaked RT&#8217;s web ui on replies to not attempt to set Ticket owner to current owner &#8211; was interacting with autotaking<br />
Restarted gftp servers &#8211; all stuck at max transfers<br />
Noticed UKLight down, mailed Site Networking<br />
Setup multiple io queues on disk servers &#8211; began restarting quiet ones &#8211; leave rest till powerdown<br />
Configured gftp servers to use gftp queue</p>
<p>10th February</p>
<p>Meeting with ST- reviewed Job plan<br />
Setup autotaking of tickets on reply in RT<br />
Reviewed SFT failures for RC report</p>
<p>9th February</p>
<p>TOAST meeting<br />
Mailed TB about huge number of errors reported in dCache logs from file acces from lcgui02 &#8211; looks like files not being closed properly &#8211; but still not really resolved.<br />
Added query for grid v non-grid usage to T 1 metrics page on wiki</p>
<p>8th February</p>
<p>266,270 couldn&#8217;t access yumit &#8211; turned out to be nscd still using ip address of old system &#8211; nscd -i hosts got things working again<br />
Helpdesk fell over &#8211; rebooted</p>
<p>7th February</p>
<p>Installed new certificates &#8211; but left  keys encrypted causing gridftp transfers to fail for 4 hours &#8211; fixed<br />
Checked GridPP-Storage table&#8217;s Tier 1 historical numbers for RAS<br />
Supplied UKLight plots to MJB</p>
<p>6th February</p>
<p>Bulk requested 8 host certificates, provided feedback on experience to JJ and MV<br />
Supplied gridusage plots to ST<br />
Sent around updated TOAST agenda</p>
<p>3th February</p>
<p>Holiday</p>
<p>2nd February</p>
<p>Holiday</p>
<p>1st February</p>
<p>Holiday</p>
<p>31st January</p>
<p>Reported 2 problems with yum it to CC<br />
Talking with PS, decide that RT < -> UKIROC Footprints problem was down to problematical site mail server, configured helpdesk to not use that mail server.</p>
<p>30th January</p>
<p>Monday morning ops meeting<br />
Asked ca people about bulk cert request script<br />
Mailed CC &#038; ST about yumit not displaying packages in host detail<br />
Updated scarf helpdesk aliases to point to HPCSG&#8217;s footprints box</p>
<p>27th January</p>
<p>Supplied RAS with Grid vs Non-Grid CPU time totals </p>
<p>26th January</p>
<p>Mailed GC some questions for CHEP</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/02/16/26012006-16022006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>25/01/2006</title>
		<link>http://www.drossy.net/blog/2006/01/25/25012006/</link>
		<comments>http://www.drossy.net/blog/2006/01/25/25012006/#comments</comments>
		<pubDate>Wed, 25 Jan 2006 21:24:39 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/174</guid>
		<description><![CDATA[Setuping up Babar&#8217;s space on dcache Asked for input on Experiments activites vs GridPP milestones &#8211; decided on free-for-all, got people to schedule T1-T2 transfers over the remainder of the month Attended GridPP-Storage phone conference Helped out OS with dCache problem reinstalled Marley]]></description>
			<content:encoded><![CDATA[<p>Setuping up Babar&#8217;s space on dcache<br />
Asked for input on Experiments activites vs GridPP milestones &#8211; decided on free-for-all, got people to schedule T1-T2 transfers over the remainder of the month<br />
Attended GridPP-Storage phone conference<br />
Helped out OS with dCache problem<br />
reinstalled Marley</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/25/25012006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>24/01/2006</title>
		<link>http://www.drossy.net/blog/2006/01/25/24012006/</link>
		<comments>http://www.drossy.net/blog/2006/01/25/24012006/#comments</comments>
		<pubDate>Wed, 25 Jan 2006 21:11:36 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/173</guid>
		<description><![CDATA[Transfers to CERN finshed at around midday &#8211; 150MB/s achieved overnight Lancaster did some transfes from early afternoon &#8211; 833Mb/s Marley&#8217;s disk reported errors &#8211; handed over to GP Assisted CB with dCache problem]]></description>
			<content:encoded><![CDATA[<p>Transfers to CERN finshed at around midday &#8211; 150MB/s achieved overnight<br />
Lancaster did some transfes from early afternoon &#8211; 833Mb/s<br />
Marley&#8217;s disk reported errors &#8211; handed over to GP<br />
Assisted CB with dCache problem</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/25/24012006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>23/01/2006</title>
		<link>http://www.drossy.net/blog/2006/01/24/23012006/</link>
		<comments>http://www.drossy.net/blog/2006/01/24/23012006/#comments</comments>
		<pubDate>Tue, 24 Jan 2006 14:03:51 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/172</guid>
		<description><![CDATA[(Makes it easier to read all of long log entries if they have a title, so going to try and remember and put relevant dates in work entries) Made various tweaks to dCache -restarted all gridftp door over course of the morning &#8211; this fixed an balance issue we were seeing with SC transfers avoiding [...]]]></description>
			<content:encoded><![CDATA[<p>(Makes it easier to read all of long log entries if they have a title, so going to try and remember and put relevant dates in work entries)</p>
<p>Made various tweaks to dCache -restarted all gridftp door over course of the morning &#8211; this fixed an balance issue we were seeing with SC transfers avoiding gftp0444 -probably due to lingering connections to that system<br />
Restarted some more pools<br />
Cleared the CMS files from nfs39 left over when we gave it to LHCb, as dCache decides to use pools based on the amount of free space, not the the amount of &#8220;freeable&#8221; space so nfs39&#8242;s pools weren&#8217;t getting as much use as they might have.<br />
Noticed queue of transfers on nfs39 &#8211; so raised maximum movers &#8211; immediate data movement out of server to lhcb jobs on batch farm, still not understood why things had got queued though &#8211; possibly too many jobs opening multiple files?</p>
<p>Attended:<br />
Monday morning ops meeting<br />
Tech discussion (gave talk)<br />
SC3 phone conference (mailed RAL&#8217;s report afterwards to JS too)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/24/23012006/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title></title>
		<link>http://www.drossy.net/blog/2006/01/20/169/</link>
		<comments>http://www.drossy.net/blog/2006/01/20/169/#comments</comments>
		<pubDate>Fri, 20 Jan 2006 17:05:11 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/169</guid>
		<description><![CDATA[Friday Jan 2oth More babysitting SC3 rerun, csfnfs51 gave too many files errors &#8211; restarted it Research for talk on Dcache and SRB on Monday Various errata rpms applied to systems Meeting with ST Thursday 19th Monitoring SC3 rerun Research for talk]]></description>
			<content:encoded><![CDATA[<p>Friday Jan 2oth</p>
<p>More babysitting SC3 rerun, csfnfs51 gave too many files errors &#8211; restarted it<br />
Research for talk on Dcache and SRB on Monday<br />
Various errata rpms applied to systems<br />
Meeting with ST</p>
<p>Thursday 19th</p>
<p>Monitoring SC3 rerun<br />
Research for talk</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/20/169/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title></title>
		<link>http://www.drossy.net/blog/2006/01/19/165/</link>
		<comments>http://www.drossy.net/blog/2006/01/19/165/#comments</comments>
		<pubDate>Thu, 19 Jan 2006 02:14:39 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/165</guid>
		<description><![CDATA[Baby sat SC3 rerun: rebooted gftp0447 discovered what seemed to be two dcache-pool services running on csfnfs63, at least lots of java process were still there after a service dcache-pool stop, killing all them and restarting dcache-pool seems to have got rid of the poolRestarted messages in the PoolManger logs, however csfnfs63 is still taking [...]]]></description>
			<content:encoded><![CDATA[<p>Baby sat SC3 rerun:<br />
 rebooted gftp0447<br />
 discovered what seemed to be two dcache-pool services running on csfnfs63, at least lots of java process were still there after a service dcache-pool stop, killing all them and restarting dcache-pool seems to have got rid of the poolRestarted messages in the PoolManger logs, however csfnfs63 is still taking data in much faster than any other disk server.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/19/165/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title></title>
		<link>http://www.drossy.net/blog/2006/01/17/163/</link>
		<comments>http://www.drossy.net/blog/2006/01/17/163/#comments</comments>
		<pubDate>Tue, 17 Jan 2006 11:51:11 +0000</pubDate>
		<dc:creator>Derek</dc:creator>
				<category><![CDATA[Dcache]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://www.drossy.net/blog/archives/163</guid>
		<description><![CDATA[Ops meeting SC3 phone conference Rebooked ops meeting for next 52 weeks Kept an eye on dCache hosts, nfs39 showing too many open files error so restarted with updated ulimit -n value &#8211; must remember to reinstate that after future upgrades.]]></description>
			<content:encoded><![CDATA[<p>Ops meeting<br />
SC3 phone conference<br />
Rebooked ops meeting for next 52 weeks<br />
Kept an eye on dCache hosts, nfs39 showing too many open files error so restarted with updated ulimit -n value &#8211; must remember to reinstate that after future upgrades.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.drossy.net/blog/2006/01/17/163/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
