IETF-ID LTASP disambiguates DIAP

February 23rd, 2010

The project has moved the IETF-ID from DIAP to LTASP. Long-Term Archive Storage Protocol disambiguates Distributed Internet Archiving Protocol. More accurately describing the aims of the project. The name of DIASER, this GPL software product, will not change. Abstract taken from: http://www.ietf.org/id/draft-brasher-ltasp-02.txt

Long-term archiving storage fundamentally begins with archive data Accumulation, then Replication and then Management. Using A->R->M, LTASP has been created to solve mid-range and below, long-term archiving requirements of the small-medium enterprise. Where tape has been deployed in the past, LTASP now offers an alternative solution designed to be more robust and manageable in the long term than network attached storage devices or simple disk storage alone.

Damian

DIASER Ubuntu LTS sudo workaround – variables defined

January 24th, 2010

‘DIR_B’         => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “/home/” },
‘DIR_C’         => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “/home/” },

# additional variables to cope with the Ubuntu (and other distros) forced
# use of sudo; if the USE_SUDO variable is positive then commands that
# require root access will use sudo with the -S switch,
# i.e. echo “root_password” | sudo -S command
# also the username of the account to connect to – instead of root -
# will be used: SUDO_ACCOUNT_NAME_{node}
‘USE_SUDO’      => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “0″ },
‘SUDO_ACCOUNT_NAME_A’ => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “usera” },
‘SUDO_ACCOUNT_NAME_B’ => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “userb” },
‘SUDO_ACCOUNT_NAME_C’ => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “userc” },

# user defined variables for the DIASER filling mechanism / fill_diaser.pl
‘FILL_START_TIME’   => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “10″ },
‘VOLUME_DIR’    => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “/mnt/bkp/” },
‘DIFF_CONST_PREFIX’ => { ARGCOUNT => ARGCOUNT_ONE, DEFAULT => “diff” },

Feels like spring!

January 24th, 2010

DSCF1703

ebook – setup or streamline your Open Source software project

January 3rd, 2010

Excerpt: “This ebook is designed to help you gain some deeper insight into a small scale, technical Open Source software development collaboration. Perhaps you, the reader, have started, or are part way through, a project; or you are a manager or investor wanting to dig a little deeper into the inner workings of Open Source software development. As the designer, developer and author of a project I recently completed a stage of collaboration that took the software from a prototype to a beta-1 evaluation product. I’ll be talking you through the stages of development from initial conception to beta-1. – Damian

Walking With the Elephants – Out now.

Coincidentally, Mark Webbink, former general counsel of Red Hat, Inc. and presently a visiting professor of law at New York Law School, has written extensively about open source software licensing, working another sort of (legal IT) elephant; Walking With Elephants


Hurdy Gurdy called Lyon

December 14th, 2009

I started restoring this 1840’s French Baroque style Hurdy Gurdy in March and have now reached the stage where I can tune the instrument well. There are many elements involved when tuning a Hurdy Gurdy and part of the restoration process is reinstating the fine control and balance between the body, wheel, strings, keys, tangents, nut, rosin and other crucial components. As a teenager I learned to play the piano to grade 5 then stopped, fortunately my music reading skills have remained largely intact, I’m aged 38 now. I have really enjoyed the restoration but now eager to improve my playing technique of this magical instrument and of course my repertoire. It is a nice way to balance time spent working on a computer.

I’m certainly no virtuoso, a mere novice, but shall endeavour to improve when time allows. Lyon still has some clacking of the keys when they return to the open position due to gravity, I intend to replace the cloth key dampening which has worn away over the years. I also need to reinstate the tirant. I’m learning some simple tunes in G-C tuning and practising the exercises to control the buzz created by the trompette. I am really pleased that I have been able to achieve a sweet sound with two strings; a chanterelle and the trompette and will add more strings to the wheel when I am ready.

Here is: Dining Table played by Mathew Szostak. For some virtuoso Hurdy Gurdy music, view and listen to a performance given by Bouffard & Chabenat.

Click on the image below to see a selection of photographs taken during the restoration from March – December 2009.

Links to resources:

Neil Brook Hurdy Gurdy

Northern Renaissance Instruments

Doreen Muskett

DIAP IETF Internet Draft v11 released

November 21st, 2009

DIASER is the GPL software product, there are no plans to make DIAP a software product.

http://www.ietf.org/id/draft-brasher-diap-11.txt

Abstract

DIAP has been created to solve mid-range and below, long term
archiving requirements of the small medium enterprise.  Where tape
has been deployed in the past, DIAP now offers an alternative
solution designed to be more robust and manageable in the long term
than network attached storage devices or simple disk storage alone.
The system provides a well defined structure for storing and managing
long term archives.

Collaborative SQL

November 14th, 2009

I think this is a nice short example of some collaborative work written with the use of a mailing list; here is the thread where I proposed the initial SQL outline, from there it was modified then implemented:

http://www.hants.lug.org.uk/lurker/message/20090113.115628.7a17529e.en.html


CC-GNU GPL

GPL http://creativecommons.org/licenses/GPL/2.0/ 2.0

#---------------------------------------------------------------------------
# Hampshire Linux Users Group meeting register of attendees.
# Executed once to define database and tables with constraints.
#---------------------------------------------------------------------------
#DROP DATABASE bab_register;
CREATE DATABASE bab_register DEFAULT CHARACTER SET utf8;

USE bab_register;

DROP TABLE IF EXISTS attendee;
CREATE TABLE attendee (
id INT NOT NULL AUTO_INCREMENT,
name VARCHAR(50) NOT NULL,
email VARCHAR(150),
member ENUM('yes','no') NOT NULL,
new_attendee ENUM('no','yes') NOT NULL,
PRIMARY KEY (id)
) COMMENT "HLUG BaB meeting register Southampton Uni (ECS)";

DROP TABLE IF EXISTS macs;
CREATE TABLE macs (
id INT NOT NULL AUTO_INCREMENT,
attendee_id INT NOT NULL,
mac VARCHAR(17) NOT NULL,
PRIMARY KEY (id),
FOREIGN KEY (attendee_id) REFERENCES attendee (id)
) COMMENT "MAC Address entries, more than one per attendee";

OSS Vault building

October 28th, 2009

DIASER – beta-1, we are currently working on project sustainability, making contact with potential trial end users and project maintenance continues…

Vault Building

Vault Building

Read more about DIASER…

Gold railings, sea and a rainbow

October 22nd, 2009
Gold railings and sea - Thessaloniki, Greece

Gold railings and sea - Thessaloniki, Greece

I rather liked this one and felt inspired after walking along the promenade for an hour… Thessaloniki.

Rainbow in Nafplion

Rainbow in Nafplion

This was a lucky shot considering the weather… Nafplion.

UNR – out of the box on tour

October 19th, 2009

I have just returned from a  trip which took me to Debrovnik, then to my final destination in Athens to stay with family, via Sarajevo, Belgrade and Thessaloniki. Planes, buses and trains. I took my Acer Aspire One, 110-Ab and installed Ubuntu Netbook Remix to disk using the image transferred to a 1GiB USB memory stick the day before I left and used it out of the box. Download here. I usually use Fedora based operating systems for personal computing but need to stay familiar with modern Debian based distributions for development purposes so took this opportunity to test drive UNR. I’m familiar with the Debian environment.

My computing needs were minimal. I used Firefox, OOo Writer, F-Spot Photo Manager, Gimp, File Browser, Skype, the command line to make a few adjustments and a few games of Lango. I used numerous wireless facilities. My Camera is a Fujifilm F10 6.3 Mega Pixel and the direct USB connection worked without additional software and appeared as an external memory device simply accessible using the file browser.

Interface – there is no doubt the Ubuntu colour scheme has improved. The browns are more toned and the orange a better match for the browns. Coffee and chocolate with a little orange zest and smooth multi-coloured icons seem to work well together. Two years ago, I installed then removed Ubuntu after a few days due to the colour combination which I simply didn’t like. No problems this time. The desktop layout is intuitive and clear and made good use of the AA1 screen. I was happy to swap between applications using the task bar like icons located in the top left hand side of the screen. Applications are  easy to find. A niggle, it took me a few minutes to realise the grey scroll bar on the right existed.

Performance – Application start time was acceptable if a little on the hesitant side. Boot time is fine. The default sound server presented problems when I attempted to use Skype, much used on this trip, I was a little irritated that it took me two hours to make Skype work and had a poor Wifi link at the time. I was on holiday so did not want to spend a long time fixing this. As switching from PulseAudio sound server to ALSA in Sound Preferences did not fix Skype I simply broke PulseAudio by removing some of its key packages and re-installed Skype. Video in Skype no longer worked but I didn’t care as I was not planning to use video. Video still worked with Cheese Webcam Booth. Once Skype was working as long as the network connection was solid there were no more problems. I used some of my trip time, whilst based in Thessaloniki, to write a first draft of a short eBook about small development collaboration, which I plan to make available for sale soon. I wrote just over 10K words using the AA1 and OOo Writer in a couple of days. The screen size is an obvious drawback but UNR font rendering is clear and enabled me to craft the words comfortably. The AA1 screen is also exceptionally clear, one of the features I really like about these particular netbooks.

Network – Wireless worked well most of the time. One in three or four WLAN connections required a reboot and or a network restart. This may be a driver problem or due to the fact I used a number of WiFi clouds during the two week trip and the connection became, unscientifically ‘clogged’, too many low power WiFi clouds in the region? Perhaps a later edition of Netbook would have coped better, the Acer Aspire One, 110-Ab is one of the earliest models. Ubunutu supply an alternative proprietary driver, the use of this did not improve the connection, in fact WLAN stopped working completely when I enabled it, so I continued with the default. Wired LAN worked perfectly.

Power management – suspend worked, hibernate failed. But see this document for additional AA1 specific configuration; here a hibernate fix is described. The AA1 system fan was a little noisy at times, I know there are some scripts available to quieten the fan. I rather hoped this had been handled automatically. But the weather was really warm at times and I undertook most of my computing whilst drinking coffee in a cafe, so the extra noise was unnoticeable. AA1 froze only once during the whole trip, probably due to overheating. I always achieved the specified two hours battery life.

Applications – Very good selection for such a small installation image. I needed to additionally install Skype, Gimp and Java. Skype required a significant number of additional package dependencies, I used apt-get install from the command line and satisfied these after an hour. Most people will use the straightforward GUI package manager. Finding the right Java package took a little extra effort, I eventually used apt-get to install Java from the pre-defined multiverse package repository rather than the version downloaded directly from Sun. Adobe-flashplugin worked out of the box? I don’t have a history entry relating to the installation of adobe-flashplugin, it must have either been trivial to install or it was working out of the box. I needed Java Firefox plugin for the FaceBook photograph upload tool every few days for the duration of the trip. Automatic updates work well, as I often had a poor WiFi connection I disabled them for the duration of the trip as the pop-up messages became annoying each time I reconnected, two steps were required and this took about quarter of an hour to achieve, finding my way around the Administration and Preferences settings. As for the command line, I don’t often use sudo on a daily basis to execute commands as root, but as I was using the command line sparingly it did not matter.

Final Thoughts – I am sure the additional effort spent improving the colour scheme is paying and will continue to pay dividends. Not essential, depending on your skills and resources, but I would suggest tuning UNR for performance before embarking on a trip and perhaps taking additional time to ensure all the applications you need will work properly before travelling. I really like the fact UNR is available as a tiny 1GiB install image and installation is exceptionally simple, see this guide. The platform felt solid and served its purpose well. An elegant, lightweight solution for a reasonable initial investment of my time.

Get UNR here…

Also first posted on Hampshire Linux Users Group

VDAP, a vulnerability discovery announce protocol?

September 27th, 2009

When you first see this: “Linux Kernel ’sock_sendpage()’ NULL Pointer Dereference Vulnerability” in an email or twitter it does not mean much at first glance.

When you realise this is a kernel issue if you are a Linux systems administrator you will start scanning, thinking and then you read some more and realise this is serious. Then after 10 minutes the ramifications hit home depending on the systems you maintain and responsibility level on a scale between 1 (a single NetBook running UNR) and 10, plus some off the scale, some may know what I mean.

My sentiments towards the Google security team are ambivalent. They were doing a job and did it well. I feel that the Open Source universe (and propriety) does not have a satisfactory model for announcing and fixing serious security vulnerabilities. The impact on users, businesses, code maintainers, managers all the way through the various components of the producer to consumer can be heavy and I believe the process of announcement to fix could be much much better.

The Google security team announcement (Actually the news was leaked) on the 13th September will have had a negative impact on Open Source communities, like any accident does, but it seems that lessons ought to be learnt.

A simple model could be:

1) Security vulnerability found.
2) Developer(s) contacted privately before announcement is made public.
3) Developer fix privately forwarded to major vendors.
4) Major vendors generate patch and make it available.
5) Public announcement is made.

This point will have been made time and again, could or should a protocol be made law? If a vulnerability has taken a team of top security experts to discover then the likelihood of an individual or organisation finding the same vulnerability in the same amount of time must be slight in general.

I find it difficult to see the benefits of making a vulnerability public before contacting the developer at least. Should a large multi-national like Google be allowed to uncover an error then tell the whole world when it feels like it? Is that ethical? Could it be seen as an act of aggression? (see comment below) (again, as reported above, the news was leaked).

From now on I’m going to make a point of reading more about this process and following moves to change the current vulnerability discovery announce protocol.

Damian Brasher

Original post on Hampshire Linux Users Group

DIASER, Open Source Data Vault Application

September 27th, 2009

Open Source Data Vault Application

Proactive archiving… an advanced disk based backup volume management system with triple redundancy for HE and SME. A quick and low-cost way to make an environment more robust and data more accessible by archiving in multiple places. This replication also provides fast retrieval of archived data from all node hosting locations. A Perl installer creates the system. The application provides a decentralised, self-contained and managed storage utility.

Nodes can be dedicated to storage or used for existing services over unused bandwidth. DIASER works in user space over SSH. The software is based on DIAP which is a storage architecture designed to structure months to years of long term sustainable archiving space including retrospective archiving.

The application is beta-1. Please download to evaluate and trial the software, currently being tested with Red Hat Enterprise Linux and Fedora 10. I aim to test with other Linux distributions as well as ActivePerl, freeBSD and MacOSX.

http://www.diaser.org.uk

Damian Brasher

Data Vitality

September 27th, 2009

DIAP Rule of Thumb
Circumstantial observation of my email archive, at 272MBytes, having never deleted an email permanently and the file, ../mail, has been in use for 4 years. During this time my available xDSL line Bandwidth has increased, 2004 500MBits/sec to 1GBit/sec, 2008 1GBit/sec to 6GBits/Sec this is about 150% yearly increase whereas my mailbox has increased yearly by about 50%. It is this difference which DIAP attempts to use classing my email record as ‘mission critical’. Other record types will increase at different rates, as will bandwidth depending on location, but probably less than the average yearly bandwidth increase. This idea needs expanding but forms the foundation for the usefulness of DIAP, describing a DIAP rule of thumb. DIAP can also be viewed as a technique.
A pilot survey was run between June 08 and Aug 08 on Hampshire Lug ML named:- ‘[Hampshire] [OT] Identifying the importance and value of data’. This form the basis of a decision matrix tool to help DIAP ® users select data to use in DIAP ® system.Question 2)

“Following on from a survey I posted back in June, see below, designed to help make decisions about data vitality and importance to individuals and organisations, as well as find out a little bit more about the relationship between the importance of data types and their size, I have some results to publish. The survey was exclusive to Hants Lug and deliberately kept low key and has been a very useful exercise even though the number of participants has been relatively small – but very good for such a small readership, quality not quantity.

Thanks to those that took part and those that provided feedback and constructive criticism also thanks to HL ML readers for living with the thread. I will now design a new survey with some sort of incentive and float this to a much larger audience. The results I have are enough to incorporate a DIAP (R) decision matrix on the project website.

6 Participants:

Question 1)

Rate FIVE of these DATA TYPES if they were lost completely how best describes the effect to your users (and or yourself) organisation or home occupants.

Results:

*total damage – cost crippling – traumatic

2 [participants specified] Documents
1 Presentations
1 Photographs
1 Email boxes on a server
2 Code repository
1 Website code

*massive damage – high cost – devastating

1 Spreadsheets
1 .txt files
1 Email boxes on a server

*major damage – very costly – extremely upsetting

1 pdf documents
1 Photographs

*significant damage – significant cost – very upsetting

1 Spreadsheets
2 Accounting Software data
1 Photographs
1 server configuration files
1 music files
1 Virtual machine images

*damage – expensive – annoying

1 Documents
1 pdf documents
1 MS .pst file
1 Email boxes on a server
1 music files
1 ISO images
1 Virtual machine images
1 Code repository

Size the FIVE choices you made in the previous question

Results:

1 MB         100 MB        8 items
>100 MB  500 MB        1 item
>500 MB   1 GB            8 items
>1 GB       10 GB          8 items
>10 GB     50 GB          0 items
>50 GB     100 GB        2 items
>100 GB                        1 item

So from just this small pilot survey I can deduce qualitatively with reasonable certainly that importance of data, subjective to the individual or organisation, does not depend heavily on the data type.

That importance of data in relation to file size is loosely inversely proportional. So the most important files are generally the smallest in size. This is very encouraging information for the DIAP (R) project.

Thanks again to readers and participants.”

Damian Brasher

DIASER: Long term distributed internet archiving application

September 26th, 2009

Requested by: http://code.google.com/p/jiscri/wiki/ProjectDocumentation

According to: JISCRI finalProgressPost requirements.

In this post you are required to address each one of the below elements, this post along with your final prototype will be evaluated for its readiness to launch to the end user. If selected, we’ll be using this post as the final “advertisement” that will go in an ‘Argos-like catalogue’ of JISC Software Prototypes (a vision of the future to come!). We hope to put this publication on the desk of as many senior managers in UK HE/FE as possible as well as other potential investors (e.g. NGOs and VCs) in the New Year so please write this Final post with that audience in mind.

Description of Prototype:

Proactive archiving… an advanced disk based backup volume management system with triple redundancy for HE and SME. A quick and low-cost way to make an environment more robust and data more accessible by archiving in multiple places. This replication also provides fast retrieval of archived data from all node hosting locations. A Perl installer creates the system. The application provides a decentralised, self-contained and managed storage utility.

Screenshot or diagram of prototype:

Diagram:

Vault Building

Vault Building

Flash Demo:

http://www.diaser.org.uk/about.html#configure

Screenshot:

DIASER Usage
DIASER Usage
Bandwidth
Bandwidth

Link to working prototype:

http://sourceforge.net/projects/diaser

http://sourceforge.net/projects/diaser/files/

Link to end user documentation:

http://www.diaser.org.uk/files/DiaserDocsv1.1.pdf

Link to code repository or API:

http://diaser.svn.sourceforge.net/viewvc/diaser

Link to technical documentation:

http://www.diaser.org.uk/manual.html

Date prototype was launched: 2nd June 2009

Project Team Names, Emails and Organisations:

PM/Dev: Damian Lajos Brasher

d {.} brasher @ omii {.} ac {.} uk OMII-UK ECS Interlinux Ltd

Neil P Chue Hong: Project advisor

n {.} ChueHong  @  omii {.} ac {.} uk OMII-UK ECS

Dr Simon J Hettrick: Documentation, graphics and marketing

s {.} hettrick @ omii {.} ac {.} uk OMII-UK ECS

Project Websites:

http://www.diaser.org.uk

http://www.diap.org.uk/background_ip

http://www.omii.ac.uk/wiki/DIASER (OMII-UK)

http://walkingwiththeelephants.co.uk (Sustainability)

http://interlinux.co.uk/wordpress (Final Project Post)

PIMS entry:

https://pims.jisc.ac.uk/projects/view/1267

Table of Content for Project Posts

Microblog posts: hash tag #diaser #jiscri

http://twitter.com/#search?q=%23diaser

DIASER SourceForge RSS news feed

News

18/12/09 v0.2.7 beta-1 released

17/12/09 eBook introduction site live

14/12/09 Interlinux Ltd UK Tax year tasks completed

21/11/09 DIAP IETF Internet Draft v11 released

05/11/09 v0.2.6 beta-1 released

30/09/09 v0.2.5 beta-1 released

21/09/09 v0.2.5 beta-1 released

02/08/09 Screencast – demo / mini tutorial released

21/07/09 DIASER beta-1 released

09/07/09 Attended JISC inf11 projects start-up meeting

04/07/09 Community feedback motivate DIASER architectural changes

25/06/09 Manual for DIASER released

04/06/09 Latest 4 month development roadmap, alpha -> beta-2

02/06/09 Build yourself a private storage cloud…

02/06/09 Deployable alpha of DIASER released

22/05/09 Projected release life cycle

22/05/09 Project memory and thinking

21/05/09 Dec 2006 – Mar 2009

17/05/09 Rollout of first fully deployable alpha planned wk 25th May

08/05/09 DIASER moves from alpha-dev to alpha

27/04/09 DIASER funded by UK Joint Information Systems Committee

Other

Download Statistics SourceForge total 1612 since April 2009

      Rank Pages Downloads

Dec 2009 * N/D 8,063 124
Nov 2009 1,585 8,917 183
Oct 2009 1,901 6,200 164
Sep 2009 1,547 11,562 291
Aug 2009 1,317 7,606 423
Jul 2009 1,026 9,588 300
Jun 2009 1,529 4,463 28
May 2009 1,258 4,468 51
Apr 2009 3,429 2,164 53
Dec 2009 * N/D 8,063
Nov 2009 1,585 8,917
Oct 2009 1,901 6,200
Sep 2009 1,547 11,562
Aug 2009 1,317 7,606
Jul 2009 1,026 9,588
Jun 2009 1,529 4,463
May 2009 1,258 4,468
Apr 2009 3,429 2,164

Google analytics for diaser.org.uk :

Analytics_www.diaser.org.uk_20090407-20091222

Analytics_www.diaser.org.uk_20090407-20091222