From: kolos@pcatd88.cern.ch on behalf of Kolos Serguei [Serguei.Kolos@cern.ch] Sent: Monday, July 22, 2002 3:55 PM To: Giuseppe Mornacchi Subject: AWG ideas Hi Giuseppe Here are some ideas about the TDAQ Architecture group. I agree with the Livio's proposal about a general responsibility if the group - "Coordination of definition of the overall TDAQ architecture including major system elements and their relationships and interfaces". In addition to that here are some more specific issues which in my opinion can be tackled in the context of the AWG: 1. the TDAQ operational model including aspects related to changing the run mode (e.g. from physics to calibration), run number, etc. 2. the TDAQ Partition operational model including issues related to a splitting and joining partitions online, defining partitions hierarchy, etc. 3. the diagnostics and error recovery in the TDAQ system. It seems a distributed diagnostics and error recovery scheme is one of the possible solutions for such a large system as the ATLAS TDAQ. In this case a distributed model shall be defined and agreed among all the TDAQ (sub)systems. 4. the general fault tolerance model for the TDAQ. This may include a set of scenarios which describe what happen with some of the TDAQ (sub)systems if one or more of the others encounter errors. In addition those scenarios can specify how to resolve the faults and which (sub)systems are responsible. Here there is a tight reference to the point 3 of this proposals. Cheers, Sergei