source: projects/specs/trunk/h/heartbeat/heartbeat-vl.spec @ 521

Revision 521, 60.2 KB checked in by daisuke, 14 years ago (diff)

import VineSeed package specs

Line 
1%define           ENABLE_SNMP_SUBAGENT 0
2%define           ENABLE_MGMT 1
3%define           uid 24
4%define           gname haclient
5%define           uname hacluster
6
7Summary:          Heartbeat subsystem for High-Availability Linux
8Summary(ja):      高可用性 Linux のための HeartBeat サブシステム
9Name:             heartbeat
10Version:          2.0.8
11Release:          4%{_dist_release}
12License:          GPL/LGPL
13URL:              http://linux-ha.org/
14Group:            System Environment/Daemons
15Source0:          http://linux-ha.org/download/%{name}-%{version}.tar.gz
16Patch0:           heartbeat-fedora-pam.patch
17Patch1:           heartbeat-fedora-chkconfig.patch
18BuildRoot:        %{_tmppath}/%{name}-%{version}-%{release}-root-%(%{__id_u} -n)
19BuildRequires:    glib2-devel
20BuildRequires:    iputils
21BuildRequires:    libnet-devel
22BuildRequires:    libtool-ltdl-devel
23BuildRequires:    net-snmp-devel
24BuildRequires:    bzip2-devel
25BuildRequires:    ncurses-devel
26%if %{ENABLE_MGMT}
27BuildRequires:    gnutls-devel > 2
28BuildRequires:    pam-devel
29BuildRequires:    python-devel
30BuildRequires:    swig
31BuildRequires:    curl-devel
32%endif
33Requires:         stonith = %{version}-%{release}
34Requires:         pils = %{version}-%{release}
35Requires:         ldirectord = %{version}-%{release}
36Requires:         pygtk2 >= 2.4
37Requires(post):   /sbin/chkconfig
38Requires(preun):  /sbin/chkconfig
39%if %{ENABLE_MGMT}
40Requires:         gettext
41%endif
42
43%description
44heartbeat is a basic high-availability subsystem for Linux-HA.
45It will run scripts at initialization, and when machines go up or down.
46This version will also perform IP address takeover using gratuitous ARPs.
47It supports "n-node" clusters with significant capabilities for managing
48resources and dependencies.
49
50In addition it continues to support the older release 1 style of
512-node clustering.
52
53It implements the following kinds of heartbeats:
54        - Serial ports
55        - UDP/IP multicast (ethernet, etc)
56        - UDP/IP broadcast (ethernet, etc)
57        - UDP/IP heartbeats
58        - "ping" heartbeats (for routers, switches, etc.)
59           (to be used for breaking ties in 2-node systems)
60
61%package -n ldirectord
62Summary:          Monitor daemon for maintaining high availability resources
63Summary(ja):      高可用リソースを管理するための監視デーモン
64Group:            System Environment/Daemons
65Requires:         ipvsadm
66Requires:         %{name} = %{version}
67Requires(post):   /sbin/chkconfig
68Requires(preun):  /sbin/chkconfig
69
70%description -n ldirectord
71ldirectord is a stand-alone daemon to monitor services of real
72for virtual services provided by The Linux Virtual Server
73(http://www.linuxvirtualserver.org/). It is simple to install
74and works with the heartbeat code (http://www.linux-ha.org/).
75
76%package -n stonith
77Summary:        Provides an interface to Shoot The Other Node In The Head
78Group:          System Environment/Daemons
79Requires:       pils = %{version}-%{release}
80
81%description -n stonith
82The STONITH module (a.k.a. STOMITH) provides an extensible interface
83for remotely powering down a node in the cluster.  The idea is quite simple:
84When the software running on one machine wants to make sure another
85machine in the cluster is not using a resource, pull the plug on the other
86machine. It's simple and reliable, albeit admittedly brutal.
87#'
88
89%package -n pils
90Summary:        Provides a general plugin and interface loading library
91Group:          System Environment/Daemons
92
93%description -n pils
94PILS is an generalized and portable open source Plugin and Interface Loading
95System. PILS was developed as part of the Open Cluster Framework reference
96implementation, and is designed to be directly usable by a wide variety of
97other applications.
98PILS manages both plugins (loadable objects), and the interfaces these plugins
99implement. PILS is designed to support any number of plugins implementing any
100number of interfaces.
101
102%package devel
103Summary:        Heartbeat development package
104Summary(ja):    Heartbeat 開発パッケージ
105Group:          Development/Libraries
106Requires:       heartbeat = %{version}-%{release}
107
108%description devel
109Heartbeat development package
110
111%prep
112%setup -q
113%patch0 -p1
114%patch1 -p1
115 
116%build
117# disable-fatal-warnings flag used to disable gcc4.x warnings of 'difference in signedness'
118CFLAGS=${RPM_OPT_FLAGS} \
119%configure \
120  --disable-fatal-warnings \
121  --disable-static \
122%if %{ENABLE_MGMT}
123  --enable-mgmt
124%else
125  --disable-mgmt
126%endif
127make
128
129%install
130rm -rf $RPM_BUILD_ROOT
131mkdir -p $RPM_BUILD_ROOT
132make DESTDIR=$RPM_BUILD_ROOT install
133(
134  cd $RPM_BUILD_ROOT/etc/ha.d/resource.d
135  ln -s /usr/sbin/ldirectord ldirectord
136)
137
138mkdir -p $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/conf
139
140# cleanup
141[ -d $RPM_BUILD_ROOT/usr/man ] && rm -rf $RPM_BUILD_ROOT/usr/man
142[ -d $RPM_BUILD_ROOT/usr/share/libtool ] && rm -rf $RPM_BUILD_ROOT/usr/share/libtool
143find $RPM_BUILD_ROOT -type f -name *.la -exec rm -f {} ';'
144
145sed -i -e '1i# -*-Shell-script-*-' $RPM_BUILD_ROOT/%{_libdir}/heartbeat/ocf-shellfuncs
146sed -i -e '1i# -*-Shell-script-*-' $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/shellfuncs
147chmod -x $RPM_BUILD_ROOT/%{_libdir}/heartbeat/ocf-shellfuncs
148chmod -x $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/shellfuncs
149chmod -x $RPM_BUILD_ROOT/%{_libdir}/heartbeat-gui/pymgmt.py
150
151%clean
152if [ -n "${RPM_BUILD_ROOT}"  -a "${RPM_BUILD_ROOT}" != "/" ]; then
153  rm -rf $RPM_BUILD_ROOT
154fi
155
156%pre
157/usr/sbin/groupadd -g %{uid} %{gname} &>/dev/null || :
158/usr/sbin/useradd -M -r -u %{uid} -s /sbin/nologin -d /var/lib/heartbeat/cores/hacluster \
159        -c 'heartbeat user' -g %{gname} %{uname} &>/dev/null || :
160
161%post
162/sbin/ldconfig
163/sbin/chkconfig --add heartbeat
164
165%preun
166/sbin/chkconfig --del heartbeat
167
168%postun
169/sbin/ldconfig
170test "$1" != 0 || /usr/sbin/userdel  %{uname} &>/dev/null || :
171test "$1" != 0 || /usr/sbin/groupdel %{gname} &>/dev/null || :
172
173%post -n ldirectord
174/sbin/chkconfig --add ldirectord
175
176%preun -n ldirectord
177/sbin/chkconfig --del ldirectord
178
179%post -n stonith -p /sbin/ldconfig
180
181%postun -n stonith -p /sbin/ldconfig
182
183%files
184%doc %{_datadir}/doc/%{name}-%{version}
185%defattr(-,root,root)
186%dir %{_sysconfdir}/ha.d
187%{_sysconfdir}/ha.d/harc
188%config(noreplace) %{_sysconfdir}/ha.d/shellfuncs
189%{_sysconfdir}/ha.d/rc.d
190%config(noreplace) %{_sysconfdir}/ha.d/README.config
191%{_libdir}/heartbeat
192%{_libdir}/heartbeat-gui
193%{_libdir}/libapphb.so.*
194%{_libdir}/libccmclient.so.*
195%{_libdir}/libcib.so.*
196%{_libdir}/libclm.so.*
197%{_libdir}/libcrmcommon.so.*
198%{_libdir}/libtransitioner.so.*
199%{_libdir}/libhbclient.so.*
200%{_libdir}/liblrm.so.*
201%{_libdir}/libpengine.so.*
202%{_libdir}/libplumb.so.*
203%{_libdir}/libplumbgpl.so.*
204%{_libdir}/librecoverymgr.so.*
205%{_libdir}/libstonithd.so.*
206%{_prefix}/lib/ocf
207%{_sysconfdir}/ha.d/resource.d/
208%exclude %{_sysconfdir}/ha.d/resource.d/ldirectord
209%{_sysconfdir}/init.d/heartbeat
210%config(noreplace) %{_sysconfdir}/logrotate.d/heartbeat
211%dir %{_var}/lib/heartbeat
212%dir %{_var}/lib/heartbeat/cores
213%dir %attr (0755, root, root) %{_var}/lib/heartbeat/cores/root
214%dir %attr (0755, nobody, nobody) %{_var}/lib/heartbeat/cores/nobody
215%dir %attr (0755, hacluster, haclient) %{_var}/lib/heartbeat/cores/hacluster
216%dir %{_var}/run/heartbeat
217%attr (0755, hacluster, haclient) %{_bindir}/cl_status
218%{_bindir}/cl_respawn
219%{_sbindir}/crmadmin
220%{_sbindir}/cibadmin
221%{_sbindir}/ccm_tool
222%{_sbindir}/crm_diff
223%{_sbindir}/crm_uuid
224%{_sbindir}/crm_mon
225%{_sbindir}/crm_sh
226%{_sbindir}/iso8601
227%{_sbindir}/crm_master
228%{_sbindir}/crm_standby
229%{_sbindir}/crm_attribute
230%{_sbindir}/crm_resource
231%{_sbindir}/crm_verify
232%{_sbindir}/attrd_updater
233%{_sbindir}/crm_failcount
234%{_sbindir}/ocf-tester
235%dir %attr (755, hacluster, haclient) %{_var}/run/heartbeat/ccm
236%dir %attr (755, hacluster, haclient) %{_var}/run/heartbeat/crm
237%dir %attr (755, hacluster, haclient) %{_var}/lib/heartbeat/crm
238%dir %attr (755, hacluster, haclient) %{_var}/lib/heartbeat/pengine
239%{_mandir}/man1/cl_status.1*
240%{_mandir}/man1/ha_logger.1*
241%{_mandir}/man1/hb_standby.1*
242%{_mandir}/man1/hb_takeover.1*
243%{_mandir}/man1/hb_addnode.1*
244%{_mandir}/man1/hb_delnode.1*
245%{_mandir}/man8/heartbeat.8*
246%{_mandir}/man8/apphbd.8*
247%{_mandir}/man8/ha_logd.8*
248%{_mandir}/man8/cibadmin.8*
249%{_mandir}/man8/crm_resource.8*
250%if %{ENABLE_SNMP_SUBAGENT}
251/LINUX-HA-MIB.mib
252%endif
253%if %{ENABLE_MGMT}
254%{_libdir}/libhbmgmt.so.*
255%{_libdir}/libhbmgmtclient.so.*
256%{_libdir}/libhbmgmtcommon.so.*
257%{_libdir}/libhbmgmttls.so.*
258%config(noreplace) %{_sysconfdir}/pam.d/hbmgmtd
259%{_datadir}/locale/zh_CN/LC_MESSAGES/haclient.mo
260%endif
261
262%files -n ldirectord
263%doc doc/COPYING
264%doc doc/README
265%doc ldirectord/ldirectord.cf
266%defattr(-,root,root)
267%{_sysconfdir}/ha.d/conf
268%{_sbindir}/ldirectord
269%{_sbindir}/supervise-ldirectord-config
270%config(noreplace) %{_sysconfdir}/logrotate.d/ldirectord
271%{_sysconfdir}/init.d/ldirectord
272%{_sysconfdir}/ha.d/resource.d/ldirectord
273%{_mandir}/man8/ldirectord.8*
274%{_mandir}/man8/supervise-ldirectord-config.8*
275
276%files -n stonith
277%doc doc/COPYING
278%doc doc/README
279%defattr(-,root,root)
280%{_libdir}/*.so.*
281%{_libdir}/stonith/
282%{_sbindir}/stonith
283%{_sbindir}/meatclient
284%{_mandir}/man8/stonith.8*
285%{_mandir}/man8/meatclient.8*
286
287%files -n pils
288%doc doc/COPYING
289%doc doc/README
290%defattr(-,root,root)
291/usr/include/pils
292%{_libdir}/libpils.*
293%{_libdir}/pils/
294
295%files devel
296%doc %{_datadir}/doc/%{name}-%{version}
297%defattr(-,root,root)
298%{_includedir}/heartbeat/
299%{_includedir}/clplumbing/
300%{_includedir}/saf/
301%{_includedir}/ocf/
302%{_includedir}/stonith/
303%{_includedir}/pils/
304%{_libdir}/*.so
305
306%changelog
307* Sun May 17 2009 Ryoichi INAGAKI <ryo1@bc.wakwak.com> 2.0.8-4
308- spec in UTF-8
309- changed devel Group to Development/Libraries
310
311* Tue May 05 2009 Satoshi IWAMOTO <satoshi.iwamoto@nifty.ne.jp> 2.0.8-3
312- new versioning policy
313- rebuild with gnutls 2.6.6, libtool 2.2.6 and curl 7.19.4
314- add BuildRequires: curl-devel
315
316* Sun Jan 06 2008 Shu KONNO <owa@bg.wakwak.com> 2.0.8-0vl2
317- rebuilt with net-snmp-5.4.1
318
319* Sat May 19 2007 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.8-0vl1
320- new upstream release
321- build with new environment
322
323* Mon Oct 30 2006 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.7-0vl2
324- rebuild with gnutls-1.4
325- add Japanese summaries
326
327* Mon Oct 23 2006 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.7-0vl1
328- initial build for Vine Linux based on FE package.
329
330* Wed Aug 30 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.7-2
331- rebuild for Fedora Extras 6
332
333* Wed Aug 16 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.7-1
334- upstream version 2.0.7
335
336* Sat Jul 15 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.6-2
337- added BuildReqs: ncurses-devel
338
339* Fri Jul 14 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.6-1
340- upstream version 2.0.6
341
342* Fri Jun 16 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.5-2
343- bump for gnutls change in devel
344
345* Thu Apr 27 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.5-1
346- upstream version 2.0.5
347- removed patch2 - ownership of /heartbeat/crm/cib.xml is no longer
348  set in cts/CM_LinuxHAv2.py.in
349
350* Wed Mar 29 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.4-2
351- Version 2.0.4
352
353* Wed Mar  1 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-9
354- changed user creation
355- added patch2 heartbeat-2.0.3-fedora-ccmuser.patch 
356
357* Wed Mar  1 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-8
358- specifically excluded ldirectord symlink from heartbeat package
359- removed user and group deletion in postun
360- renamed subpackages ldirectord, pils and stonith to lose prefix heartbeat
361  by using -n
362
363* Tue Feb 28 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-7
364- fixed more rpmlint errors and warnings
365
366* Sat Feb 25 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-6
367- fixed number of rpmlint warnings and errors (still ignores some..)
368- generate 'predictable' uid and gid with fedora-usermgmt to use with
369  configure flag -with-ccmuser-id and groupadd, useradd
370- added Buildreq's: libtool-ltdl-devel, fedora-usermgmt-setup
371  net-snmp-devel, bzip2-devel
372- removed *.so duplication in heartbeat and heartbeat-devel
373- changed file sections
374
375* Fri Feb 24 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-5
376- useradd with fedora-usermgmt
377- added *.so file to -devel sub-package
378
379* Sat Feb 18 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-4
380- removed all perl requires; should be picked up by rpmbuild automagically
381- changed system user creation hacluster part to use baseid:
382  (http://fedoraproject.org/wiki/Packaging/UserCreation)
383
384* Thu Feb 16 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-3
385- removed Requires: python and gnutls
386- changed _libdir/ocf -> _prefix/lib/ocf
387- reversed subpackages depend on basepackage
388- removed Req swig (kept BuildReq)
389- added Req pygtk2
390
391* Wed Feb 15 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-2
392- fixes for various rpmlint errors and warnings
393- fixed setup -q
394- make subpackages depend on basepackage, not reverse
395- clean buildroot at beginning of install
396- replaced a number of hardcoded paths with RPM macros
397- Changed Group from Networking/Daemons to System Environment/Daemons
398- enable mgmt option
399
400* Sun Feb 12 2006  Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-1
401- rebuilt for Fedora Extras
402
403* Fri Feb 10 2006  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
404+ Version 2.0.3 - Bug fixes and significant new features.
405  + Management Daemon/Library and GUI client
406    + provide a management library for manamgement daemon and CIM provider
407    + provide a management daemon and a basic GUI management tool
408  + CIM enablement
409    + CIM (Common Information Model) enablement - works with
410      sblim-sfcb, OpenWBEM, and Pegasus CIMOMs
411    - not yet compiled into our binary RPMs because of dependencies
412  + CRM (Cluster Resource Manager) General
413    + All shutdowns go via the PE/TE - preserves inter-resource ordering
414    + Support for future changes to the CIB (depreciation of cib_fragment)
415    + Overhaul of IPC and HA channel callback logic
416    + Many improvments to the quality and quantity (reduced) of logging
417  + CRMd
418    + Timerless elections - when everyone has voted we're done
419    + Use the replace notification from the CIB to re-update our copy with
420      our view of our peers.
421    + Reliably detect if the LRM connection is still active.
422    + Elections
423      + newer versions defer to older ones in DC elections
424        (opposite of current behavior)
425      + this means that only once the complete cluster has been upgraded will
426        we start acting like the new version and accept new config options
427      + it also means newer PE's and TE's (the most complex pieces) don't need
428        "act like the old version" options and can rely on all slaves being at
429        least as up-to-date as they are
430      + people can run mixed clusters as long as they want
431        (until they want the new PE features)
432      + new DCs only update the version number in the CIB if they have a
433        higher value
434      + nodes that start and have a lower version than that stored in the CIB
435        shut themselves down (the CRM part anyway)
436      + this prevents an admin from introducing old nodes back into an upgraded
437        cluster. It probably doesn't fully understand the config and may not
438        support the actions the PE/TE requires.
439  + CIB (Common Information Base daemon)
440    + Make sure "query only" connections cant modify the CIB
441    + Periodically dump some stats about what the CIB has been doing.
442    + Verify there are no memory leaks
443    + Performance enhancements
444    + Prevent a single CIB client from blocking everyone else
445    + Clients Can be notified of full CIB replacements
446    + record_config_changes option in ha.cf for those worried about
447      the amount of logging.  Defaults to "on".
448    + suppress_cib_writes CIB option replaced with in enable_config_writes ha.cf
449      (enable_config_writes to be removed in 2.0.4)
450    + Never write the status section to disk
451    + Check permissions for the on-disk CIB at startup
452    + Dont trash unreadable on-disk CIBs
453    + Fix for updates made against the whole CIB (not just one section)
454  + PEngine (Policy Engine)
455    + Many improvements to the handling of resource groups
456    + Support "anonymous" clones
457    + Fix stonith ordering
458    + Order DC shutdowns after everyone else's
459    + Support short resource names (for group and clone resources)
460    + The ordering and colocation of grouped resources is now optional
461    + Support probing new nodes for active resources.
462    + All "probe" actions are controlled by the PE.
463      + No resource may be started until the probing is complete.
464      + Do not probe for resources we know to be active on unprobed nodes
465    + When looking for monitor ops, only mark it optional if it was already
466      active on the node we're interested in.
467    + Detect changes to class/type/provider/parameters and force a restart
468      of the resource
469    + New record_pengine_inputs option in ha.cf for those worried about
470      the amount of logging.  Defaults to "on".
471    + Differentiate between config and processing errors
472      + reduces the frequency that we need to log the complete CIB
473    + Make notify for master/slave work
474    + New CIB option: stop_orphan_actions (boolean)
475      If a resource is no longer defined, we can optionally stop it
476    + New CIB option: stop_orphan_actions (boolean)
477      If a monitor op for a given interval is no longer defined, we can
478      optionally stop it
479    + Add support for time and phase-of-the-moon based constraints
480    + Improved failure handling: avoiding false positives
481    + Always create orphaned resources - so they show up in crm_mon
482    + Do not require sequential clone numbers starting at 0
483  + TEngine (transition engine)
484    + Detect old stonith ops
485  + CLIs (Command Line interfaces)
486    + Create a --one-shot option for crm_mon
487    + Switch a number of CLI tools to use the new syncronous connections
488    + Log errors to stderr where they will be seen and therefore useful
489    + Support migration and un-migration of resources and resource groups
490    + Create crm_verify for checking configuration validity
491    + Simplify the passing of XML to cibadmin
492  + Known open bugs worth mentioning:
493    + 1075, 1080, 1081, 1084, 1085, 1064, 1069, 756, 984
494    + 1050, 1082, 1037, 1079
495   
496* Thu Sep 22 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
497+ Version 2.0.2 - small bug fix only release
498  + Fixed a bug in ping directive - it works again
499  + Added a check to BasicSanityCheck to check ping and ping_group directives
500  + fixed cl_status nodestatus to return 0 if a node has status "ping"
501  + fixed a memory leak in the CRM's LRM interface code
502  + fixed code which deterimines which version of the CRM becomes
503    the DC when basic CIB schema versions differ.  It now prefers
504    the older version to be DC instead of the newer version.
505
506* Wed Sep 14 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
507+ Version 2.0.1 -
508  + Communication Layer
509    + netstring encoding format is changed to be more efficient
510    + add compression capability for big messages
511  + Add man pages for hb_standby/hb_takeover   
512  + The assert triggered by 2.0.0 has been fixed
513  + CIB can now contain XML comments and/or be in DOS format   
514  + Includes implementation of the ISO8601 date format
515  + New CLI tools for changing cluster preferences, node attributes
516    and node standby
517  + Improved recovery and placement of group resources
518  + Detection of failed nodes by the Policy Engine is fixed
519  + New Policy Engine features
520    http://www.linux-ha.org/ClusterResourceManager/DTD1.0/Annotated :
521      sections 1.5.[8,9,10,12]
522    + Constraints and instance attributes can now be active conditionally
523    + Rules can now contain other rules
524    + Date/Time based expressions are supported
525    + Cloned resources can now optionally be notified before and after
526      any of its peers are stopped or started.
527    + The cluster can re-evaluate the configuration automatically after
528      a defined interval of idleness
529  + Removed a flow control message which was very annoying when operating
530    in a mixed 1.x/2.x environment
531  -- Known Bugs :-( --
532    - Bug 859 - FSA took too long to complete action - fully recovered from
533    - Bug 882 - IPC channel not connected during shutdown - harmless
534    - Bug 879 - Failed actions cause extra election - harmless
535 Each of these occurs about once or twice in 5000 test iterations
536       - This is probably > 10K failovers
537    - rsc_location constraints cannot have rules that contain other rules
538      (fixed in CVS after release)
539* Fri Jul 29 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
540+ Version 2.0.0 - First stable release of the next generation of the Linux-HA project
541  + Basic Characteristics described here:
542        http://linux-ha.org/FactSheetv2
543  + Core infrastructure improvments:
544    + Messaging (message acknowledging and flow control)
545    + Logging   (logging daemon)
546  + Release 1.x style (2-node) clusters fully supported
547  + Multi-node support (so far up to 16-node clusters tested)
548        See http://linux-ha.org/GettingStartedV2 for more information
549  + New components:
550    + Cluster Information Base    (replicated resource configuration)
551    + Cluster Resource Manager    (supporting 1->N nodes)
552    + Modular Policy Engine       (controlling resource placement)
553    + Local Resource Manager      (policy free, not cluster aware)
554    + Stonith Daemon              (stand-alone fencing  subsytem)
555  + Support for OCF and LSB resource agents
556  + Support for composite resource types (groups, clones)
557  + Support for a rich set of resource location and ordering constraints
558  + Conversion tool for existing haresources
559  + Resources monitored by request
560  + Resource "maintenance" mode
561  + Several failback, failure and "No Quorum" behaviours to choose from
562        (global defaults and per action or resource)
563  + Sample cluster state and configuration monitoring tools
564
565  Known issues in 2.0.0:
566    - Under some rare circumstances the cluster manager will time out
567      while stabilizing a new cluster state.  This appears to be
568        otherwise harmless - the cluster is actually fine.
569        http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=770
570    - Under some rare circumstances, a dev assert will be triggered
571        in unpack.c.  This results in the pengine getting restarted.
572        This is annoying, but not a disaster.
573        http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=797
574
575* Tue May 23 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
576+ Version 1.99.5 - Near-final beta of 2.0.0 release
577  + many bug fixes - code looks very stable at this point
578    -- well tested at this point on 4 and 8 node clusters.
579
580* Thu Apr 07 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
581+ Version 1.99.4 - Near-final beta of 2.0.0 release
582  + many bug fixes since 1.99.1
583  + new external STONITH model - fully supports scripting interface
584  + tested through 12 node clusters successfully
585  + No serious defects found in testing
586  + Easier-to-understand locational constraints model
587  + Many bug fixes of many kinds
588  + Important bug fixes to OCF IPaddr resource agent
589  + Resources are monitored only on request
590  + See http://wiki.linux-ha.org/ClusterResourceManager/Setup
591    for basic ideas about getting started.
592  + Release 1 style (2-node) clusters still fully supported
593  + Release 2 style clusters support 1-N node clusters
594        (where N is probably something like 8-32)
595
596* Tue Mar 20 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
597+ Version 1.99.3 - Near-final beta "technology preview" of 2.0.0 release
598  + many bug fixes since 1.99.1
599  + tested through 12 node clusters with reasonable success
600  + new STONITH API
601
602* Sun Feb 20 2005  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
603+ Version 1.99.2 - Near-final beta "technology preview" of 2.0.0 release
604  + Many many many changes.  Far too many to describe here.
605  + See http://wiki.linux-ha.org/ClusterResourceManager/Setup
606    for certain basic ideas about getting started.
607
608* Mon Oct 11 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
609+ Version 1.99.1 - *early* beta series - preparing for 2.0.0
610  + Andrew provided a number of fixes to the CRM and 2.0 features
611  + Fixed a problem with retrying failed STONITH operations
612
613* Mon Oct 11 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
614+ Version 1.99.0 - *early* beta series - preparing for 2.0.0
615  + All STABLE changes noted below have been ported to this branch
616  + Included in this release is a beta of the next generation of Heartbeat
617        resource manager developed by Andrew Beekhof. 
618        http://linuxha.trick.ca/NewHeartbeatDesign is a good place to learn
619        more about this effort. Please examine crm/README, crm/test/README
620        and crm/crm-1.0.dtd for example usage and configuration.
621  + Also included is the L(ocal) R(esource) M(anager) developed by IBM China
622        which is an integral part of the NewHeartbeatDesign.
623  + Known caveats:
624    - STONITH as a whole has seen a code cleanup and should be tested
625      carefully.
626    - The external STONITH plug-in has undergone major surgery and
627      probably doesn't work yet.
628    - the new CRM is not perfectly stable with 3 nodes yet.
629  + PLEASE see http://osdl.org/developer_bugzilla/enter_bug.cgi?product=Linux-HA
630    and use it to report quirks and issues you find!
631 
632* Sat Sep 18 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
633+ Version 1.2.3 (stable)
634  + fixed a serious error which causes heartbeat to misbehave after about
635        10 months of continuous operation
636  + Made our ARP packets more RFC compliant
637  + Extended apcmastersnmp code to deal with new devices
638  + fixed a bug concerning simultaneous stops of both machines causing one
639        of them to not shut down.
640  + added an option to suppress reporting on packet corruption
641  + fixed it so that we don't create the FIFO by the RPM
642  + made cl_status setgid so anyone can run it, and fixed exit codes
643  + eliminated a serious memory leak associated with client code
644  + packaged doc files which had been missed before
645  + fixed many many small bugs and memory leaks detected by BEAM
646  + added several new test cases
647  + fixed longstanding bug in plugin unloading
648  + fixed a shutdown hang problem
649  + several fixes for Solaris, FreeBSD
650  + Solaris packaging now included in base
651  + fixed a bug related to the apache resource agent not handling
652        quoted parameters
653  + added use_apphbd parameter to have heartbeat register
654        with apphbd instead of watchdog device when desired
655  + changed apphbd to default its config file to /etc
656  + added snmp subagent code
657  + added hbaping communications plugin
658  + added external STONITH plugin
659  + ldirectord: fixed a bug where real servers that were are
660        present in multiple virtual services will only be added
661        to one virtual service.
662
663* Mon May 11 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
664+ Version 1.2.2 (stable)
665  + Fixed several format string errors in communication plugins
666  + Fixed a bug which kept us from diagnosing errors in non-aliased interfaces
667  + Fixed a bug in ipaddr which caused an infinite loop when auto_failback on
668  + Updated Debian things...
669  + Added IPv6addr resource agent
670  + Added ibmhmc STONITH plugin
671  + Added cl_status command
672  + Fixed a bug regarding restarts when auto_failback is on...
673  + Fixed a couple of bugs in sha1 authentication method for very long keys
674  + Fixed a bug in the portblock resource agent so that it no longer blocks
675          ports on the loopback interface
676  + Increased the time allowed for split brain test before it declares failure
677
678+ Version 1.2.1 (stable)
679  + Netstrings can now be used for our on-the-wire data format
680  + Perl/SWIG bindings added for some heartbeat libraries
681  + Significant improvements to SAF data checkpointing API
682  + Implemented unbuffered ipcsocket code for SAF APIs
683  + Many Solaris fixes -- except for ipfail, Solaris works
684  + Significant library restructuring
685  + Watchdog device NOWAYOUT is now overridded if defaulted
686  + Watchdog device now kills machine instantly after deadtime
687        instead of after one minute
688  + Hostnames should now be treated case-independently...
689  + Added new client status APIs - client_status() and cstatus_callback()
690  + Fixed bug with auto_failback and quick full restarts
691  + We now automatically reboot when resources fail to stop correctly...
692  + We now check the status of the configured STONITH device hourly...
693  + STONITH operations repeat after a 5 second delay, not immediately...
694  + Added hb_takeover command - complement to hb_standby
695  + Added documentation on how to use evlog/TCP to enable testing to
696        take place without losing messages due to UDP message forwarding
697  + Several new tests from Mi, Jun - split brain, bandwidth, failure
698        detection time.
699  + Fix to LVM resource from Harald Milz <hm@muc.de>
700  + Fixed FreeBSD authentication problems breaking ipfail
701  + Fixed .so loading on Debian
702  + Fixed false complaints about resource scripts (from Jens Schmalzing)
703  + Fixed false stop failure from LinuxSCSI  (from Jens Schmalzing <j.s@lmu.de>)
704
705
706
707* Thu Apr 15 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
708+ Version 1.3.0 - beta series
709  + Netstrings can now be used for our on-the-wire data format
710  + Perl/SWIG bindings added for some heartbeat libraries
711  + Significant improvements to SAF data checkpointing API
712  + Implemented unbuffered ipcsocket code for SAF APIs
713  + Many Solaris fixes -- except for ipfail, Solaris works
714  + Significant library restructuring
715  + Watchdog device NOWAYOUT is now overridded if defaulted
716  + Watchdog device now kills machine instantly after deadtime
717        instead of after one minute
718  + Hostnames should now be treated case-independently...
719  + Added new client status APIs - client_status() and cstatus_callback()
720  + Fixed bug with auto_failback and quick full restarts
721  + We now automatically reboot when resources fail to stop correctly...
722  + We now check the status of the configured STONITH device hourly...
723  + STONITH operations repeat after a 5 second delay, not immediately...
724  + Added hb_takeover command - complement to hb_standby
725  + Added documentation on how to use evlog/TCP to enable testing to
726        take place without losing messages due to UDP message forwarding
727  + Several new tests from Mi, Jun - split brain, bandwidth, failure
728        detection time.
729  + Fix to LVM resource from Harald Milz <hm@muc.de>
730
731* Tue Feb 16 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
732+ Version 1.2.0
733  + Replaced the nice_failback option with the auto_failback option.
734        THIS OBSOLETES THE NICE_FAILBACK OPTION. READ THE DOCS FOR HOW
735        TO UPGRADE SMOOTHLY.
736  + Added a new feature to hb_standby which allows you to give up
737          any specific category of resources:  local, foreign, or all.
738        The old behavior is "all" which is the default.
739        This allows you to put a auto_failback no cluster into
740          an active/active configuration on demand.
741  + ipfail now works properly with auto_failback on (active/active)
742  + ipfail now has "hysteresis" so that it doesn't respond immediately
743        to a network failure, but waits a little while so that the
744        damage can be properly assessed and extraneous takeovers avoided
745  + Added new ping node timeout directive "deadping"
746  + Made sure heartbeat preallocated stack and heap, and printed a
747        message if we allocate heap once we're started up...
748  + IPMILan STONITH plugin added to CVS
749  + Added IPaddr2 resource script
750  + Made the APC smart UPS ups code compatible with more UPSes
751  + Added a (preliminary?) ordered messaging facility from Yi Zhu
752  + Changed IPaddr's method of doing ARPs in background so that
753        certain timing windows were closed.
754  + Added OCF (wrapper) resource script
755  + Allow respawn programs to take arguments
756  + Added pinggroups (where any node being up is OK)
757  + SIGNIFICANT amount of internal rearchitecture.
758  + Many bug fixes.
759  + Several documentation updates.
760
761* Tue Feb 10 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
762+ Version 1.1.5
763  + ipfail now has "hysteresis" so that it doesn't respond immediately
764        to a network failure, but waits a little while so that the
765        damage can be properly assessed and extraneous takeovers avoided
766  + Several fixes to cl_poll()
767  + More fixes to the IPC code - especially handling data reception
768        after EOF
769  + removed some unclean code from GSource for treating EOF conditions
770  + Several bugs concerning hanging when shutting down early during startup
771  + A few BasicSanityCheck bug fixes
772  + CTS now allows a single machine to be able to monitor several clusters
773  + Most former CTS options are now either unneeded or on the command line
774  + Increased number of ARPs and how long they're being sent out
775  + Fixed uncommon (authorization) memory leak
776  + Some Solaris portability fixes.
777  + Made init script handle standby correctly for new config files
778  + Improved the fast failure detection test
779  + Added some backwards compatibility for nice_failback and some default
780        authentication directives
781  + Corrected the 1.1.4 change log
782 
783
784* Fri Jan 22 2004  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
785+ Version 1.1.4
786  + ipfail now works properly with auto_failback on (active/active)
787  + Changed the API to use sockets (IPC library) instead of FIFOs.
788  + Added new apiauth directives to provide authorization information
789        formerly provided by the FIFO permissions.
790  + Added Intel's implementation of the SAF data checkpointing API and daemon
791  + Added a cleanup suggested by Emily Ratliff.
792  + IPMILan STONITH plugin added to CVS
793  + Added IPaddr2 resource script
794  + Various cleanups due to horms.
795  + Fixed authentication to work on 64-bit platforms(!)
796  + Fixed the cl_poll() code to handle corner cases better
797  + Made heartbeat close watchdog device before re-execing itself
798  + New CTS improvements from Mi, Jun <jun.mi@intel.com>
799  + Various minor bug fixes.
800      . Several shutdown bugs addressed
801      . fixed sendarp to make a pid file, so we can shut it down
802          when we shut everything else down in case it's still running.
803      . Lots of minor bug fixes to IPC code
804      . Lots of minor bug fixes to ipctest program
805      . made BasicSanityCheck more tolerant of delays
806      . Fixed IPC code to authenticate based on ints, not int*s.
807      . Check properly for strnlen instead of strlen...
808      . Several signed/unsigned fixes
809      . A few uninitialized vars now are inited
810      . Switched to compiling lex/yacc sources the automake way
811      . Lots of minor CTS fixes...
812
813  + ldirectord bug fixes:
814    . When new real servers are added on initialisation or when
815        the configuration file is reread they are marked with status
816        of -1 (uninitialised) so they will be checked and inserted
817        into the virtual service as required
818    . All checks use the checkport if set, otherwise the port set for
819        the individual real server. This was the case for http and
820        connect checks, but others had variations on this theme.
821    . When the configuration file is reread because it changed
822        on disk and autoreload is set, check the real servers
823        immediately rather than waiting for checkinterval to expire
824    . Already running message sent to stderr instead of stdout
825    . Support alternate server in real-server specific URL
826    . Treat the same real server with different weights as a different
827        real server. Fixes bug reported by Philip Hayward whereby the same
828        real-server would always have the same weight, regardless of
829        the ldirectord.cf
830
831* Fri Sep 26 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
832+ Version 1.1.3
833  + Bugfix for heartbeat starting resources twice concurrently if
834    auto_failback was set to "legacy".
835  + Bugfix for messages getting lost if messages were sent in quick
836    succession. (Kurosawa Takahiro)
837  + Bugfix for Filesystem resource checking for presence of filesystem
838    support before loading the module.
839  + BasicSanityCheck extended to cover more basic tests.
840  + Bugfix for findif not working correctly for CIDR netmasks.
841  + Minor bugfix for ldirectord recognizing new schedulers correctly and
842    timeout settings are now being honoured.
843  + Enhanced the message giving a better explanation of how to set up node
844    names properly when current node not found in the ha.cf file
845  + Send a message to the cluster whenever we have a node which doesn't
846    need STONITHing - even though it's gone down.  This fix needed
847    by CCM, which is in turn needed by EVMS.
848  + Enhanced the messages for missing ha.cf and missing haresources files
849    explaining that sample config files are found in the documentation.
850  + Fix for memory leak from Forrest Zhao<forrest.zhao@intel.com>
851  + Added a (preliminary?) ordered messaging facility from Yi Zhu
852  + FAQ updates
853  + Added Xinetd resource script
854  + Added OCF (wrapper) resource script
855  + Allow respawn programs to take arguments
856  + Added pinggroups (where any node being up is OK)
857  + fixed ldirectord negotiatetimeout for HTTP
858  + fixed a bug which caused -d flag to be ignored
859  + failing resource scripts are now ERRORs not WARNings
860  + now shuts down correctly when auto_failback == legacy
861
862
863* Mon Jul 13 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
864+ Version 1.1.2
865  + Replaced the nice_failback option with the auto_failback option.
866        THIS OBSOLETES THE NICE_FAILBACK OPTION. READ THE DOCS FOR HOW
867        TO UPGRADE SMOOTHLY.
868  + Changed IPaddr to not do ARPs in background, and shortened time
869        between ARPs.  Also made these things tunable...
870  + changed our comm ttys to not become our controlling TTYs
871  + Enhanced the ServeRAID script to fix a critical bug by using a new feature
872  + Added a new DirectoryMap to CVS - tells where everything is...
873  + significantly enhanced the BasicSanityCheck script, and the tests
874        it calls.
875  + added a new option to use a replacement poll function for improved
876        real-time performance.
877  + added the ability to have a cluster node's name be different
878        from it's uname -n
879  + Moved where CTS gets installed to /usr/lib/heartbeat/cts
880  + Big improvements to the CTS README from IBM test labs in Austin.
881  + bug fixes to the WTI NPS power switch
882  + new client API calls:
883        return arbitrary configuration parameters
884        return current resource status
885  + Added a new clplumbing function: mssleep()
886  + added new capabilities for supporting pseudo-resources
887  + added new messages which come out after initial takeover is done
888         (improves CTS results)
889  + LOTS of documentation updates.
890  + fixed a security vulnerability
891  + fixed a bug where heartbeat would shut down while in the middle
892        of processing resource movement requests.
893  + changed compilation flags to eliminate similar future security
894        issues
895  + went to even-more-strict gcc flags
896  + fixed several "reload" bugs.  Now reload works ;-)
897  + fixed STONITH bug when other node never heard from.
898  + Minor bug fixes (cleaned up corrupted message)
899  + Two different client API bugs fixed.
900  + changed the configure script to test which warning flags are
901        supported by the current gcc.
902  + enhanced the API test program to test new capabilities...
903
904
905* Wed May 21 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
906+ Version 1.1.1
907  + Significant restructuring of the processes in heartbeat
908  + Added a new feature to hb_standby which allows you to give up
909          any specific category of resources:  local, foreign, or all.
910        The old behavior is "all" which is the default.
911        This allows you to put a nice_failback cluster into
912          an active/active configuration
913  + Enhancements to the ServeRAID code to make it work with the new
914    (supported) version of IPSSEND from the ServeRAID folks...
915  + Added STONITH code for the Dell remote access controller
916  + Fixed a major bug which kept it from taking over correctly after 246
917        days or so
918  + Fixed a major bug where heartbeat didn't lock itself into memory
919        properly
920  + Added new ping node timeout directive "deadping"
921  + Made sure heartbeat preallocated stack and heap, and printed a
922        message if we allocate heap once we're started up...
923  + Minor heartbeat API bug fixes
924  + Minor documentation fixes
925  + Minor fix to allow IP addresses with /32 masks...
926  + Fixed a timing window for !nice_failback resource acquisition
927  + Added several CCM bug fixes
928  + Made the APC smart UPS ups code compatible with more UPSes
929  + Fixed a bug in respawn
930  + Enhanced internal checking for malloc errors...
931  + Added IP alias search optimization from Sean Reifscheneider
932
933* Wed Mar 19 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
934+ Version 1.0.2:
935  + Fixed comment errors in heartbeat init script to allow it to run on RH 8.0
936  + Changed apphbd to use poll(2) instead of sigtimedwait(2)
937  + Put missing files into tarball
938  + Documentation improvements for IPaddr and other things
939  + Fixed an error in hb_standby which kept it from working if releasing
940    resources takes more than 10 seconds
941  + Added a fix to allow heartbeat to run on systems without writable disk
942    (like routers booting from CD-ROM)
943  + Added configuration file for apphbd
944  + Added fix from Adam Li to keep recoverymgr stop looping at high priority
945  + Added fix to ServeRAID resource to make it work with (new) supported
946    hardware
947  + Added Delay resource script
948  + Added fix to Filesystem to allow it to support NFS mounts and allow
949    user to specify mount options
950  + Added fix to IPaddr to make tmp directory for restoring loopback device
951  + Added fix to ipcsocket code to deal correctly with EAGAIN when sending
952    message body
953
954* Mon Feb 17 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
955+ Version 1.0.1:
956  + Fixed some compile errors on different platforms, and library versions
957  + Disable ccm from running on 'ping' nodes
958  + Put in Steve Snodgrass' fix to send_arp to make it work on non-primary
959        interfaces.
960
961* Thu Feb 13 2003  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
962+ Version 1.0.1 beta series
963
964  0.4.9g:
965  + Changed default deadtime, warntime, and heartbeat interval
966  + Auto* tool updates
967  + VIP loopback fixes for IP address takeover
968  + Various Solaris and FreeBSD fixes
969  + added SNMP agent
970  + Several CCM bug fixes
971  + two new heartbeat API calls
972  + various documentation fixes, including documentation for ipfail
973  + Numerous minor cleanups.
974  + Fixed a few bugs in the IPC code.
975  + Fixed the (IPC) bug which caused apphbd to hang the whole machine.
976  + Added a new IPC call (waitout)
977  + Wrote a simple IPC test program.
978  + Clarified several log messages.
979  + Cleaned up the ucast communications plugin
980  + Cleaned up for new C compilers
981  + Fixed permissions bug in IPC which caused apphbd to not be usable by all
982  + Added a new rtprio option to the heartbeat config file
983  + updated apphbtest program
984  + Changed ipfail to log things at same level heartbeat does
985
986
987* Sat Nov 30 2002  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
988+ Version 0.5 beta series (now renamed to 1.0.1 beta series).
989  0.4.9f:
990  + Added pre-start, pre-stop, post-stop and pre-stop constructs in init script
991  + various IPC fixes
992  + Fix to STONITH behavior: STONITH unresponsive node right after we reboot
993  + Fixed extreme latency in IPC code
994  + various configure.in cleanups
995  + Fixed memory leak in IPC socket code
996  + Added streamlined mainloop/IPC integration code
997  + Moved more heartbeat internal communication to IPC library
998  + Added further support for ipfail
999  + Added supplementary groups to the respawn-ed clients
1000  + Added standby to init script actions
1001  + Lots of minor CCM fixes
1002  + Split (most) resource management code into a separate file.
1003  + Fixes to accommodate different versions of libraries
1004  + Heartbeat API client headers fixup
1005  + Added new API calls
1006  + Simplified (and fixed) handling of local status.  This would sometimes cause
1007        obscure failures on startup.
1008  + Added new IPsrcaddr resource script
1009
1010  KNOWN BUGS:
1011  + apphbd goes into an infinite loop on some platforms
1012
1013* Wed Oct 9 2002  Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
1014  0.4.9e:
1015  + Changed client code to keep write file descriptor open at all times
1016        (realtime improvement)
1017  + Added a "poll replacement"  function based on sigtimedwait(2), which
1018        should be faster for those cases that can use it.
1019  + Added a hb_warntime() call to the application heartbeat API.
1020  + Changed all times in the configuration file to be in milliseconds
1021        if specified with "ms" at the end.  (seconds is still the default).
1022  + Fixes to serious security issue due to Nathan Wallwork <nwallwo@pnm.com>
1023  + Changed read/write child processes to run as nobody.
1024  + Fixed a bug where ping packets are printed incorrectly when debugging.
1025  + Changed heartbeat code to preallocate a some heap space.
1026  + CCM daemon API restructuring
1027  + Added ipc_channel_pair() function to the IPC library.
1028  + Changed everything to use longclock_t instead of clock_t
1029  + Fixed a bug concerning the ifwalk() call on ping nodes in the API
1030  + Made apphbd run at high priority and locked into memory
1031  + Made a library for setting priority up.
1032  + Made ucast comm module at least be configurable and loadable.
1033  + Fixed a startup/shutdown timing problem.
1034
1035  0.4.9d:
1036  + removed an "open" call for /proc/loadavg (improve realtime behavior)
1037  + changed API code to not 1-char reads from clients
1038  + Ignored certain error conditions from API clients
1039  + fixed an obscure error message about trying to retransmit a packet
1040        which we haven't sent yet.  This happens after restarts.
1041  + made the PILS libraries available in a separate package
1042  + moved the stonith headers to stonith/... when installed
1043  + improved debugging for NV failure cases...
1044  + updated AUTHORS file and simplified the changelog authorship
1045        (look in AUTHORS for the real story)
1046  + Added Ram Pai's CCM membership code
1047  + Added the application heartbeat code
1048  + Added the Kevin Dwyer's ipfail client code to the distribution
1049  + Many fixes for various tool versions and OS combinations.
1050  + Fixed a few bugs related to clients disconnecting.
1051  + Fixed some bugs in the CTS test code.
1052  + Added BasicSanityCheck script to tell if built objects look good.
1053  + Added PATH-like capabilities to PILS
1054  + Changed STONITH to use the new plugin system.
1055  + *Significantly* improved STONITH usage message (from Lorn Kay)
1056  + Fixed some bugs related to restarting.
1057  + Made exit codes more LSB-compliant.
1058  + Fixed various things so that ping nodes don't break takeovers.
1059
1060  0.4.9c and before:
1061  + Cluster partitioning  now handled correctly (really!)
1062  + Complete rearchitecture of plugin system
1063  + Complete restructure of build system to use automake and port things
1064        to AIX, FreeBSD and solaris.
1065  + Added Lclaudio's "standby" capability to put a node into standby
1066        mode on demand.
1067  + Added code to send out gratuitous ARP requests as well as gratuitous
1068        arp replies during IP address takeover.
1069  + Suppress stonith operations for nodes which went down gracefully.
1070  + Significantly improved real-time performance
1071  + Added new unicast heartbeat type.
1072  + Added code to make serial ports flush stale data on new connections.
1073  + The Famous CLK_TCK compile time fixes (really!)
1074  + Added a document which describes the heartbeat API
1075  + Changed the code which makes FIFOs to not try and make the FIFOs for
1076        named clients, and several other minor API client changes.
1077  + Fixed a fairly rare client API bug where it would shut down the
1078        client for no apparent reason.
1079  + Added stonith plugins for: apcmaster, apcmastersnmp switches, and ssh
1080        module (for test environments only)
1081  + Integrated support for the Baytech RPC-3 switch into baytech module
1082  + Fixes to APC UPS plugin
1083  + Got rid of "control_process: NULL message" message
1084  + Got rid of the "controlfifo2msg: cannot create message" message
1085  + Added -h option to give usage message for stonith command...
1086  + Wait for successful STONITH completion, and retry if its configured.
1087  + Sped up takeover code.
1088  + Several potential timing problems eliminated.
1089  + Cleaned up the shutdown (exit) code considerably.
1090  + Detect the death of our core child processes.
1091  + Changed where usage messages go depending on exit status from usage().
1092  + Made some more functions static.
1093  + Real-time performance improvement changes
1094  + Updated the faqntips document
1095  + Added a feature to heartbeat.h so that log messages get checked as
1096        printf-style messages on GNU C compilers
1097  + Changed several log messages to have the right parameters (discovered
1098        as a result of the change above)
1099  + Numerous FreeBSD, Solaris and OpenBSD fixes.
1100  + Added backwards compatibility kludge for udp (versus bcast)
1101  + Queued messages to API clients instead of throwing them away.
1102  + Added code to send out messages when clients join, leave.
1103  + Added support for spawning and monitoring child clients.
1104  + Cleaned up error messages.
1105  + Added support for DB2, ServeRAID and WAS, LVM, and Apache (IBMhttp too),
1106    also ICP Vortex controller.
1107  + Added locking when creating new IP aliases.
1108  + Added a "unicast" media option.
1109  + Added a new SimulStart and standby test case.
1110  + Diddled init levels around...
1111  + Added an application-level heartbeat API.
1112  + Added several new "plumbing" subsystems (IPC, longclock_t, proctrack, etc.)
1113  + Added a new "contrib" directory.
1114  + Fixed serious (but trivial) bug in the process tracking code which caused
1115        it to exit heartbeat - this occured repeatably for STONITH operations.
1116  + Write a 'v' to the watchdog device to tell it not to reboot us when
1117        we close the device.
1118  + Various ldirectord fixes due to Horms
1119  + Minor patch from Lorn Kay to deal with loopback interfaces which might
1120        have been put in by LVS direct routing
1121  + Updated AUTHORS file and moved list of authors over
1122
1123* Fri Mar 16 2001  Alan Robertson <alanr@unix.sh>
1124+ Version 0.4.9
1125
1126  + Split into 3 rpms - heartbeat, heartbeat-stonith heartbeat-ldirectord
1127
1128  + Made media modules and authentication modules and stonith modules
1129        dynamically loadable.
1130
1131  + Added Multicast media support
1132  + Added ping node/membership/link type for tiebreaking.  This will
1133        be useful when implementing quorum on 2-node systems.
1134        (not yet compatible with nice_failback(?))
1135  + Removed ppp support
1136
1137  + Heartbeat client API support
1138
1139  + Added STONITH API library
1140    +   support for the Baytech RPC-3A power switch
1141    +   support for the APCsmart UPS
1142    +   support for the VACM cluster management tool
1143    +   support for WTI RPS10
1144    +   support for Night/Ware RPC100S
1145    +   support for "Meatware" (human intervention) module
1146    +   support for "null" (testing only) module
1147
1148  + Fixed startup timing bugs
1149  + Fixed shutdown sequence bugs: takeover occured before
1150        resources were released by other system
1151  + Fixed various logging bugs
1152  + Closed holes in protection against replay attacks
1153
1154  + Added checks that complain if all resources aren't idle on startup.
1155  + IP address takeover fixes
1156      + Endian fixes
1157      + Removed the 8-alias limitation
1158      + Takeovers now occur faster (ARPs occur asynchronously)
1159
1160  + Port number changes
1161    + Use our IANA port number (694) by default
1162    + Recognize our IANA port number ("ha-cluster") if it's in /etc/services
1163
1164  + Moved several files, etc. from /var/run to /var/lib/heartbeat
1165  + Incorporated new ldirectord version
1166  + Added late heartbeat warning for late-arriving heartbeats
1167  + Added detection of and partial recovery from cluster partitions
1168  + Accept multiple arguments for resource scripts
1169  + Added Raid1 and Filesystem resource scripts
1170  + Added man pages
1171  + Added debian package support
1172
1173* Fri Jun 30 2000 Alan Robertson <alanr@unix.sh>
1174+ Version 0.4.8
1175  + Incorporated ldirectord version 1.9 (fixes memory leak)
1176  + Made the order of resource takeover more rational:  Takeover is now
1177    left-to-right, and giveup is right-to-left
1178  + Changed the default port number to our official IANA port number (694)
1179  + Regularized more messages, eliminated some redundant ones.
1180  + Print the version of heartbeat when starting.
1181  + Print exhaustive version info when starting with debug on.
1182  + Hosts now have 3 statuses {down, up, active} active means that it knows
1183        that all its links are operational, and it's safe to send cluster
1184        messages
1185  + Significant revisions to nice_failback (mainly due to lclaudio)
1186  + More SuSE-compatibility. Thanks to Friedrich Lobenstock <fl@fl.priv.at>
1187  + Tidied up logging so it can be to files, to syslog or both (Horms)
1188  + Tidied up build process (Horms)
1189  + Updated ldirectord to produce and install a man page and be
1190    compatible with the fwmark options to The Linux Virtual Server (Horms)
1191  + Added log rotation for ldirectord and heartbeat using logrotate
1192    if it is installed
1193  + Added Audible Alarm resource by Kirk Lawson <lklawson@heapy.com>
1194    and myself (Horms)
1195  + Added init script for ldirectord so it can be run independently
1196    of heartbeat (Horms)
1197  + Added sample config file for ldirectord (Horms)
1198  + An empty /etc/ha.d/conf/ is now part of the rpm distribution
1199    as this is where ldirectord's configuration belongs (Horms)
1200  + Minor startup script tweaks.  Hopefully, we should be able to make core
1201    files should we crash in the future.  Thanks to Holger Kiehl for diagnosing
1202    the problem!
1203  + Fixed a bug which kept the "logfile" option from ever working.
1204  + Added a TestCluster test utility.  Pretty primitive so far...
1205  + Fixed the serial locking code so that it unlocks when it shuts down.
1206  + Lock heartbeat into memory, and raise our priority
1207  + Minor, but important fix from lclaudio to init uninited variable.
1208
1209* Sat Dec 25 1999 Alan Robertson <alanr@unix.sh>
1210+ Version 0.4.7
1211  + Added the nice_failback feature. If the cluster is running when
1212        the primary starts it acts as a secondary. (Luis Claudio Goncalves)
1213  + Put in lots of code to make lost packet retransmission happen
1214  + Stopped trying to use the /proc/ha interface
1215  + Finished the error recovery in the heartbeat protocol (and got it to work)
1216  + Added test code for the heartbeat protocol
1217  + Raised the maximum length of a node name
1218  + Added Jacob Rief's ldirectord resource type
1219  + Added Stefan Salzer's <salt@cin.de> fix for a 'grep' in IPaddr which
1220        wasn't specific enough and would sometimes get IPaddr confused on
1221        IP addresses that prefix-matched.
1222  + Added Lars Marowsky-Bree's suggestion to make the code almost completely
1223        robust with respect to jumping the clock backwards and forwards
1224  + Added code from Michael Moerz <mike@cubit.at> to keep findif from
1225        core dumping if /proc/route can't be read.
1226
1227* Mon Nov 22 1999 Alan Robertson <alanr@unix.sh>
1228+ Version 0.4.6
1229  + Fixed timing problem in "heartbeat restart" so it's reliable now
1230  + Made start/stop status compatible with SuSE expectations
1231  + Made resource status detection compatible with SuSE start/stop expectations
1232  + Fixed a bug relating to serial and ppp-udp authentication (it never worked)
1233  + added a little more substance to the error recovery for the HB protocol.
1234  + Fixed a bug for logging from shell scripts
1235  + Added a little logging for initial resource acquisition
1236  + Added #!/bin/sh to the front of shell scripts
1237  + Fixed Makefile, so that the build root wasn't compiled into pathnames
1238  + Turned on CTSRTS, enabling for flow control for serial ports.
1239  + Fixed a bug which kept it from working in non-English environments
1240
1241* Wed Oct 13 1999 Alan Robertson <alanr@unix.sh>
1242+ Version 0.4.5
1243  + Mijta Sarp added a new feature to authenticate heartbeat packets
1244        using a variety of strong authentication techniques
1245  + Changed resource acquisition and relinquishment to occur in heartbeat,
1246       instead of in the start/stop script.  This means you don't *really*
1247       have to use the start/stop script if you don't want to.
1248  + Added -k option to gracefully shut down current heartbeat instance
1249  + Added -r option to cause currently running heartbeat to reread config files
1250  + Added -s option to report on operational status of "heartbeat"
1251  + Sped up resource acquisition on master restart.
1252  + Added validation of ipresources file at startup time.
1253  + Added code to allow the IPaddr takeover script to be given the
1254        interface to take over, instead of inferring it.  This was requested
1255        by Lars Marowsky-Bree
1256  + Incorporated patch from Guenther Thomsen to implement locking for
1257        serial ports used for heartbeats
1258  + Incorporated patch from Guenther Thomsen to clean up logging.
1259        (you can now use syslog and/or file logs)
1260  + Improved FreeBSD compatibility.
1261  + Fixed a bug where the FIFO doesn't get created correctly.
1262  + Fixed a couple of uninitialized variables in heartbeat and /proc/ha code
1263  + Fixed longstanding crash bug related to getting a SIGALRM while in malloc
1264        or free.
1265  + Implemented new memory management scheme, including memory stats
1266
1267* Thu Sep 16 1999 Alan Robertson <alanr@unix.sh>
1268+ Version 0.4.4
1269  + Fixed a stupid error in handling CIDR addresses in IPaddr.
1270  + Updated the documentation with the latest from Rudy.
1271
1272* Wed Sep 15 1999 Alan Robertson <alanr@unix.sh>
1273+ Version 0.4.3
1274  + Changed startup scripts to create /dev/watchdog if needed
1275  + Turned off loading of /proc/ha module by default.
1276  + Incorporated bug fix from Thomas Hepper <th@ant.han.de> to IPaddr for
1277        PPP configurations
1278  + Put in a fix from Gregor Howey <ghowey@bremer-nachrichten.de>
1279        where Gregor found that I had stripped off the ::resourceid part
1280        of the string in ResourceManager resulting in some bad calls later on.
1281  +  Made it compliant with the FHS (filesystem hierarchy standard)
1282  +  Fixed IP address takeover so we can take over on non-eth0 interface
1283  +  Fixed IP takeover code so we can specify netmasks and broadcast addrs,
1284        or default them at the user's option.
1285  +  Added code to report on message buffer usage on SIGUSR[12]
1286  +  Made SIGUSR1 increment debug level, and SIGUSR2 decrement it.
1287  +  Incorporated Rudy's latest "Getting Started" document
1288  +  Made it largely Debian-compliant.  Thanks to Guenther Thomsen, Thomas
1289        Hepper, I単aki Fern叩ndez Villanueva and others.
1290  +  Made changes to work better with Red Hat 6.1, and SMP code.
1291  +  Sometimes it seems that the Master Control Process dies :-(
1292
1293* Sat Aug 14 1999 Alan Robertson <alanr@unix.sh>
1294+ Version 0.4.2
1295  + Implemented simple resource groups
1296  + Implemented application notification for groups starting/stopping
1297  + Eliminated restriction on floating IPs only being associated with eth0
1298  + Added a uniform resource model, with IP resources being only one kind.
1299        (Thanks to Lars Marowsky-Bree for a good suggestion)
1300  + Largely rewrote the IP address takeover code, making it clearer, fit
1301        into the uniform resource model, and removing some restrictions.
1302  + Preliminary "Getting Started" document by Rudy Pawul
1303  + Improved the /proc/ha code
1304  + Fixed memory leak associated with serial ports, and problem with return
1305        of control to the "master" node.
1306        (Thanks to Holger Kiehl for reporting them, and testing fixes!)
1307
1308* Tue Jul 6 1999 Alan Robertson <alanr@unix.sh>
1309+ Version 0.4.1
1310  + Fixed major memory leak in 0.4.0 (oops!)
1311  + Added code to eliminate duplicate packets and log lost ones
1312  + Tightened up PPP/UDP startup/shutdown code
1313  + Made PPP/UDP peacefully coexist with "normal" udp
1314  + Made logs more uniform and neater
1315  + Fixed several other minor bugs
1316  + Added very preliminary kernel code for monitoring and controlling
1317        heartbeat via /proc/ha.  Very cool, but not really done yet.
1318
1319* Wed Jun 30 1999 Alan Robertson <alanr@unix.sh>
1320+ Version 0.4.0
1321  + Changed packet format from single line positional parameter style
1322        to a collection of {name,value} pairs.  A vital change for the future.
1323  + Fixed some bugs with regard to forwarding data around rings
1324  + We now modify /etc/ppp/ip-up.local, so PPP-udp works out of the box
1325        (at least for Red Hat)
1326  + Includes the first version of Volker Wiegand's Hardware Installation Guide
1327        (it's pretty good for a first version!)
1328
1329* Wed Jun 09 1999 Alan Robertson <alanr@unix.sh>
1330+ Version 0.3.2
1331  + Added UDP/PPP bidirectional serial ring heartbeat
1332        (PPP ensures data integrity on the serial links)
1333  + fixed a stupid bug which caused shutdown to give unpredictable
1334        results
1335  + added timestamps to /var/log/ha-log messages
1336  + fixed a couple of other minor oversights.
1337
1338* Sun May 10 1999  Alan Robertson <alanr@unix.sh>
1339+ Version 0.3.1
1340  + Make ChangeLog file from RPM specfile
1341  + Made ipresources only install in the DOC directory as a sample
1342
1343* Sun May 09 1999 Alan Robertson <alanr@unix.sh>
1344+ Version 0.3.0
1345  + Added UDP broadcast heartbeat (courtesy of Tom Vogt)
1346  + Significantly restructured code making it easier to add heartbeat media
1347  + added new directives to config file:
1348    + udp interface-name
1349    + udpport port-number
1350    + baud    serial-baud-rate
1351  + made manual daemon shutdown easier (only need to kill one)
1352  + moved the sample ha.cf file to the Doc directory
1353
1354* Sat Mar 27 1999 Alan Robertson <alanr@unix.sh>
1355+ Version 0.2.0
1356  + Make an RPM out of it
1357  + Integrated IP address takeover gotten from Horms
1358  + Added support to tickle a watchdog timer whenever our heart beats
1359  + Integrated enough basic code to allow a 2-node demo to occur
1360  + Integrated patches from Andrew Hildebrand <andrew@pdi.com> to allow it
1361    to run under IRIX.
1362  - Known Bugs
1363    - Only supports 2-node clusters
1364    - Only supports a single IP interface per node in the cluster
1365    - Doesn't yet include Tom Vogt's ethernet heartbeat code
1366    - No documentation
1367    - Not very useful yet :-)
1368
1369###########################################################
Note: See TracBrowser for help on using the repository browser.