I am having periodic ntp synchronisation problems.
ntp doesn't directly log anything - but I am using nagios3 to track
it's synchronisation and I periodically get problems - several times a
day. I can't figure out what is causing the loss of synchronisation.
Perhaps a burst of ntp packets dropped? But wouldn't ntp log something
about this?
Any suggestions on logging more info to find the cause or deeper
insights into what is going wrong would be appreciated.
Andrew
Here's the output of commands when things are bad:
config(0)# check_ntp_peer -H 127.0.0.1 -w 1.0 -c 2.0
NTP WARNING: Server has the LI_ALARM bit set, Offset 0.210925
secs|offset=0.210925s;1.000000;2.000000;
LI_ALARM apparently means not in sync ???
config(1)# ntpq -c rl
associd=0 status=c618 leap_alarm, sync_ntp, 1 event, no_sys_peer,
version="ntpd 4.2.6p2(a)1.2194-o Sun Oct 17 13:35:13 UTC 2010 (1)",
processor="x86_64", system="Linux/2.6.32-5-amd64", leap=11, stratum=3,
precision=-23, rootdelay=95.696, rootdisp=263.117, refid=192.189.54.33,
reftime=d2bccf2f.4854b34f Sun, Jan 15 2012 15:06:07.282,
clock=d2bcd1d6.e9ffea4c Sun, Jan 15 2012 15:17:26.914, peer=16519,
tc=10, mintc=3, offset=0.000, frequency=500.000, sys_jitter=35.804,
clk_jitter=0.000, clk_wander=91.828
It thinks it's in error by 16s???
config(0)# ntpdc -c kerninfo
pll offset: 0 s
pll frequency: 500.000 ppm
maximum error: 16 s
estimated error: 16 s
status: 4041 pll unsync mode=fll
pll time constant: 10
precision: 1e-06 s
frequency tolerance: 500 ppm
ntptime gives the same info
config(0)# ntptime
ntp_gettime() returns code 5 (ERROR)
time d2bcd2be.08d3a000 Sun, Jan 15 2012 15:21:18.034, (.034479),
maximum error 16000000 us, estimated error 16000000 us
ntp_adjtime() returns code 5 (ERROR)
modes 0x0 (),
offset 0.000 us, frequency 500.000 ppm, interval 1 s,
maximum error 16000000 us, estimated error 16000000 us,
status 0x4041 (PLL,UNSYNC,MODE),
time constant 10, precision 1.000 us, tolerance 500 ppm,
Then mysertiously everything is okay:
config(0)# ntpdc -c kerninfo
pll offset: 0.00998 s
pll frequency: 500.000 ppm
maximum error: 1.6291 s
estimated error: 0.004771 s
status: 0001 pll
pll time constant: 10
precision: 1e-06 s
frequency tolerance: 500 ppm
My leap becomes none (no leap_alarm) and things are ok?
config(0)# ntpq -c rl
associd=0 status=0618 leap_none, sync_ntp, 1 event, no_sys_peer,
version="ntpd 4.2.6p2(a)1.2194-o Sun Oct 17 13:35:13 UTC 2010 (1)",
processor="x86_64", system="Linux/2.6.32-5-amd64", leap=00, stratum=3,
precision=-23, rootdelay=95.272, rootdisp=983.007, refid=192.189.54.33,
reftime=d2bcd33b.bbc10580 Sun, Jan 15 2012 15:23:23.733,
clock=d2bcd852.ec6be1b9 Sun, Jan 15 2012 15:45:06.923, peer=16519,
tc=10, mintc=3, offset=13.497, frequency=500.000, sys_jitter=7.251,
clk_jitter=4.772, clk_wander=151.809
I want to play videos in the center of my screen. I don't want to run them
full-screen because for some videos that takes too much CPU time and gets
video and audio out of sync and for some other videos the resolution isn't
high enough (anything scaled up by more than a factor of 2 looks bad).
There are mplayer options for putting the window at a specific position, this
would be fine if I had many videos with the same resolution. But I want to
play arbitrary resolution videos in the center of the screen.
Any ideas on how to do this?
--
My Main Blog http://etbe.coker.com.au/
My Documents Blog http://doc.coker.com.au/
My desktop runs a bastardised Fedora 14 with gnome 2.32
There have been a few software updates on my box of recent time and
somewhere along the way the linking of keys to actions have broken. I
have two specific cases to report.
In libreoffice the Help/Libreoffice Help menu item, and its F1 hot key,
used to pass an html link to the default browser, but now it is passed
to gedit.
The second instance comes from the gnome terminal emulator, where it
underlines any text under the mouse pointer that looked like a URL, and
a Ctrl-LMB would pass said text to the browser. Once again it is now
passed to gedit.
Clearly one of the updates has changed a configuration, but grepping my
home directory for the occurrence of "gedit" throws up a blank, so it
must be deeper than the mere user.
Looking in /etc/ didn't give any joy either.
I find that the search terms I tried in google had too many false
matches to be of help.
Does anyone have some hints on where to look?
On Sat, 25 Feb 2012, Daniel Pittman <daniel(a)rimspace.net> wrote:
> Adjust the behaviour settings in drbd.conf around split brain; they
> have a bunch of configuration choices. See the "handlers" in the
> manual for the situations and responses.
What does drbd consider to be a split brain situation?
root@nodeb# iptables -A OUTPUT -d $NODEA -j DROP
I've setup a couple of nodes running under Xen. I ran the above command and
since then I've had write commands on the ext4 filesystem mounted on nodea
block, and I'm seeing lots of messages like the following in the kernel
message log on nodea:
[ 1831.024174] block drbd0: [drbd0_worker/844] sock_sendmsg time expired, ko =
4294967162
After 960 seconds (and some kernel panics from the ext4 code) it restarted
itself. It seems that netfilter isn't catching all kernel generated packets
because it managed to synchronise again.
--
My Main Blog http://etbe.coker.com.au/
My Documents Blog http://doc.coker.com.au/
I have a new laptop for school, the HP ProBook S-series that I want to
install one of the rolling release distros on (Debian/Arch/Gentoo).
I think my laptop is the same make and model as the guy who came to the
February Beginner's Workshop and was trying to install Linux Mint. I
recognise some of the same partitions on my hdd that he had: Windows/C: is
the main one with 383GB free of 442GB, HP_RECOVERY with 2.76GB free of
18.1GB and HP_TOOLS with 2.12GB free of 4.98 GB. I have not installed the
other HP partition that lets you log in before the OS is loaded. I thought
I had the option of installing Windows (ie they give me the installation
discs and a blank hdd) but no, I was taken to the Windows installation
immediately after the laptop was turned on, could not get out of it as
there was no way to enter the BIOS and I received a call from Microsoft
about 30 seconds later to help me activate my license. They were waiting on
the line for me to complete the installation before they could help me so
what was I supposed to say?
My question to you is did anyone keep in contact with the guy who was
trying to install Mint and do you know if he had any problems after
deleting the HP_TOOLS partition? There is HP software within Windows and
I'm wondering if it will detect that the partition is missing.
Also can you recommend a good walkthrough for setting up a dual-boot
system? I'm pretty good with Linux, it has been my sole OS for a couple
years now, but I've never set up a dual-boot system before. I can't format
the whole hdd because I need Windows for school and don't have the discs. I
have preferred to set up a dual boot system by installing Linux -> creating
extra partitions -> installing GRUB -> installing Windows but that's not an
option.
Regards,
Andrew
On Sat, 25 Feb 2012, Daniel Pittman <daniel(a)rimspace.net> wrote:
> The issue with doing this is that you will have two nodes writing
> actively when you have a network problem, so you will be unable to
> sync data back together correctly. The same issue, in fact, that any
> two node cluster has in retaining function during a network split.
You will only have two nodes writing at once if the secondary is made primary
while the link is down. If the process of resource failover is managed by a
cluster manager which does something smarter than just looking at a single
network interface (*) then it may be able to deal with this when DRBD can't.
Also there's the case where you have a manual failover in which case the
person doing that can determine when there is either no data loss or
acceptable data loss.
The situation I'm trying to deal with is where there is an outage which isn't
even long enough to raise a NAGIOS alert. It would be nice if DRBD could just
keep working in that situation. In this case the risk of data loss is
mitigated by the fact that any network problem which can prevent the DRBD code
from communicating would also prevent writes as the daemons which write to
DRBD filesystems communicate via the same network.
> This is why two nodes and HA don't really go together in most cases:
> you can't handle a whole bunch of problems in that case. Though, you
> might find that turning off the DRBD handling and using pacemaker to
> manage connectivity over some alternate media helps improve general
> reliability.
Thanks, I'll investigate that.
(*) Does any cluster manager do that? It is theoretically possible to use
Ethernet bonding to make a single device out of multiple ethernet
ports/switches. But in practice there are many situations where that isn't
possible, among other things last time I tested it (years ago) I had some
problems with certain ethernet cards and it seemed to rely on a working
router.
--
My Main Blog http://etbe.coker.com.au/
My Documents Blog http://doc.coker.com.au/
It seems that the default behavior of drbd is to reboot the primary node of a
cluster if it gets a split-brain. For a 3 node cluster this might make sense.
For a 2 node cluster it means that if the secondary fails then the primary
goes down too.
How do I configure drbd to not do this and are there any issues with doing so?
--
My Main Blog http://etbe.coker.com.au/
My Documents Blog http://doc.coker.com.au/
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi Graeme,
I recall seeing AU/Vic appear in the list when it was listed by state
every time I tweaked the timezone, but I have been running on
sid/unstable since Debian 3.0, with the timezone only being dealt with
during a fresh install (and VERY occasionally when 'apt-get upgrade' or
'apt-get dist-upgrade' required).
Sid have them listed by city with Melbourne being in there. :)
Cheers,
Tim Lyth
On 23/02/2012 10:09 AM, Graeme Cross wrote:
> Hi all.
>
> Has anyone done a recent install from a Debian testing/wheezy ISO
> and found that Victoria has disappeared?
>
> I am in the process of doing one at the moment, using the text
> installer, and Victoria is not listed as a timezone option (see
> the attached screenshot).
>
> I selected "Australia" as the country earlier in the install, and
> there is no scrolling option for this dialog box (ie. Victoria is
> not hiding at the end of a scrolling list of states &
> territories).
>
> This install is with the most recent build of the testing
> netinst.iso (from 22 Feb,
> http://www.debian.org/devel/debian-installer/).
>
> Has anyone else observed this or have I screwed up the install
> process somehow? I thought I would ask here before reporting it as
> a bug.
>
> Thanks Graeme
>
>
> _______________________________________________ luv-main mailing
> list luv-main(a)luv.asn.au http://lists.luv.asn.au/listinfo/luv-main
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.17 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iQEcBAEBAgAGBQJPRf2yAAoJEENQGX7xOJnV9PUIALQU4nyy6w5PrI5+JZ/ZvjXi
hlNzs36IhDhQGBXSLnOAEQjlznTFVWY8gNgUrYbiajRcTwjx51TA+kt/KSz4ZPOM
mp4HI9G2IgKVQhLLfI/O0ynCoD8evInOftiHP/7p48dwQJ7dLDRlJ4TLbHGj+mbL
cfTtbcEPOV3oRZJ9J16OcXynH8FXkLLqPxlqdW/ACzINsaiV2ien02j2B/zs05sK
0ZAg4L0f5WsmgE5/x1i8W8KE/GfMHqoBajjXHbhuH7c4JH0EIVPMWlhhO/kdLZYa
OUEvfNdnrM5lhnC+VjDZo5HVsjPZ1xuV1s567IlvBLw6uvYl6vghaJ4EU5mhTM0=
=oWzj
-----END PGP SIGNATURE-----