Skip to content

Commit

Permalink
todo fixed some bugs
Browse files Browse the repository at this point in the history
  • Loading branch information
liewegas committed Mar 22, 2010
1 parent a01233e commit 061ed42
Showing 1 changed file with 0 additions and 67 deletions.
67 changes: 0 additions & 67 deletions src/TODO
Original file line number Diff line number Diff line change
Expand Up @@ -60,24 +60,6 @@ filestore
- need an osdmap cache layer?

bugs
- kclient: looping osd connection failures
[ 3974.417106] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.423295] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.429709] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.437863] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.451780] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.472879] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.479061] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.485138] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.491235] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.499103] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.508805] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.517429] ceph: osd11 10.3.14.138:6800 connection failed
[ 3974.585106] ceph: osd11 10.3.14.138:6800 connection failed
... crash some osds, then restart them ...?

- kclient: mds reconnect on after mds kicks us off

- be lenient about timing out clients if we are laggy ourselves
- mds prepare_force_open_sessions, then import aborts.. session is still OPENING but no client_session is sent...
- rm -r failure (on kernel tree)
Expand Down Expand Up @@ -163,55 +145,6 @@ bugs
- osd pg split breaks if not all osds are up...
- mislinked directory? (cpusr.sh, mv /c/* /c/t, more cpusr, ls /c/t)

- snaprealm thing
ceph3:~# find /c
/c
/c/.ceph
/c/.ceph/mds0
/c/.ceph/mds0/journal
/c/.ceph/mds0/stray
[68663.397407] ceph: ceph_add_cap: couldn't find snap realm 10000491bb5
...
ceph3:/c# [68724.067160] BUG: unable to handle kernel NULL pointer dereference at 0000000000000088
[68724.071069] IP: [<ffffffffa00805c3>] __send_cap+0x237/0x585 [ceph]
[68724.078917] PGD f7a12067 PUD f688c067 PMD 0
[68724.082907] Oops: 0000 [#1] PREEMPT SMP
[68724.082907] last sysfs file: /sys/class/net/lo/operstate
[68724.082907] CPU 1
[68724.082907] Modules linked in: ceph fan ac battery psmouse ehci_hcd ohci_hcd ide_pci_generic thermal processor button
[68724.082907] Pid: 10, comm: events/1 Not tainted 2.6.32-rc2 #1 H8SSL
[68724.082907] RIP: 0010:[<ffffffffa00805c3>] [<ffffffffa00805c3>] __send_cap+0x237/0x585 [ceph]
[68724.114907] RSP: 0018:ffff8800f96e3a50 EFLAGS: 00010202
[68724.114907] RAX: 0000000000000000 RBX: 0000000000000354 RCX: 0000000000000000
[68724.114907] RDX: 0000000000000000 RSI: ffff8800f76e8ba8 RDI: ffff8800f581a508
[68724.114907] RBP: ffff8800f96e3bb0 R08: 0000000000000000 R09: 0000000000000001
[68724.114907] R10: ffff8800cea922b8 R11: ffffffffa0082982 R12: 0000000000000001
[68724.114907] R13: 0000000000000000 R14: ffff8800cea95378 R15: 0000000000000000
[68724.114907] FS: 00007f54be9a06e0(0000) GS:ffff880009200000(0000) knlGS:0000000000000000
[68724.114907] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
[68724.114907] CR2: 0000000000000088 CR3: 00000000f7118000 CR4: 00000000000006e0
[68724.178904] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[68724.178904] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[68724.178904] Process events/1 (pid: 10, threadinfo ffff8800f96e2000, task ffff8800f96e02c0)
[68724.178904] Stack:
[68724.178904] ffff8800f96e0980 ffff8800f96e02c0 ffff8800f96e3a80 ffffffff8106a3b9
[68724.178904] <0> ffff8800f96e3a80 0000000000000003 00006589ac4ca260 0000000000000004
[68724.178904] <0> 0cb13589944c0262 0000000000000000 ffff8800f96e3b30 ffffffff81ca7c80
[68724.178904] Call Trace:
[68724.178904] [<ffffffff8106a3b9>] ? get_lock_stats+0x19/0x4c
[68724.178904] [<ffffffff8106d8c8>] ? mark_held_locks+0x4d/0x6b
[68724.178904] [<ffffffffa0082a25>] ceph_check_caps+0x740/0xa70 [ceph]
[68724.178904] [<ffffffff8106a3b9>] ? get_lock_stats+0x19/0x4c
[68724.178904] [<ffffffff8106a964>] ? put_lock_stats+0xe/0x27
[68724.178904] [<ffffffffa00840b6>] ceph_check_delayed_caps+0xcb/0x14a [ceph]
[68724.178904] [<ffffffffa009011f>] delayed_work+0x3f/0x368 [ceph]
[68724.178904] [<ffffffff8105b194>] ? worker_thread+0x229/0x398
[68724.178904] [<ffffffff8105b1ee>] worker_thread+0x283/0x398
[68724.178904] [<ffffffff8105b194>] ? worker_thread+0x229/0x398
[68724.178904] [<ffffffffa00900e0>] ? delayed_work+0x0/0x368 [ceph]
[68724.178904] [<ffffffff8146a56e>] ? preempt_schedule+0x3e/0x4b
[68724.306901] [<ffffffff8105f4d0>] ? autoremove_ceph3:/c# [68724.067160]


filestore performance notes
- write ordering options
Expand Down

0 comments on commit 061ed42

Please sign in to comment.