]>
mj.ucw.cz Git - libucw.git/log
Martin Mares [Sat, 11 Dec 2004 12:03:38 +0000 (12:03 +0000)]
SHERLOCK_HAVE_PREAD -> HAVE_PREAD.
Martin Mares [Sat, 11 Dec 2004 12:01:05 +0000 (12:01 +0000)]
CONFIG_LFS has been removed, CONFIG_LARGE_DB got replaced by CONFIG_LARGE_FILES.
Martin Mares [Sat, 11 Dec 2004 11:54:10 +0000 (11:54 +0000)]
Moved oid_t to sherlock.h.
Martin Mares [Sat, 11 Dec 2004 11:51:35 +0000 (11:51 +0000)]
Moved the lizard-fb module to Sherlock library, because it's heavily
dependent on the bucket types. The rest of LiZaRd should better remain
in libucw, it's very useful outside Sherlock.
Martin Mares [Sat, 11 Dec 2004 11:44:19 +0000 (11:44 +0000)]
Configure libucw according to Sherlock config.
Martin Mares [Sat, 11 Dec 2004 11:41:06 +0000 (11:41 +0000)]
Moved CGI.pm and Query.pm to Sherlock library.
Martin Mares [Sat, 11 Dec 2004 11:30:41 +0000 (11:30 +0000)]
Split library functions to libsh and libucw.
Martin Mares [Sat, 11 Dec 2004 10:48:44 +0000 (10:48 +0000)]
Introduced `sherlock/config.mk' which contains non-customizable parts
of the configuration. Moved version and config file name there.
Martin Mares [Sat, 11 Dec 2004 10:47:49 +0000 (10:47 +0000)]
Made DEFAULT_CONFIG optional.
Tomas Holusa [Wed, 8 Dec 2004 10:44:14 +0000 (10:44 +0000)]
fb-charconv now not closing wrapped stream, only flushing it
updated all places, when used this wrapper and close wrapped stream manually
Martin Mares [Fri, 3 Dec 2004 21:07:35 +0000 (21:07 +0000)]
A minor bug fix to the installer (when installing `dict/cs', `dict'
needn't exist).
Martin Mares [Fri, 3 Dec 2004 20:52:22 +0000 (20:52 +0000)]
Added the `extras' target (initially empty) and INSTALL_RUNDIRS.
`lib' and `bin' are now handled as INSTALL_RUNDIRS.
Martin Mares [Sat, 20 Nov 2004 15:14:39 +0000 (15:14 +0000)]
Partial inlining of bskip().
Martin Mares [Sat, 20 Nov 2004 12:55:12 +0000 (12:55 +0000)]
Introduced COMPARE_LT and COMPARE_GT for comparison functions for arraysort.
Martin Mares [Sat, 20 Nov 2004 11:38:06 +0000 (11:38 +0000)]
Increase CARD_POS_SHIFT on Centrum.
Martin Mares [Fri, 19 Nov 2004 22:30:12 +0000 (22:30 +0000)]
If "Run" is omitted from a test, keep the value from the previous test.
Robert Spalek [Mon, 15 Nov 2004 16:38:34 +0000 (16:38 +0000)]
enlarge MAX_ATTR_SIZE
Martin Mares [Sun, 14 Nov 2004 12:12:29 +0000 (12:12 +0000)]
Move all explicit extern declarations of logf() [which is apparently missing
from system includes while present in the library; does anybody know, why?]
to lib/math.h.
Martin Mares [Sun, 14 Nov 2004 11:57:25 +0000 (11:57 +0000)]
Updated test rules.
Martin Mares [Sun, 14 Nov 2004 11:57:07 +0000 (11:57 +0000)]
Added tests for the hash table module.
Martin Mares [Sun, 14 Nov 2004 11:56:42 +0000 (11:56 +0000)]
Added support for dynamically allocated hash tables. Closes Bug #250.
Martin Mares [Sun, 14 Nov 2004 11:54:27 +0000 (11:54 +0000)]
Finally I decided to change the syntax of INDEX_VERSION to better distinguish
between different customizations.
Martin Mares [Sun, 14 Nov 2004 11:45:15 +0000 (11:45 +0000)]
Corrected INDEX_VERSION, documented the rules and added CUSTOM_INDEX_VERSION.
Robert Spalek [Tue, 9 Nov 2004 17:48:52 +0000 (17:48 +0000)]
if SORT_PRESORT and SORT_UP_TO are both on, compute the initial number of
runs properly so that we do not run a superfluous pass of sorting
Robert Spalek [Tue, 9 Nov 2004 15:46:39 +0000 (15:46 +0000)]
just a bit more funny initialization for the SORT_UP_TO loop
Robert Spalek [Tue, 9 Nov 2004 15:39:39 +0000 (15:39 +0000)]
actually, the number of runs is halved during each pass, so take it into
account in the stopping condition
Martin Mares [Tue, 9 Nov 2004 14:09:00 +0000 (14:09 +0000)]
Fixed a bug in parsing of replies -- the parse tree of each card was trimmed
at the first ")". Affected only the free front-end.
Robert Spalek [Tue, 9 Nov 2004 13:16:20 +0000 (13:16 +0000)]
bfilesize() returns sh_off_t
Robert Spalek [Mon, 8 Nov 2004 17:36:48 +0000 (17:36 +0000)]
SORT_ONLY_PRESORT feature improved into SORT_UP_TO (stop when average run
length reaches a given limit), which is independent on presorting
Tomas Holusa [Fri, 5 Nov 2004 15:07:26 +0000 (15:07 +0000)]
Split type_flags to 4/4 bits for FILETYPE and LANG
Martin Mares [Thu, 4 Nov 2004 22:22:59 +0000 (22:22 +0000)]
Remember that DEBUG has been renamed.
Martin Mares [Thu, 4 Nov 2004 22:22:21 +0000 (22:22 +0000)]
Added "pre-sort only" mode as Robert suggested.
Martin Mares [Fri, 29 Oct 2004 15:10:21 +0000 (15:10 +0000)]
One more round of changes to the assert code -- I didn't realize that CONFIG_DEBUG
is already used to control compilation of debugging utilities. Introduced
DEBUG_ASSERTS as a separate switch and as a bonus, you can use assertions
selectively (previously, if you have built the library without assertions,
ASSERT couldn't work anywhere, because assert_failed() was incompatible).
Martin Mares [Fri, 29 Oct 2004 08:56:06 +0000 (08:56 +0000)]
Always define the memory allocation primitives with the `sh_' prefix,
so that the basic names (xmalloc etc.) can be modified when needed.
Martin Mares [Fri, 29 Oct 2004 08:41:04 +0000 (08:41 +0000)]
Changed DEBUG to CONFIG_DEBUG to avoid conflicts with various libraries.
(BTW we were probably slowing down the regex code by defining DEBUG.)
Martin Mares [Mon, 25 Oct 2004 14:32:34 +0000 (14:32 +0000)]
Tagged current sources as rel-3-3 (bug fixes only)
and moved mainline to v3.4.
Martin Mares [Mon, 4 Oct 2004 16:45:56 +0000 (16:45 +0000)]
Double oops.
Martin Mares [Mon, 4 Oct 2004 16:45:24 +0000 (16:45 +0000)]
Oops.
Martin Mares [Mon, 4 Oct 2004 16:43:14 +0000 (16:43 +0000)]
Added sh_file_size() (stat() isn't reliable on large files on some systems).
Martin Mares [Sat, 2 Oct 2004 10:52:59 +0000 (10:52 +0000)]
Introduced relative counterparts of url_canon_split() and url_auto_canonicalize().
Also added parameter names to all prototypes.
Martin Mares [Mon, 27 Sep 2004 19:49:00 +0000 (19:49 +0000)]
Let bskip() return success.
Martin Mares [Sun, 26 Sep 2004 16:09:18 +0000 (16:09 +0000)]
obuck_slurp_pool() is now able to skip unwanted buckets. Some day we can
improve it by sizing the buffer adaptively, but not now.
Martin Mares [Mon, 20 Sep 2004 10:03:23 +0000 (10:03 +0000)]
A couple of minor improvements to the generic hash tables:
o Comment on HASH_GIVE_ALLOC corrected, the automatic choice between
normal and pooled allocation was never implemented, because we don't
have enough information to decide the right pool size. HASH_AUTO_POOL
has been implemented instead.
o Pooled allocation no longer conflicts with remove/delete, it just
produces memory leaks within the pool, which are usually harmless.
o Added HASH_TABLE_ALLOC switch.
o Internal decisions to enable HASH_CONSERVE_SPACE were made too late
and didn't influence hash node layout. Fixed.
Martin Mares [Thu, 9 Sep 2004 11:52:09 +0000 (11:52 +0000)]
Thoroughly tested and fixed a couple of bugs.
Martin Mares [Thu, 9 Sep 2004 10:47:05 +0000 (10:47 +0000)]
Fixed a couple of bugs.
Martin Mares [Thu, 9 Sep 2004 10:46:12 +0000 (10:46 +0000)]
No exceptions for url-equiv.
Robert Spalek [Thu, 26 Aug 2004 12:52:15 +0000 (12:52 +0000)]
tested MJ's patches on charconv. they successfully converted everything
I tried, including the tests that failed before
Robert Spalek [Sat, 21 Aug 2004 16:30:02 +0000 (16:30 +0000)]
- bugfix: when disabling compression, change bucket_type
- lizard_bwrite() returns the final bucket_type instead of the number of
written bytes
Robert Spalek [Sat, 21 Aug 2004 16:29:06 +0000 (16:29 +0000)]
added sepsplit() in wordsplit.c
Martin Mares [Fri, 20 Aug 2004 19:38:31 +0000 (19:38 +0000)]
Added initial support for areas to the indexer. Merging not restricted yet,
but propagation of area ID's works.
Martin Mares [Fri, 20 Aug 2004 19:05:14 +0000 (19:05 +0000)]
Put debug/custom last, so that one more manual recompile won't be needed after
check-customs when debugging.
Martin Mares [Fri, 20 Aug 2004 19:04:17 +0000 (19:04 +0000)]
Check more configurations.
Robert Spalek [Fri, 20 Aug 2004 13:17:49 +0000 (13:17 +0000)]
if b?get_attr() returns 0, it clears attr->attr too
Robert Spalek [Fri, 20 Aug 2004 12:38:28 +0000 (12:38 +0000)]
a really essential bugfix
Martin Mares [Fri, 20 Aug 2004 11:44:08 +0000 (11:44 +0000)]
Use GET_UTF8 instead of GET_UTF8_CHAR (this costs us an extra unnecessary
test for character < 0x80, but it is worth the simplification).
Martin Mares [Fri, 20 Aug 2004 11:43:10 +0000 (11:43 +0000)]
Added unit tests for UTF8 macros.
Martin Mares [Fri, 20 Aug 2004 11:42:43 +0000 (11:42 +0000)]
Optimized and simplified UTF8 macros. In fact, turned most of them
to inline functions.
This resulted in a couple of const/non-const problems, hence the casts
in GET_UTF8{,_32}.
Robert Spalek [Fri, 20 Aug 2004 09:41:00 +0000 (09:41 +0000)]
renamed attr_set_type() to put_attr_set_type()
Robert Spalek [Fri, 20 Aug 2004 09:39:00 +0000 (09:39 +0000)]
- added b?get_attr()
- added {get,put}_attr_set_type()
Robert Spalek [Fri, 20 Aug 2004 09:36:53 +0000 (09:36 +0000)]
- added get_attr() and bget_attr() with get_attr_set_type()
- use get_utf8_32()
Robert Spalek [Fri, 20 Aug 2004 09:35:55 +0000 (09:35 +0000)]
- use put_utf8_32()
- attr_set_type() renamed to put_attr_set_type()
Robert Spalek [Fri, 20 Aug 2004 09:34:02 +0000 (09:34 +0000)]
- added {get,put}_utf8_32() for all full 6-byte codes
- fixed UTF8_SKIP_BWD(); it is never used
- upgraded utf8_space() and utf8_encoding_len()
Robert Spalek [Thu, 19 Aug 2004 18:32:24 +0000 (18:32 +0000)]
- do not try to compress when liz_min_compr == 0
- renamed function
- lizard_*read() tests EOF and invalid bucket_type
Robert Spalek [Thu, 19 Aug 2004 18:30:52 +0000 (18:30 +0000)]
typo
Robert Spalek [Thu, 19 Aug 2004 18:30:43 +0000 (18:30 +0000)]
renamed function
Martin Mares [Thu, 19 Aug 2004 13:47:23 +0000 (13:47 +0000)]
A better implementation of vbprintf() which avoids copying if there is enough
room in the fastbuf's buffer.
Martin Mares [Thu, 19 Aug 2004 13:37:08 +0000 (13:37 +0000)]
Comparisons of type `if (f->bptr + length > f->bstop)' could overflow
if `length' was much larger than the buffer size and the buffer was situated
at the end of the address space. Solved by introducing bavail{r,w}() and
using it systematically.
Robert Spalek [Wed, 18 Aug 2004 10:28:40 +0000 (10:28 +0000)]
bugfix in obuck_slurp_refill
Robert Spalek [Tue, 17 Aug 2004 20:17:37 +0000 (20:17 +0000)]
cards format has changed, do upgrade the version of the index
Robert Spalek [Tue, 17 Aug 2004 20:12:05 +0000 (20:12 +0000)]
added a module that supports transparent compression of data streamed into
an output fastbuf. on the other side, reading from the stream is also
transparent
* will be used for cards
* untested
Martin Mares [Sun, 15 Aug 2004 14:14:55 +0000 (14:14 +0000)]
Accept V33_LIZARD buckets without body -- such buckets can legally
arise when shep-cleanup trims an inactive bucket.
Also added debugging messages which helped to verify that zero-copy
is really invoked where it should.
Martin Mares [Sun, 15 Aug 2004 12:09:57 +0000 (12:09 +0000)]
Added sync_dir().
Martin Mares [Sun, 15 Aug 2004 12:09:45 +0000 (12:09 +0000)]
Added bfilesync().
Martin Mares [Sun, 15 Aug 2004 10:16:03 +0000 (10:16 +0000)]
Made `buckettool -x' show the header if in verbose mode.
Also use bbcopy_slow() for copying bucket contents.
Martin Mares [Sat, 14 Aug 2004 15:15:53 +0000 (15:15 +0000)]
Added obj_find_anum() which is obj_find_aval() combined with atol() and
a default value (this is a very common idiom at many places).
Added size_attr() which calculates size of a given attribute in the
current bucket format.
Martin Mares [Sat, 14 Aug 2004 15:14:36 +0000 (15:14 +0000)]
UTF8_SPACE turned to an inline function.
Added utf8_encoding_length() which measures expected length of an UTF-8
sequence according to its first byte.
Martin Mares [Thu, 5 Aug 2004 22:33:19 +0000 (22:33 +0000)]
obj_add_attr_ref() with an on-stack buffer is not advisable, better
allocate the buffer in the object pool.
Martin Mares [Thu, 5 Aug 2004 22:32:30 +0000 (22:32 +0000)]
When reading, the data end at b->bstop, not b->bufend.
Martin Mares [Thu, 5 Aug 2004 22:28:58 +0000 (22:28 +0000)]
Bucket fastbuf's have to be seekable, since shep-reap seeks on them
(although the seek will be inside the buffer [and thence optimized out
by bseek()] under all reasonable circumstances [it was triggered by
a bug in buck2obj]).
Robert Spalek [Thu, 5 Aug 2004 14:13:56 +0000 (14:13 +0000)]
do not read the whole bucket into memory if only header is needed
Martin Mares [Thu, 5 Aug 2004 12:13:41 +0000 (12:13 +0000)]
New attributes are now added at the end of the list.
Robert Spalek [Tue, 20 Jul 2004 15:16:26 +0000 (15:16 +0000)]
tiny bugfix :))
Robert Spalek [Tue, 20 Jul 2004 10:15:31 +0000 (10:15 +0000)]
tiny optimization: do not make 2 function calls in adler32()
Martin Mares [Fri, 16 Jul 2004 21:56:11 +0000 (21:56 +0000)]
Fixed two bugs in parsing of buckets (one of them introduced by me a couple
of hours ago, the other aged).
Martin Mares [Fri, 16 Jul 2004 20:22:53 +0000 (20:22 +0000)]
Buckets of unknown format should be skipped properly, otherwise
bucket fetching can become confused.
Martin Mares [Fri, 16 Jul 2004 19:08:23 +0000 (19:08 +0000)]
`buckettool -c' (cat) now separates buckets by an empty line.
Martin Mares [Fri, 16 Jul 2004 18:32:35 +0000 (18:32 +0000)]
Parsing of PLAIN type buckets also ends after buck_len bytes.
obj_read_multi() is gone.
Martin Mares [Fri, 16 Jul 2004 18:18:02 +0000 (18:18 +0000)]
Reading of V30 buckets was broken (for example it didn't work when a bucket
had only header and no body). When fixing it, I've modified it to terminate
after buck_len bytes, so the terminating NUL character is no longer needed.
Robert Spalek [Fri, 16 Jul 2004 11:46:06 +0000 (11:46 +0000)]
prune the original source code yet more
Robert Spalek [Fri, 16 Jul 2004 10:50:14 +0000 (10:50 +0000)]
tiny bugfix
Robert Spalek [Fri, 16 Jul 2004 10:47:37 +0000 (10:47 +0000)]
taken much faster implementation of Adler32 and put into a separate source-code
Robert Spalek [Fri, 16 Jul 2004 10:39:04 +0000 (10:39 +0000)]
add Adler32 checksum to compressed V33 buckets
Robert Spalek [Fri, 16 Jul 2004 10:25:49 +0000 (10:25 +0000)]
added adler32 functions
Robert Spalek [Fri, 16 Jul 2004 10:12:33 +0000 (10:12 +0000)]
automatically add 0x80000000 to bucket-type specified on the command-line
Robert Spalek [Fri, 16 Jul 2004 09:07:50 +0000 (09:07 +0000)]
improved an error message to catch an error
Martin Mares [Thu, 15 Jul 2004 21:52:29 +0000 (21:52 +0000)]
Added {b,}put_attr_separator().
Martin Mares [Mon, 12 Jul 2004 14:13:23 +0000 (14:13 +0000)]
Introduced late matchers and custom statistics [see Bug #860]. Converted
FILETYPE to this mechanism, it's no longer an exception.
Martin Mares [Sat, 10 Jul 2004 21:05:27 +0000 (21:05 +0000)]
pools.h -> mempool.h
Martin Mares [Sat, 10 Jul 2004 21:04:23 +0000 (21:04 +0000)]
I always wanted to rename the rather inconsistent memory pool modules.
After the big library earthquake I caused with my unicode.h changes,
this is the small fish, so let's go.
Martin Mares [Sat, 10 Jul 2004 20:56:56 +0000 (20:56 +0000)]
Arranged the module list.