]>
mj.ucw.cz Git - libucw.git/log
Martin Mares [Sat, 3 Mar 2001 17:22:16 +0000 (17:22 +0000)]
Define FASTBUF_BYTES_PER_(O|P).
Martin Mares [Sat, 3 Mar 2001 13:39:36 +0000 (13:39 +0000)]
Added indexer names for word and string type classes.
Martin Mares [Sat, 3 Mar 2001 12:12:35 +0000 (12:12 +0000)]
Defined which string classes contain URL's and which ones case insensitive
strings.
Robert Spalek [Fri, 2 Mar 2001 13:36:47 +0000 (13:36 +0000)]
added
Martin Mares [Fri, 2 Mar 2001 11:30:11 +0000 (11:30 +0000)]
Updated the charset conversion library to UniCode 3.0.
Removed ice age relics.
Removed signature tables as they are not used anyway.
Martin Mares [Fri, 2 Mar 2001 11:00:33 +0000 (11:00 +0000)]
Replaced <sys/time.h> by <time.h> where appropriate.
Martin Mares [Thu, 1 Mar 2001 17:31:47 +0000 (17:31 +0000)]
Fixed bug in generating UTF-8 for codes >= 0x800.
Martin Mares [Thu, 1 Mar 2001 16:57:58 +0000 (16:57 +0000)]
Defined a GET_TAGGED_CHAR macro to read our internal representation
of tagged text, mapping the tags to character codes >= 0x80000000.
Martin Mares [Fri, 23 Feb 2001 14:02:46 +0000 (14:02 +0000)]
Generate index cards.
The chewer is complete, but it will probably need a bit of optimization
and fine-tuning when we get some real data. The searching for words present
in the cache doesn't look well.
Martin Mares [Fri, 23 Feb 2001 10:53:32 +0000 (10:53 +0000)]
Indexing of strings.
Martin Mares [Thu, 22 Feb 2001 16:01:49 +0000 (16:01 +0000)]
Some more chewer work...
Martin Mares [Tue, 20 Feb 2001 22:32:06 +0000 (22:32 +0000)]
Added bgets0().
Martin Mares [Tue, 20 Feb 2001 17:51:24 +0000 (17:51 +0000)]
Added a useful macro for value clamping.
Martin Mares [Mon, 19 Feb 2001 19:08:56 +0000 (19:08 +0000)]
Oops, breadb() was wrong.
Martin Mares [Mon, 19 Feb 2001 18:53:46 +0000 (18:53 +0000)]
Added breadb() which acts just like bread(), but die()s if a partial
record is read. This is mainly to avoid consistency checks in main
code path.
Martin Mares [Fri, 16 Feb 2001 20:16:51 +0000 (20:16 +0000)]
SORT_DELETE_INPUT works even with SORT_INPUT_FB.
Martin Mares [Fri, 16 Feb 2001 20:16:26 +0000 (20:16 +0000)]
Declare fingerprints as 12 bytes, not 3 u32's.
Martin Mares [Fri, 16 Feb 2001 20:16:00 +0000 (20:16 +0000)]
Added #define PACKED __attribute__((packed)).
Martin Mares [Fri, 16 Feb 2001 18:54:31 +0000 (18:54 +0000)]
Added merger.
Martin Mares [Fri, 16 Feb 2001 17:56:06 +0000 (17:56 +0000)]
Added unmapping and writeable mappings.
Martin Mares [Fri, 16 Feb 2001 16:16:25 +0000 (16:16 +0000)]
Scanner improvements: create redirect backlinks, detect empty documents,
mark accented documents.
Martin Mares [Thu, 15 Feb 2001 19:17:47 +0000 (19:17 +0000)]
Testing programs are not build by default.
Martin Mares [Thu, 15 Feb 2001 19:05:39 +0000 (19:05 +0000)]
Added URL fingerprints.
Martin Mares [Thu, 15 Feb 2001 19:04:57 +0000 (19:04 +0000)]
Added bputs0() -- put a null-terminated string.
Martin Mares [Sat, 10 Feb 2001 12:28:22 +0000 (12:28 +0000)]
Shut up warnings.
Robert Spalek [Fri, 9 Feb 2001 10:48:55 +0000 (10:48 +0000)]
added cf_item_count()
Robert Spalek [Fri, 9 Feb 2001 10:48:17 +0000 (10:48 +0000)]
deleted unused variable prog
Robert Spalek [Fri, 9 Feb 2001 10:48:02 +0000 (10:48 +0000)]
added sort-test
Martin Mares [Sun, 4 Feb 2001 20:08:14 +0000 (20:08 +0000)]
Next version of the sorter -- both presorting and unifying works.
sort-test now does just `sort -u' and it's about 30% slower than its
GNU counterpart, probably due to extra copies of sorting keys by our
buffered I/O layer. Fortunately, a typical case will be long data with
short keys where we should be efficient as we can use bbcopy().
Robert Spalek [Sun, 4 Feb 2001 15:29:15 +0000 (15:29 +0000)]
cf_get_item added
Martin Mares [Sun, 4 Feb 2001 14:44:54 +0000 (14:44 +0000)]
First version of the sorter. No presorting phase yet.
Martin Mares [Sun, 4 Feb 2001 14:44:00 +0000 (14:44 +0000)]
Added "is_temp_file" attribute which causes automatic deletion of the
file upon bclose().
Martin Mares [Sun, 28 Jan 2001 21:40:27 +0000 (21:40 +0000)]
Don't log pid until log_fork() is called.
Martin Mares [Sun, 28 Jan 2001 20:50:23 +0000 (20:50 +0000)]
Added __attribute__((format...)) to declaration of log(), so that
discrepancies between format string and arguments get easily found.
Martin Mares [Fri, 26 Jan 2001 16:28:09 +0000 (16:28 +0000)]
oid's above OBUCK_OID_FIRST_SPECIAL are reserved for encoding of
error codes and other stuff.
Martin Mares [Fri, 26 Jan 2001 16:27:38 +0000 (16:27 +0000)]
obj_add_attr() now returns head of the attribute value chain, so that you
can easily do:
struct oattr *a;
a = obj_add_attr(obj, NULL, 'x', "val1");
a = obj_add_attr(obj, a, 'x', "val2");
to insert multi-valued attributes quickly.
Martin Mares [Fri, 26 Jan 2001 16:25:49 +0000 (16:25 +0000)]
Added log_fork() which should be called after fork()ing to invalidate cached PID.
Martin Mares [Thu, 25 Jan 2001 16:01:53 +0000 (16:01 +0000)]
Use xmalloc_zero().
Martin Mares [Thu, 25 Jan 2001 14:51:47 +0000 (14:51 +0000)]
Newlines and carriage returns are considered blanks. Carefully checked
all modules using Cspace() and Cblank().
Martin Mares [Tue, 23 Jan 2001 14:59:36 +0000 (14:59 +0000)]
Word type 0 is reserved.
Robert Spalek [Sun, 21 Jan 2001 20:00:58 +0000 (20:00 +0000)]
declaration cosmetic fix
Martin Mares [Sun, 21 Jan 2001 19:48:34 +0000 (19:48 +0000)]
HTML parser basically works. A *lot* of things still needs to be cleaned up.
Robert Spalek [Sun, 21 Jan 2001 19:14:46 +0000 (19:14 +0000)]
memory pool for everything about configuration added
value of CT_FUNCTION callback is now cfg_stralloc()'ed
Martin Mares [Sun, 21 Jan 2001 18:20:23 +0000 (18:20 +0000)]
Added the genhash utility (simple gperf replacement).
Martin Mares [Sun, 21 Jan 2001 17:59:58 +0000 (17:59 +0000)]
Added functions for reading/writing UTF-8 characters on fastbuf streams.
Martin Mares [Sun, 21 Jan 2001 17:59:33 +0000 (17:59 +0000)]
Created a header which will contain description of all data structures
used in the indices. For now, there are word categories.
Martin Mares [Sun, 21 Jan 2001 14:14:04 +0000 (14:14 +0000)]
Added "direct buffer I/O" interface for those who want to avoid an extra
copy of data during read/write at expense of having to be prepared for
any data size the buffering layer tells them to read/write.
Martin Mares [Sun, 21 Jan 2001 11:10:45 +0000 (11:10 +0000)]
Use xmalloc_zero() instead of xmalloc() followed by bzero().
Martin Mares [Sun, 21 Jan 2001 11:09:42 +0000 (11:09 +0000)]
Use xmalloc_zero() instead of xmalloc(), thus squashing several uninitialized
structure field bugs. Also simplified the code a lot.
Martin Mares [Sun, 21 Jan 2001 11:07:58 +0000 (11:07 +0000)]
Introduced mp_alloc_zero().
Martin Mares [Sun, 21 Jan 2001 11:07:40 +0000 (11:07 +0000)]
Introduced xmalloc_zero().
Martin Mares [Wed, 17 Jan 2001 13:10:28 +0000 (13:10 +0000)]
Exported conversions between internal character codes and UCS.
Martin Mares [Mon, 15 Jan 2001 11:30:45 +0000 (11:30 +0000)]
Fixed a couple of bugs.
Robert Spalek [Mon, 15 Jan 2001 10:11:05 +0000 (10:11 +0000)]
rewritten, enhanced, updated, fixed
Martin Mares [Mon, 15 Jan 2001 09:36:26 +0000 (09:36 +0000)]
Added an explanatory comment.
Martin Mares [Sun, 14 Jan 2001 21:46:09 +0000 (21:46 +0000)]
Guards, guards!
Martin Mares [Sun, 14 Jan 2001 20:50:13 +0000 (20:50 +0000)]
Remember to link object.o to the library. :)
Martin Mares [Sun, 14 Jan 2001 20:50:00 +0000 (20:50 +0000)]
Add sh_time_t type.
Martin Mares [Sun, 14 Jan 2001 20:49:49 +0000 (20:49 +0000)]
Allow bclose(NULL).
Martin Mares [Sun, 14 Jan 2001 20:49:41 +0000 (20:49 +0000)]
Guard against multiple inclusion.
Martin Mares [Sun, 14 Jan 2001 20:49:29 +0000 (20:49 +0000)]
Import object attribute handling from old Sherlock.
Martin Mares [Sun, 14 Jan 2001 18:47:06 +0000 (18:47 +0000)]
Keep up with conf.h changes.
Martin Mares [Sun, 14 Jan 2001 18:46:28 +0000 (18:46 +0000)]
Keep in sync with pools.h changes.
Martin Mares [Sun, 14 Jan 2001 18:46:08 +0000 (18:46 +0000)]
Renamed memory pool functions to "mp_*", so they don't clobber name space.
Martin Mares [Sun, 14 Jan 2001 18:45:20 +0000 (18:45 +0000)]
Added DEFAULT_CONFIG define which tells us the name of the master
configuration file.
Added CONSTRUCTOR which is a macro generating __attribute__((constructor))
to make a function called automatically upon program startup. This is ideal
for registering of configuration sections.
Martin Mares [Sun, 14 Jan 2001 18:44:03 +0000 (18:44 +0000)]
Moved #include <getopt.h> to conf.h.
Martin Mares [Sun, 14 Jan 2001 17:51:37 +0000 (17:51 +0000)]
Switch to new logging functions.
Martin Mares [Sun, 14 Jan 2001 17:51:22 +0000 (17:51 +0000)]
New logging mechanism. Also cleaned up lib.h.
Robert Spalek [Sun, 14 Jan 2001 12:16:57 +0000 (12:16 +0000)]
lib/conf.[ch] rewritten
lib/conf-test.c updated
Robert Spalek [Mon, 8 Jan 2001 10:27:00 +0000 (10:27 +0000)]
added tester for conf.c, already tested and it works
Robert Spalek [Mon, 8 Jan 2001 10:26:37 +0000 (10:26 +0000)]
bugfixes
Robert Spalek [Sun, 7 Jan 2001 21:21:53 +0000 (21:21 +0000)]
written parsing config files and command line parameters, untested yet
Martin Mares [Sun, 7 Jan 2001 15:07:24 +0000 (15:07 +0000)]
When checking/fixing bucket file structure, adjust exit code according
to whether the file is clean or not.
Martin Mares [Sun, 7 Jan 2001 15:00:10 +0000 (15:00 +0000)]
Added a simple utility for handling of buckets and repairing damaged
bucket files.
Martin Mares [Sun, 7 Jan 2001 14:59:35 +0000 (14:59 +0000)]
Squashed a signedness warning.
Martin Mares [Sun, 7 Jan 2001 14:58:58 +0000 (14:58 +0000)]
Added sh_ftruncate(), but it currently works only on 32-bit files,
so I've switched the default configuration to 32 bits.
Martin Mares [Sun, 7 Jan 2001 14:58:11 +0000 (14:58 +0000)]
bread() now returns the number of bytes read instead of dying when
the file is too short.
Need to be taken care of when porting old Sherlock code.
Martin Mares [Sun, 7 Jan 2001 14:56:23 +0000 (14:56 +0000)]
Use OBUCK_INCOMPLETE_MAGIC for incomplete entries.
find_first/find_next have a new option allowing to find deleted buckets.
Use sh_p(read|write) instead of p(read|write).
Martin Mares [Fri, 5 Jan 2001 23:22:16 +0000 (23:22 +0000)]
Rewrote to use pread/pwrite() and to avoid messing up with internals
of the fb-file module. It should work with file descriptors shared between
processes (no need to re-open the file after fork()) and should be also
somewhat faster.
Basic functions already work, it remains to create some utilities
and implement compression.
make obj/lib/bucket-t for a small test program.
Martin Mares [Fri, 5 Jan 2001 19:08:21 +0000 (19:08 +0000)]
Preliminary version of object bucket code.
Martin Mares [Mon, 11 Dec 2000 22:57:04 +0000 (22:57 +0000)]
Added support for memory debuggers (efence and [untested] dmalloc), just
enable in the top-level Makefile.
All modules should use xmalloc/xrealloc/xfree (or memory pools) to allocate
their data. These functions automatically check for errors and they also
translate to dmalloc calls if needed.
Martin Mares [Mon, 11 Dec 2000 22:12:13 +0000 (22:12 +0000)]
Fixed includes, defines and copyrights.
Martin Mares [Wed, 6 Dec 2000 15:01:24 +0000 (15:01 +0000)]
#include <errno.h>
Martin Mares [Tue, 5 Dec 2000 23:13:18 +0000 (23:13 +0000)]
Added a couple of FIXME's.
Martin Mares [Tue, 5 Dec 2000 23:07:33 +0000 (23:07 +0000)]
Revived the old chartype macros.
Martin Mares [Tue, 5 Dec 2000 23:00:55 +0000 (23:00 +0000)]
Add path to all includes.
Martin Mares [Tue, 5 Dec 2000 23:00:25 +0000 (23:00 +0000)]
Add directory to all includes.
Martin Mares [Tue, 5 Dec 2000 22:45:42 +0000 (22:45 +0000)]
Squashed a couple of warnings.
Martin Mares [Tue, 5 Dec 2000 22:39:59 +0000 (22:39 +0000)]
Added build tools.
Martin Mares [Tue, 5 Dec 2000 22:39:13 +0000 (22:39 +0000)]
New Makefiles.
Martin Mares [Tue, 5 Dec 2000 22:38:52 +0000 (22:38 +0000)]
Added character class definitions, formerly in lib/string.h.
Martin Mares [Sun, 29 Oct 2000 12:26:42 +0000 (12:26 +0000)]
Implemented long tags and section limits (must be enabled in config.h first).
Martin Mares [Sat, 28 Oct 2000 14:01:28 +0000 (14:01 +0000)]
New I/O library.
Martin Mares [Wed, 2 Aug 2000 09:52:56 +0000 (09:52 +0000)]
Support database sizes from 2G to 4G.
Martin Mares [Wed, 2 Aug 2000 09:13:38 +0000 (09:13 +0000)]
Don't cry when pread/pwrite is not available.
Martin Mares [Wed, 2 Aug 2000 09:00:56 +0000 (09:00 +0000)]
Rewrote large file support. Instead of mucking with SHERLOCK_O_LARGEFILE
all the way around, just use sh_{open,pread,pwrite} instead of the
ordinary calls.
Martin Mares [Thu, 13 Jul 2000 21:35:52 +0000 (21:35 +0000)]
Added "-t" switch.
Martin Mares [Thu, 13 Jul 2000 21:35:14 +0000 (21:35 +0000)]
Recalculate directory position when cascading splits.
Martin Mares [Thu, 13 Jul 2000 21:34:49 +0000 (21:34 +0000)]
Added a utility for rebuilding of databases.
Martin Mares [Thu, 13 Jul 2000 20:17:30 +0000 (20:17 +0000)]
Replaced the hash function. Damn it!