mj.ucw.cz Git - libucw.git/log
Robert Spalek [Mon, 5 Mar 2001 09:02:11 +0000 (09:02 +0000)]
cf_default_init() replaced by direct access to cfdeffile; the default value
is DEFAULT_CONFIG
cf_default_done() replaced by an automatic call when getopt() returns -1

Robert Spalek [Sun, 4 Mar 2001 15:21:50 +0000 (15:21 +0000)]
cf_default_{init,done} interface used instead of cf_read

Robert Spalek [Sun, 4 Mar 2001 15:19:26 +0000 (15:19 +0000)]
added cf_default_{init,done} for setting the default config filename, which
will be read automatically unless overridden by a command-line option

Robert Spalek [Sun, 4 Mar 2001 15:18:08 +0000 (15:18 +0000)]
added obuck_size() returning the size of the bucket file (so that gatherd.c
can stop after the maximal size is reached)

Martin Mares [Sat, 3 Mar 2001 22:50:29 +0000 (22:50 +0000)]
Moved fp_hash() to index.h.

Martin Mares [Sat, 3 Mar 2001 17:22:16 +0000 (17:22 +0000)]
Define FASTBUF_BYTES_PER_(O|P).

Martin Mares [Sat, 3 Mar 2001 13:39:36 +0000 (13:39 +0000)]
Added indexer names for word and string type classes.

Martin Mares [Sat, 3 Mar 2001 12:12:35 +0000 (12:12 +0000)]
Defined which string classes contain URLs and which ones contain
case-insensitive strings.

Robert Spalek [Fri, 2 Mar 2001 13:36:47 +0000 (13:36 +0000)]
added

Martin Mares [Fri, 2 Mar 2001 11:30:11 +0000 (11:30 +0000)]
Updated the charset conversion library to Unicode 3.0.

Removed ice age relics.

Removed signature tables as they are not used anyway.

Martin Mares [Fri, 2 Mar 2001 11:00:33 +0000 (11:00 +0000)]
Replaced <sys/time.h> by <time.h> where appropriate.

Martin Mares [Thu, 1 Mar 2001 17:31:47 +0000 (17:31 +0000)]
Fixed bug in generating UTF-8 for codes >= 0x800.
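
For context, 0x800 is the code point at which UTF-8 switches from two-byte to three-byte sequences. The following is a minimal sketch of a standard encoder, not the code this commit touched; the name utf8_put_sketch is hypothetical, and surrogate code points are not rejected.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of the library): encode code point u as
 * UTF-8 into buf, which must hold at least 4 bytes. Returns the number of
 * bytes written, or 0 if u is above U+10FFFF. Surrogates are not rejected. */
size_t utf8_put_sketch(unsigned char *buf, unsigned long u)
{
    assert(buf != NULL);
    if (u < 0x80) {                     /* 1 byte: 0xxxxxxx */
        buf[0] = (unsigned char)u;
        return 1;
    }
    if (u < 0x800) {                    /* 2 bytes: 110xxxxx 10xxxxxx */
        buf[0] = (unsigned char)(0xc0 | (u >> 6));
        buf[1] = (unsigned char)(0x80 | (u & 0x3f));
        return 2;
    }
    if (u < 0x10000) {                  /* 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx */
        buf[0] = (unsigned char)(0xe0 | (u >> 12));
        buf[1] = (unsigned char)(0x80 | ((u >> 6) & 0x3f));
        buf[2] = (unsigned char)(0x80 | (u & 0x3f));
        return 3;
    }
    if (u < 0x110000) {                 /* 4 bytes: 11110xxx followed by 3 continuation bytes */
        buf[0] = (unsigned char)(0xf0 | (u >> 18));
        buf[1] = (unsigned char)(0x80 | ((u >> 12) & 0x3f));
        buf[2] = (unsigned char)(0x80 | ((u >> 6) & 0x3f));
        buf[3] = (unsigned char)(0x80 | (u & 0x3f));
        return 4;
    }
    return 0;
}
```

The three-byte branch is the one that first applies at 0x800: the middle byte must carry bits 6 to 11 masked with 0x3f, which is an easy place to slip.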

Martin Mares [Thu, 1 Mar 2001 16:57:58 +0000 (16:57 +0000)]
Defined a GET_TAGGED_CHAR macro to read our internal representation
of tagged text, mapping the tags to character codes >= 0x80000000.

Martin Mares [Fri, 23 Feb 2001 14:02:46 +0000 (14:02 +0000)]
Generate index cards.

The chewer is complete, but it will probably need a bit of optimization
and fine-tuning when we get some real data. The search for words present
in the cache doesn't look good.

Martin Mares [Fri, 23 Feb 2001 10:53:32 +0000 (10:53 +0000)]
Indexing of strings.

Martin Mares [Thu, 22 Feb 2001 16:01:49 +0000 (16:01 +0000)]
Some more chewer work...

Martin Mares [Tue, 20 Feb 2001 22:32:06 +0000 (22:32 +0000)]
Added bgets0().

Martin Mares [Tue, 20 Feb 2001 17:51:24 +0000 (17:51 +0000)]
Added a useful macro for value clamping.
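
The commit does not show the macro itself. A clamping macro of this kind is commonly written as below; the name CLAMP_SKETCH and the helper clamp_percent are hypothetical, chosen only for illustration.

```c
#include <assert.h>

/* Hypothetical name; the actual macro added by the commit is not shown.
 * The arguments are evaluated more than once, so avoid side effects. */
#define CLAMP_SKETCH(x, lo, hi) ((x) < (lo) ? (lo) : ((x) > (hi) ? (hi) : (x)))

/* Example use: keep a percentage within 0..100. */
int clamp_percent(int value)
{
    int r = CLAMP_SKETCH(value, 0, 100);
    assert(r >= 0 && r <= 100);
    return r;
}
```

Being a macro, it works for any arithmetic type, at the cost of the usual double-evaluation caveat.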

Martin Mares [Mon, 19 Feb 2001 19:08:56 +0000 (19:08 +0000)]
Oops, breadb() was wrong.

Martin Mares [Mon, 19 Feb 2001 18:53:46 +0000 (18:53 +0000)]
Added breadb(), which acts just like bread() but die()s if a partial
record is read. This is mainly to avoid consistency checks in the main
code path.
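
The idea can be sketched on top of stdio rather than the library's own buffered streams. Everything here is an assumption made for illustration: the name read_record_or_die, the use of FILE *, and the choice to return 0 at a clean end of file.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical stand-in for the pattern described above, built on stdio.
 * Returns len for a full record, 0 at a clean end of file, and terminates
 * the program on a partial record or a read error. */
size_t read_record_or_die(FILE *f, void *buf, size_t len)
{
    assert(f != NULL && buf != NULL && len > 0);
    size_t got = fread(buf, 1, len, f);
    if (got == len)
        return len;
    if (got == 0 && !ferror(f))
        return 0;               /* clean EOF: no bytes of a new record */
    fprintf(stderr, "read_record_or_die: wanted %zu bytes, got %zu\n", len, got);
    exit(1);
}
```

A caller can then loop with while (read_record_or_die(f, &rec, sizeof rec)) and never has to handle a short record itself.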

Martin Mares [Fri, 16 Feb 2001 20:16:51 +0000 (20:16 +0000)]
SORT_DELETE_INPUT works even with SORT_INPUT_FB.

Martin Mares [Fri, 16 Feb 2001 20:16:26 +0000 (20:16 +0000)]
Declare fingerprints as 12 bytes, not 3 u32's.

Martin Mares [Fri, 16 Feb 2001 20:16:00 +0000 (20:16 +0000)]
Added #define PACKED __attribute__((packed)).
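
A small sketch of what the attribute does, assuming GCC or Clang. The struct names and fields are hypothetical and are not taken from the library.

```c
#include <assert.h>
#include <stdint.h>

#define PACKED __attribute__((packed))

/* Hypothetical record layouts, for illustration only. */
struct plain_ref_sketch {
    uint8_t type;
    uint32_t offset;            /* compiler may insert padding before this */
};

struct packed_ref_sketch {
    uint8_t type;
    uint32_t offset;            /* no padding: stored right after type */
} PACKED;

/* The packed layout is exactly the sum of its members. */
static_assert(sizeof(struct packed_ref_sketch) == 5,
              "packed struct must not contain padding");
```

Packing matters for structures that mirror an on-disk format, where padding inserted by the compiler would change the record size.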

Martin Mares [Fri, 16 Feb 2001 18:54:31 +0000 (18:54 +0000)]
Added merger.

Martin Mares [Fri, 16 Feb 2001 17:56:06 +0000 (17:56 +0000)]
Added unmapping and writeable mappings.

Martin Mares [Fri, 16 Feb 2001 16:16:25 +0000 (16:16 +0000)]
Scanner improvements: create redirect backlinks, detect empty documents,
mark accented documents.

Martin Mares [Thu, 15 Feb 2001 19:17:47 +0000 (19:17 +0000)]
Testing programs are not built by default.

Martin Mares [Thu, 15 Feb 2001 19:05:39 +0000 (19:05 +0000)]
Added URL fingerprints.

Martin Mares [Thu, 15 Feb 2001 19:04:57 +0000 (19:04 +0000)]
Added bputs0() -- put a null-terminated string.

Martin Mares [Sat, 10 Feb 2001 12:28:22 +0000 (12:28 +0000)]
Shut up warnings.

Robert Spalek [Fri, 9 Feb 2001 10:48:55 +0000 (10:48 +0000)]
added cf_item_count()

Robert Spalek [Fri, 9 Feb 2001 10:48:17 +0000 (10:48 +0000)]
deleted unused variable prog

Robert Spalek [Fri, 9 Feb 2001 10:48:02 +0000 (10:48 +0000)]
added sort-test

Martin Mares [Sun, 4 Feb 2001 20:08:14 +0000 (20:08 +0000)]
Next version of the sorter -- both presorting and unifying works.

sort-test now does just `sort -u' and it's about 30% slower than its
GNU counterpart, probably due to extra copies of sorting keys by our
buffered I/O layer. Fortunately, a typical case will be long data with
short keys where we should be efficient as we can use bbcopy().

Robert Spalek [Sun, 4 Feb 2001 15:29:15 +0000 (15:29 +0000)]
cf_get_item added

Martin Mares [Sun, 4 Feb 2001 14:44:54 +0000 (14:44 +0000)]
First version of the sorter. No presorting phase yet.

Martin Mares [Sun, 4 Feb 2001 14:44:00 +0000 (14:44 +0000)]
Added "is_temp_file" attribute which causes automatic deletion of the
file upon bclose().
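
The behaviour can be pictured with a small stdio-based sketch. The struct and function names are hypothetical; the library's own stream type and bclose() are not reproduced here.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical stream wrapper, for illustration only. */
struct stream_sketch {
    FILE *f;
    const char *name;           /* path the stream was opened from */
    int is_temp_file;           /* nonzero: delete the file on close */
};

/* Close the stream and, if it is flagged as temporary, remove the file.
 * Returns 0 on success and a nonzero value if closing or removing failed. */
int stream_close_sketch(struct stream_sketch *s)
{
    assert(s != NULL && s->f != NULL);
    int rc = fclose(s->f);
    s->f = NULL;
    if (s->is_temp_file && remove(s->name) != 0)
        rc = -1;
    return rc;
}
```

Tying deletion to the close call means intermediate files cannot be left behind by a code path that forgets a separate cleanup step.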

Martin Mares [Sun, 28 Jan 2001 21:40:27 +0000 (21:40 +0000)]
Don't log pid until log_fork() is called.

Martin Mares [Sun, 28 Jan 2001 20:50:23 +0000 (20:50 +0000)]
Added __attribute__((format...)) to the declaration of log(), so that
mismatches between the format string and its arguments are easily found.
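
A minimal sketch of the attribute on a printf-style function, assuming GCC or Clang. The function name log_msg_sketch and its signature are hypothetical; the real log() declaration is not shown in the commit.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical logger. format(printf, 1, 2) says: argument 1 is the format
 * string and the variadic arguments to check start at argument 2. */
int log_msg_sketch(const char *fmt, ...) __attribute__((format(printf, 1, 2)));

int log_msg_sketch(const char *fmt, ...)
{
    assert(fmt != NULL);
    va_list args;
    va_start(args, fmt);
    int n = vfprintf(stderr, fmt, args);    /* number of characters written */
    va_end(args);
    return n;
}
```

With the attribute in place, a call such as log_msg_sketch("%s", 42) draws a compile-time warning when format checking (for example -Wall) is enabled.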

Martin Mares [Fri, 26 Jan 2001 16:28:09 +0000 (16:28 +0000)]
oid's above OBUCK_OID_FIRST_SPECIAL are reserved for encoding of
error codes and other stuff.

Martin Mares [Fri, 26 Jan 2001 16:27:38 +0000 (16:27 +0000)]
obj_add_attr() now returns head of the attribute value chain, so that you
can easily do:

struct oattr *a;
a = obj_add_attr(obj, NULL, 'x', "val1");
a = obj_add_attr(obj, a, 'x', "val2");

to insert multi-valued attributes quickly.

Martin Mares [Fri, 26 Jan 2001 16:25:49 +0000 (16:25 +0000)]
Added log_fork() which should be called after fork()ing to invalidate cached PID.
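
The caching problem can be sketched as follows. The function names are hypothetical and the logging code itself is omitted; only the invalidation pattern is shown.

```c
#include <assert.h>
#include <sys/types.h>
#include <unistd.h>

static pid_t cached_pid;        /* 0 means "not cached yet" */

/* Return the process ID, calling getpid() only on first use. */
pid_t log_pid_sketch(void)
{
    if (cached_pid == 0)
        cached_pid = getpid();
    assert(cached_pid > 0);
    return cached_pid;
}

/* To be called in the child after fork(): the child inherits the parent's
 * cached value, which would otherwise be logged as its own PID. */
void log_fork_sketch(void)
{
    cached_pid = 0;
}
```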

Martin Mares [Thu, 25 Jan 2001 16:01:53 +0000 (16:01 +0000)]
Use xmalloc_zero().

Martin Mares [Thu, 25 Jan 2001 14:51:47 +0000 (14:51 +0000)]
Newlines and carriage returns are considered blanks. Carefully checked
all modules using Cspace() and Cblank().

Martin Mares [Tue, 23 Jan 2001 14:59:36 +0000 (14:59 +0000)]
Word type 0 is reserved.

Robert Spalek [Sun, 21 Jan 2001 20:00:58 +0000 (20:00 +0000)]
declaration cosmetic fix

Martin Mares [Sun, 21 Jan 2001 19:48:34 +0000 (19:48 +0000)]
HTML parser basically works. A *lot* of things still need to be cleaned up.

Robert Spalek [Sun, 21 Jan 2001 19:14:46 +0000 (19:14 +0000)]
memory pool for everything about configuration added
value of CT_FUNCTION callback is now cfg_stralloc()'ed

Martin Mares [Sun, 21 Jan 2001 18:20:23 +0000 (18:20 +0000)]
Added the genhash utility (simple gperf replacement).

Martin Mares [Sun, 21 Jan 2001 17:59:58 +0000 (17:59 +0000)]
Added functions for reading/writing UTF-8 characters on fastbuf streams.

Martin Mares [Sun, 21 Jan 2001 17:59:33 +0000 (17:59 +0000)]
Created a header which will contain description of all data structures
used in the indices. For now, there are word categories.

Martin Mares [Sun, 21 Jan 2001 14:14:04 +0000 (14:14 +0000)]
Added "direct buffer I/O" interface for those who want to avoid an extra
copy of data during read/write at the expense of having to be prepared for
any data size the buffering layer tells them to read/write.

Martin Mares [Sun, 21 Jan 2001 11:10:45 +0000 (11:10 +0000)]
Use xmalloc_zero() instead of xmalloc() followed by bzero().

Martin Mares [Sun, 21 Jan 2001 11:09:42 +0000 (11:09 +0000)]
Use xmalloc_zero() instead of xmalloc(), thus squashing several uninitialized
structure field bugs. Also simplified the code a lot.
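
A checked, zeroing allocator of this kind might look like the sketch below. The names carry a _sketch suffix to make clear they are illustrative, not the library's xmalloc() and xmalloc_zero(); the error handling shown is an assumption.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Allocate size bytes or terminate the program; never returns NULL. */
void *xmalloc_sketch(size_t size)
{
    void *p = malloc(size ? size : 1);  /* avoid the malloc(0) corner case */
    if (p == NULL) {
        fprintf(stderr, "out of memory (%zu bytes requested)\n", size);
        exit(1);
    }
    return p;
}

/* Same as above, but the returned block is filled with zero bytes. */
void *xmalloc_zero_sketch(size_t size)
{
    void *p = xmalloc_sketch(size);
    memset(p, 0, size);
    return p;
}

/* Hypothetical structure showing the benefit: no field starts out garbage. */
struct counter_sketch { int hits; int misses; };

struct counter_sketch *counter_new_sketch(void)
{
    struct counter_sketch *c = xmalloc_zero_sketch(sizeof *c);
    assert(c->hits == 0 && c->misses == 0);
    return c;
}
```

A field added to the structure later is zeroed as well, without touching any of the call sites.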

Martin Mares [Sun, 21 Jan 2001 11:07:58 +0000 (11:07 +0000)]
Introduced mp_alloc_zero().

Martin Mares [Sun, 21 Jan 2001 11:07:40 +0000 (11:07 +0000)]
Introduced xmalloc_zero().

Martin Mares [Wed, 17 Jan 2001 13:10:28 +0000 (13:10 +0000)]
Exported conversions between internal character codes and UCS.

Martin Mares [Mon, 15 Jan 2001 11:30:45 +0000 (11:30 +0000)]
Fixed a couple of bugs.

Robert Spalek [Mon, 15 Jan 2001 10:11:05 +0000 (10:11 +0000)]
rewritten, enhanced, updated, fixed

Martin Mares [Mon, 15 Jan 2001 09:36:26 +0000 (09:36 +0000)]
Added an explanatory comment.

Martin Mares [Sun, 14 Jan 2001 21:46:09 +0000 (21:46 +0000)]
Guards, guards!

Martin Mares [Sun, 14 Jan 2001 20:50:13 +0000 (20:50 +0000)]
Remember to link object.o to the library. :)

Martin Mares [Sun, 14 Jan 2001 20:50:00 +0000 (20:50 +0000)]
Add sh_time_t type.

Martin Mares [Sun, 14 Jan 2001 20:49:49 +0000 (20:49 +0000)]
Allow bclose(NULL).

Martin Mares [Sun, 14 Jan 2001 20:49:41 +0000 (20:49 +0000)]
Guard against multiple inclusion.

Martin Mares [Sun, 14 Jan 2001 20:49:29 +0000 (20:49 +0000)]
Import object attribute handling from old Sherlock.

Martin Mares [Sun, 14 Jan 2001 18:47:06 +0000 (18:47 +0000)]
Keep up with conf.h changes.

Martin Mares [Sun, 14 Jan 2001 18:46:28 +0000 (18:46 +0000)]
Keep in sync with pools.h changes.

Martin Mares [Sun, 14 Jan 2001 18:46:08 +0000 (18:46 +0000)]
Renamed memory pool functions to "mp_*", so they don't clobber name space.

Martin Mares [Sun, 14 Jan 2001 18:45:20 +0000 (18:45 +0000)]
Added DEFAULT_CONFIG define which tells us the name of the master
configuration file.

Added CONSTRUCTOR, a macro generating __attribute__((constructor)), which
makes a function run automatically upon program startup. This is ideal
for registering configuration sections.
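
A sketch of the registration pattern, assuming GCC or Clang. The macro body follows the description above; the registry, its size limit and the function names are hypothetical and do not reflect the real configuration API.

```c
#include <assert.h>
#include <stddef.h>

#define CONSTRUCTOR __attribute__((constructor))

/* Hypothetical registry of configuration section names. */
#define MAX_SECTIONS_SKETCH 16
const char *section_names_sketch[MAX_SECTIONS_SKETCH];
size_t section_count_sketch;

void register_section_sketch(const char *name)
{
    assert(section_count_sketch < MAX_SECTIONS_SKETCH);
    section_names_sketch[section_count_sketch++] = name;
}

/* Runs before main(), so the module registers itself without main()
 * having to know that it exists. */
static void CONSTRUCTOR register_demo_section_sketch(void)
{
    register_section_sketch("demo");
}
```

Each module can carry its own constructor, so linking a module in is enough to make its section known.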

Martin Mares [Sun, 14 Jan 2001 18:44:03 +0000 (18:44 +0000)]
Moved #include <getopt.h> to conf.h.

Martin Mares [Sun, 14 Jan 2001 17:51:37 +0000 (17:51 +0000)]
Switch to new logging functions.

Martin Mares [Sun, 14 Jan 2001 17:51:22 +0000 (17:51 +0000)]
New logging mechanism. Also cleaned up lib.h.

Robert Spalek [Sun, 14 Jan 2001 12:16:57 +0000 (12:16 +0000)]
lib/conf.[ch] rewritten
lib/conf-test.c updated

Robert Spalek [Mon, 8 Jan 2001 10:27:00 +0000 (10:27 +0000)]
added tester for conf.c, already tested and it works

Robert Spalek [Mon, 8 Jan 2001 10:26:37 +0000 (10:26 +0000)]
bugfixes

Robert Spalek [Sun, 7 Jan 2001 21:21:53 +0000 (21:21 +0000)]
wrote parsing of config files and command-line parameters, not yet tested

Martin Mares [Sun, 7 Jan 2001 15:07:24 +0000 (15:07 +0000)]
When checking/fixing bucket file structure, adjust exit code according
to whether the file is clean or not.

Martin Mares [Sun, 7 Jan 2001 15:00:10 +0000 (15:00 +0000)]
Added a simple utility for handling of buckets and repairing damaged
bucket files.

Martin Mares [Sun, 7 Jan 2001 14:59:35 +0000 (14:59 +0000)]
Squashed a signedness warning.

Martin Mares [Sun, 7 Jan 2001 14:58:58 +0000 (14:58 +0000)]
Added sh_ftruncate(), but it currently works only on 32-bit files,
so I've switched the default configuration to 32 bits.

Martin Mares [Sun, 7 Jan 2001 14:58:11 +0000 (14:58 +0000)]
bread() now returns the number of bytes read instead of dying when
the file is too short.

This needs to be taken care of when porting old Sherlock code.

Martin Mares [Sun, 7 Jan 2001 14:56:23 +0000 (14:56 +0000)]
Use OBUCK_INCOMPLETE_MAGIC for incomplete entries.
find_first/find_next have a new option that allows finding deleted buckets.
Use sh_p(read|write) instead of p(read|write).

Martin Mares [Fri, 5 Jan 2001 23:22:16 +0000 (23:22 +0000)]
Rewrote to use pread()/pwrite() and to avoid messing with the internals
of the fb-file module. It should work with file descriptors shared between
processes (no need to re-open the file after fork()) and should also be
somewhat faster.

Basic functions already work, it remains to create some utilities
and implement compression.

Run make obj/lib/bucket-t to build a small test program.
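
Why positional I/O suits shared descriptors can be seen in a small POSIX sketch: pread() and pwrite() take an explicit offset and leave the descriptor's file position untouched, so processes sharing a descriptor do not disturb each other's position. The helper names and the demo below are hypothetical and unrelated to the bucket code itself.

```c
#include <assert.h>
#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>

/* Write len bytes at absolute offset pos; 0 on success, -1 on error. */
int write_at_sketch(int fd, const void *buf, size_t len, off_t pos)
{
    const char *p = buf;
    assert(fd >= 0);
    while (len > 0) {
        ssize_t n = pwrite(fd, p, len, pos);
        if (n <= 0)
            return -1;
        p += n; pos += n; len -= (size_t)n;
    }
    return 0;
}

/* Read len bytes from absolute offset pos; 0 on success, -1 on error
 * or if the file ends before len bytes were read. */
int read_at_sketch(int fd, void *buf, size_t len, off_t pos)
{
    char *p = buf;
    assert(fd >= 0);
    while (len > 0) {
        ssize_t n = pread(fd, p, len, pos);
        if (n <= 0)
            return -1;
        p += n; pos += n; len -= (size_t)n;
    }
    return 0;
}

/* Demo: write and read back at offset 100, then check that the file
 * position of the descriptor is still 0. Returns 0 if all of that holds. */
int positional_io_demo_sketch(void)
{
    FILE *tmp = tmpfile();
    if (tmp == NULL)
        return -1;
    int fd = fileno(tmp);
    char buf[4];
    int ok = write_at_sketch(fd, "tail", 4, 100) == 0
          && read_at_sketch(fd, buf, 4, 100) == 0
          && buf[0] == 't' && buf[3] == 'l'
          && lseek(fd, 0, SEEK_CUR) == 0;
    fclose(tmp);
    return ok ? 0 : -1;
}
```

The sketch does not retry on EINTR, which production code would normally handle.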

Martin Mares [Fri, 5 Jan 2001 19:08:21 +0000 (19:08 +0000)]
Preliminary version of object bucket code.

Martin Mares [Mon, 11 Dec 2000 22:57:04 +0000 (22:57 +0000)]
Added support for memory debuggers (efence and [untested] dmalloc), just
enable in the top-level Makefile.

All modules should use xmalloc/xrealloc/xfree (or memory pools) to allocate
their data. These functions automatically check for errors and they also
translate to dmalloc calls if needed.

Martin Mares [Mon, 11 Dec 2000 22:12:13 +0000 (22:12 +0000)]
Fixed includes, defines and copyrights.

Martin Mares [Wed, 6 Dec 2000 15:01:24 +0000 (15:01 +0000)]
#include <errno.h>

Martin Mares [Tue, 5 Dec 2000 23:13:18 +0000 (23:13 +0000)]
Added a couple of FIXME's.

Martin Mares [Tue, 5 Dec 2000 23:07:33 +0000 (23:07 +0000)]
Revived the old chartype macros.

Martin Mares [Tue, 5 Dec 2000 23:00:55 +0000 (23:00 +0000)]
Add path to all includes.

Martin Mares [Tue, 5 Dec 2000 23:00:25 +0000 (23:00 +0000)]
Add directory to all includes.

Martin Mares [Tue, 5 Dec 2000 22:45:42 +0000 (22:45 +0000)]
Squashed a couple of warnings.

Martin Mares [Tue, 5 Dec 2000 22:39:59 +0000 (22:39 +0000)]
Added build tools.

Martin Mares [Tue, 5 Dec 2000 22:39:13 +0000 (22:39 +0000)]
New Makefiles.

Martin Mares [Tue, 5 Dec 2000 22:38:52 +0000 (22:38 +0000)]
Added character class definitions, formerly in lib/string.h.

Martin Mares [Sun, 29 Oct 2000 12:26:42 +0000 (12:26 +0000)]
Implemented long tags and section limits (must be enabled in config.h first).

Martin Mares [Sat, 28 Oct 2000 14:01:28 +0000 (14:01 +0000)]
New I/O library.

Martin Mares [Wed, 2 Aug 2000 09:52:56 +0000 (09:52 +0000)]
Support database sizes from 2G to 4G.

Martin Mares [Wed, 2 Aug 2000 09:13:38 +0000 (09:13 +0000)]
Don't cry when pread/pwrite is not available.