]> mj.ucw.cz Git - libucw.git/log
libucw.git
20 years agoAdded xstrdup() and new logging functions.
Martin Mares [Mon, 10 May 2004 14:11:34 +0000 (14:11 +0000)]
Added xstrdup() and new logging functions.

20 years agoAdded functions for manual control of log switching. (Will be used by the
Martin Mares [Mon, 10 May 2004 14:11:12 +0000 (14:11 +0000)]
Added functions for manual control of log switching. (Will be used by the
shepherd master.)

20 years agocfg_stralloc() -> cfg_strdup() and use mp_strdup() to implement it.
Martin Mares [Mon, 10 May 2004 14:10:39 +0000 (14:10 +0000)]
cfg_stralloc() -> cfg_strdup() and use mp_strdup() to implement it.

20 years agoRenamed stralloc() to xstrdup() to be consistent with libc terminology
Martin Mares [Mon, 10 May 2004 14:10:12 +0000 (14:10 +0000)]
Renamed stralloc() to xstrdup() to be consistent with libc terminology
and also with mp_strdup().

20 years agoOops, this worked only by a chance.
Martin Mares [Mon, 10 May 2004 13:13:05 +0000 (13:13 +0000)]
Oops, this worked only by a chance.

20 years agoAdded the new pool-str module.
Martin Mares [Mon, 3 May 2004 10:50:31 +0000 (10:50 +0000)]
Added the new pool-str module.

20 years agoAdded mp_strcat() and mp_multicat().
Martin Mares [Mon, 3 May 2004 10:45:18 +0000 (10:45 +0000)]
Added mp_strcat() and mp_multicat().

20 years agoFixed the comment.
Martin Mares [Tue, 20 Apr 2004 16:02:33 +0000 (16:02 +0000)]
Fixed the comment.

20 years ago- 0x08 (BACKSPACE) is a blank character and it is accepted as an ASCII-character
Robert Spalek [Mon, 19 Apr 2004 16:41:45 +0000 (16:41 +0000)]
- 0x08 (BACKSPACE) is a blank character and it is accepted as an ASCII-character
- 0x7f is also accepted as an ASCII-character
- both gather/content.c and gather/charset.c now use the same function
  Cblank() to test it

20 years agoChanged locking rules. Scans and appends can peacefully co-exist now.
Martin Mares [Sun, 18 Apr 2004 13:39:32 +0000 (13:39 +0000)]
Changed locking rules. Scans and appends can peacefully co-exist now.
Should solve the problems with shep-reap waiting for bucket file transmission
to finish.

20 years agoWhen logging to a file, redirect fd1 to the log file as well.
Martin Mares [Fri, 16 Apr 2004 15:32:56 +0000 (15:32 +0000)]
When logging to a file, redirect fd1 to the log file as well.

20 years agoMulti-part objects (with header and body separated by an empty line and terminated
Martin Mares [Sat, 10 Apr 2004 20:36:01 +0000 (20:36 +0000)]
Multi-part objects (with header and body separated by an empty line and terminated
either by EOF or by a NUL byte) are very common, so let's introduce a special
function for reading them.

20 years agoGot tired by repeating the same `gather pieces from incomplete reads' subroutine.
Martin Mares [Sat, 10 Apr 2004 20:35:24 +0000 (20:35 +0000)]
Got tired by repeating the same `gather pieces from incomplete reads' subroutine.

20 years agoAdded a simple test.
Martin Mares [Sat, 10 Apr 2004 15:13:16 +0000 (15:13 +0000)]
Added a simple test.

20 years agoAdd new modules.
Martin Mares [Sat, 10 Apr 2004 14:45:56 +0000 (14:45 +0000)]
Add new modules.

20 years agoUse format_exit_status(). One more ASSERT.
Martin Mares [Sat, 10 Apr 2004 14:45:47 +0000 (14:45 +0000)]
Use format_exit_status(). One more ASSERT.

20 years agoIncluding <stdarg.h> in lib/lib.h enables us to finally export vlog_msg()
Martin Mares [Sat, 10 Apr 2004 14:45:19 +0000 (14:45 +0000)]
Including <stdarg.h> in lib/lib.h enables us to finally export vlog_msg()
which I was missing for several times.

20 years agoAdded exitstatus and runcmd functions. Finally decided to include <stdarg.h>
Martin Mares [Sat, 10 Apr 2004 14:44:40 +0000 (14:44 +0000)]
Added exitstatus and runcmd functions. Finally decided to include <stdarg.h>
in lib.h.

20 years agoAdded a couple of functions for running of external commands with error
Martin Mares [Sat, 10 Apr 2004 14:43:55 +0000 (14:43 +0000)]
Added a couple of functions for running of external commands with error
checking. A nice replacement for system().

20 years agoAdded a new module for formatting of process exit status messages.
Martin Mares [Sat, 10 Apr 2004 14:43:30 +0000 (14:43 +0000)]
Added a new module for formatting of process exit status messages.

20 years agoAdded one more check.
Martin Mares [Sat, 10 Apr 2004 13:05:05 +0000 (13:05 +0000)]
Added one more check.

20 years agoMore enhancement to the main loop library: Export all lists for easy inspection
Martin Mares [Thu, 8 Apr 2004 22:18:19 +0000 (22:18 +0000)]
More enhancement to the main loop library: Export all lists for easy inspection
(reading only) by the callers. When a process exits, construct a nice tombstone
string for it.

20 years agoRead and write functions accept void pointers instead of byte*.
Martin Mares [Thu, 8 Apr 2004 21:16:16 +0000 (21:16 +0000)]
Read and write functions accept void pointers instead of byte*.

20 years agoAdded a universal main loop with timers, file descriptor polling and process
Martin Mares [Wed, 7 Apr 2004 22:03:30 +0000 (22:03 +0000)]
Added a universal main loop with timers, file descriptor polling and process
watching. Inspired by the glib main loop, but this one has a much nicer
interface.

It will be used in the Shepherd master and if it turns out to be useful,
I'll convert the other programs to use it some day.

20 years agoInterface to our own regex library.
Martin Mares [Sat, 20 Mar 2004 23:25:31 +0000 (23:25 +0000)]
Interface to our own regex library.

20 years agoBetter explanation.
Martin Mares [Sat, 20 Mar 2004 23:23:57 +0000 (23:23 +0000)]
Better explanation.

20 years agoAfter another couple of hours spent digging in regular expression libraries
Martin Mares [Sat, 20 Mar 2004 23:22:52 +0000 (23:22 +0000)]
After another couple of hours spent digging in regular expression libraries
I decided to use a copy of the glibc 2.3.2 regex routines.

20 years agoOne more deadly testcase.
Martin Mares [Sat, 20 Mar 2004 22:31:31 +0000 (22:31 +0000)]
One more deadly testcase.

20 years agoAdded configuration options for libpcre.
Martin Mares [Thu, 18 Mar 2004 22:15:48 +0000 (22:15 +0000)]
Added configuration options for libpcre.

20 years agoConverted regex module testing to the new test framework.
Martin Mares [Mon, 15 Mar 2004 22:57:11 +0000 (22:57 +0000)]
Converted regex module testing to the new test framework.

20 years agoAdded a very simple utility for performing unit tests on modules.
Martin Mares [Mon, 15 Mar 2004 22:56:20 +0000 (22:56 +0000)]
Added a very simple utility for performing unit tests on modules.

20 years agoOur regex functions are now able to interface to old-style BSD re_match(),
Martin Mares [Sun, 14 Mar 2004 12:58:40 +0000 (12:58 +0000)]
Our regex functions are now able to interface to old-style BSD re_match(),
to POSIX regexec() and to libpcre.  Currently it's switched to the BSD mode
as before, I'll look at it more in the evening.

20 years agoMake regex-t compile.
Martin Mares [Sat, 13 Mar 2004 22:31:55 +0000 (22:31 +0000)]
Make regex-t compile.

20 years agoWelcome to v3.2 world on the mainline :)
Martin Mares [Sat, 13 Mar 2004 14:39:34 +0000 (14:39 +0000)]
Welcome to v3.2 world on the mainline :)

20 years agoSeek to end of an empty fbmem stream needs a special exception.
Martin Mares [Fri, 12 Mar 2004 13:54:46 +0000 (13:54 +0000)]
Seek to end of an empty fbmem stream needs a special exception.

20 years agoCtrl-L is marked blank
Robert Spalek [Mon, 8 Mar 2004 14:20:13 +0000 (14:20 +0000)]
Ctrl-L is marked blank

20 years agoWhen we try to create a temporary file and it already exists (which can happen
Martin Mares [Tue, 2 Mar 2004 15:38:20 +0000 (15:38 +0000)]
When we try to create a temporary file and it already exists (which can happen
if a program with the same PID has crashed at some time in the past), don't
panic and rewrite the file. Should be safe since we're using our own tmp directory
nobody else can access.

20 years agoBumped the version number to 3.1.
Martin Mares [Sun, 29 Feb 2004 19:07:39 +0000 (19:07 +0000)]
Bumped the version number to 3.1.

20 years agoAdded REV_COMPARE(x,y) which is equivalent to COMPARE(y,x), but it's
Martin Mares [Sat, 28 Feb 2004 11:03:42 +0000 (11:03 +0000)]
Added REV_COMPARE(x,y) which is equivalent to COMPARE(y,x), but it's
better readable to use both of them instead of swapping the arguments.

20 years ago`buckettools -c' now uses fastbufs for output.
Martin Mares [Sat, 28 Feb 2004 10:55:05 +0000 (10:55 +0000)]
`buckettools -c' now uses fastbufs for output.

20 years agoHopefully finally sorted out the "http://www.xyz.cz?param" mess. The true
Martin Mares [Sat, 28 Feb 2004 10:49:48 +0000 (10:49 +0000)]
Hopefully finally sorted out the "http://www.xyz.cz?param" mess. The true
semantics turned out to be "http://www.xyz.cz/?param" and most web servers
really require "GET /?param".

I've changed the normalization rules to add the leading slash if needed
which also solves the relative URL problem I mentioned in the comments.

However, this means that the SEMANTICS OF NORMALIZED URL'S HAS CHANGED
and gatherer databases with URL's in the "http://www.xyz.cz?param" form
are now INVALID. I'm going to delete all such URL's from our gatherer now.

20 years agoHmmm, COMPILE_ASSERT is a better name.
Martin Mares [Fri, 27 Feb 2004 15:36:38 +0000 (15:36 +0000)]
Hmmm, COMPILE_ASSERT is a better name.

20 years agoAdded CPP_ASSERT.
Martin Mares [Fri, 27 Feb 2004 15:32:59 +0000 (15:32 +0000)]
Added CPP_ASSERT.

20 years agoForgot to add this one.
Martin Mares [Wed, 25 Feb 2004 20:30:50 +0000 (20:30 +0000)]
Forgot to add this one.

20 years agoCleaned up the ancient chartype functions: kicked out accent tables (which
Martin Mares [Wed, 25 Feb 2004 20:29:01 +0000 (20:29 +0000)]
Cleaned up the ancient chartype functions: kicked out accent tables (which
were not exported anyway), added conversion to lowercase.

20 years agoRemoved hopefully all implicit dependencies on file type numbering.
Martin Mares [Wed, 25 Feb 2004 19:44:32 +0000 (19:44 +0000)]
Removed hopefully all implicit dependencies on file type numbering.

Also, CONFIG_LANG without CONFIG_FILETYPE works if you define your
own CA_GET_FILE_LANG macro.

20 years agoCF_USAGE_TAB can be used to insert more tabs to the default help message.
Martin Mares [Tue, 24 Feb 2004 23:08:49 +0000 (23:08 +0000)]
CF_USAGE_TAB can be used to insert more tabs to the default help message.

20 years agoAdded mp_strdup().
Martin Mares [Tue, 24 Feb 2004 23:08:31 +0000 (23:08 +0000)]
Added mp_strdup().

20 years agoBlank lines are considered separators, not terminators of buckets.
Martin Mares [Tue, 24 Feb 2004 18:36:23 +0000 (18:36 +0000)]
Blank lines are considered separators, not terminators of buckets.
Hence extraneous blank lines between buckets and trailing blank lines
after the last buckets are all ignored.

20 years ago"http://hell.org?xyz" really is a valid URL. Also checked wrt. the current
Martin Mares [Tue, 24 Feb 2004 18:22:14 +0000 (18:22 +0000)]
"http://hell.org?xyz" really is a valid URL. Also checked wrt. the current
RFC 2396 and added several comments about where do we differ.

20 years agoMoved struct card_prints to index.h, because it's used outside the indexer.
Martin Mares [Wed, 18 Feb 2004 12:20:26 +0000 (12:20 +0000)]
Moved struct card_prints to index.h, because it's used outside the indexer.

Added CARD_FLAG_OVERRIDEN.

20 years agoDeclare variables for library names in the top-level Makefile to avoid
Martin Mares [Mon, 16 Feb 2004 16:07:11 +0000 (16:07 +0000)]
Declare variables for library names in the top-level Makefile to avoid
forward references.

20 years agoIntroduced COMPARE macro for use in sorter callbacks.
Martin Mares [Sun, 15 Feb 2004 18:25:25 +0000 (18:25 +0000)]
Introduced COMPARE macro for use in sorter callbacks.

20 years agoDon't forget to load config file.
Martin Mares [Sun, 15 Feb 2004 17:36:42 +0000 (17:36 +0000)]
Don't forget to load config file.

20 years agoAdded testing routines for URL key calculator.
Martin Mares [Sun, 15 Feb 2004 17:31:42 +0000 (17:31 +0000)]
Added testing routines for URL key calculator.

20 years agoSupply default element swapping macro.
Martin Mares [Tue, 10 Feb 2004 18:18:12 +0000 (18:18 +0000)]
Supply default element swapping macro.

20 years agoAny non-zero value enables an option. (Allows CONFIG_NUM_CONTEXTS to be
Martin Mares [Thu, 5 Feb 2004 20:27:23 +0000 (20:27 +0000)]
Any non-zero value enables an option. (Allows CONFIG_NUM_CONTEXTS to be
recognized as a switch.)

20 years agoNested conditionals were processed incorrectly. Also, the "#" at the start
Martin Mares [Thu, 5 Feb 2004 19:37:25 +0000 (19:37 +0000)]
Nested conditionals were processed incorrectly. Also, the "#" at the start
of directives is now mandatory.

20 years agoAdded STRINGIFY macro.
Martin Mares [Thu, 29 Jan 2004 21:18:21 +0000 (21:18 +0000)]
Added STRINGIFY macro.

20 years agoIf `-S' or `-C' is given after some other options (i.e., the config file
Martin Mares [Sun, 25 Jan 2004 12:45:00 +0000 (12:45 +0000)]
If `-S' or `-C' is given after some other options (i.e., the config file
has been already loaded automatically), complain loudly.

Also improved the error message for invalid `-S'.

Robert, could you please check this?

20 years agoIP access lists now accept both netmasks and prefix lengths.
Martin Mares [Fri, 23 Jan 2004 17:16:32 +0000 (17:16 +0000)]
IP access lists now accept both netmasks and prefix lengths.

20 years agoHours have 3600 seconds, days have 1440 minutes :)
Martin Mares [Thu, 22 Jan 2004 20:00:49 +0000 (20:00 +0000)]
Hours have 3600 seconds, days have 1440 minutes :)

20 years agoAdded "hour" and "day" units.
Martin Mares [Thu, 22 Jan 2004 17:59:38 +0000 (17:59 +0000)]
Added "hour" and "day" units.

20 years agoMD5 didn't work on big-endian machines, because it tested the wrong
Martin Mares [Thu, 22 Jan 2004 15:31:46 +0000 (15:31 +0000)]
MD5 didn't work on big-endian machines, because it tested the wrong
endianity switch.

20 years agoUse int instead of pid_t. At the first glance, this looks like a step backward,
Martin Mares [Thu, 22 Jan 2004 11:21:36 +0000 (11:21 +0000)]
Use int instead of pid_t. At the first glance, this looks like a step backward,
but since we use the variable for printing with a "%d" format-string anyway
and there is no way how to get the right format string for pid_t, it's better
this way.

20 years agotypo fixed
Robert Spalek [Mon, 19 Jan 2004 10:32:37 +0000 (10:32 +0000)]
typo fixed

20 years agoReverted the locking back to original unsafe (but in a harmless way) version,
Martin Mares [Sat, 17 Jan 2004 23:20:02 +0000 (23:20 +0000)]
Reverted the locking back to original unsafe (but in a harmless way) version,
because write locking of files opened for reading is not permitted.

20 years agoReplacing of buckets during shakedown didn't work.
Martin Mares [Fri, 16 Jan 2004 17:01:58 +0000 (17:01 +0000)]
Replacing of buckets during shakedown didn't work.

20 years agoUse obuck_bucket_size() whereever possible.
Martin Mares [Fri, 16 Jan 2004 16:38:08 +0000 (16:38 +0000)]
Use obuck_bucket_size() whereever possible.

20 years agoAdd obuck_bucket_size().
Martin Mares [Fri, 16 Jan 2004 16:37:53 +0000 (16:37 +0000)]
Add obuck_bucket_size().

20 years agoUse brewind() where appropriate.
Martin Mares [Fri, 16 Jan 2004 14:16:22 +0000 (14:16 +0000)]
Use brewind() where appropriate.

20 years agoThe combination of bflush() with bsetpos() to the beginning is a very
Martin Mares [Fri, 16 Jan 2004 14:16:00 +0000 (14:16 +0000)]
The combination of bflush() with bsetpos() to the beginning is a very
common idiom. Make it a library function brewind().

20 years agoobuck_predict_last_oid() can be made safe easily.
Martin Mares [Sun, 11 Jan 2004 19:09:22 +0000 (19:09 +0000)]
obuck_predict_last_oid() can be made safe easily.

20 years agoRewritten shake down of bucket file.
Martin Mares [Sun, 11 Jan 2004 19:03:21 +0000 (19:03 +0000)]
Rewritten shake down of bucket file.

  o  Replaced read and write buffers by a single shared buffer.
     This should be somewhat faster (with the same size of memory invested
     to buffers).
  o  If ShakeSecurity is set to 2, shaking down should be reliable under
     all circumstances, including server reboots and broken bucket files.
     Buckettool -F still needs to be run after a failed shakedown and
     oid's need to be synchronized with the outside world, but no buckets
     will be lost (only some of them may be duplicated).
  o  The callback function (`the kibitz') is now allowed not only to decide
     which buckets will be kept, but also to alter contents of the buckets
     provided that it won't enlarge the bucket.

I tried to be very careful and tested the new routine thoroughly, but since
it's a pretty critical place, I would be very happy if somebody checks it
independently.

20 years agoDon't create large bit arrays on stack. (The default stack limit on Linux is 2MB.)
Martin Mares [Sun, 11 Jan 2004 00:07:54 +0000 (00:07 +0000)]
Don't create large bit arrays on stack. (The default stack limit on Linux is 2MB.)

20 years agoGLUE_ again.
Martin Mares [Sat, 10 Jan 2004 13:46:43 +0000 (13:46 +0000)]
GLUE_ again.

20 years agoUse GLUE_ instead of HASH_GLUE.
Martin Mares [Sat, 10 Jan 2004 13:44:38 +0000 (13:44 +0000)]
Use GLUE_ instead of HASH_GLUE.

20 years agoAdded GLUE and GLUE_ macros.
Martin Mares [Sat, 10 Jan 2004 13:44:14 +0000 (13:44 +0000)]
Added GLUE and GLUE_ macros.

I originally wanted to use them in the new pre-sorter and didn't need them
afterwards, but they are useful anyway.

20 years agoWhen pre-sorting a regular file, use lib/arraysort.h on an array of items
Martin Mares [Sat, 10 Jan 2004 13:41:09 +0000 (13:41 +0000)]
When pre-sorting a regular file, use lib/arraysort.h on an array of items
instead of the default merge-sort type algorithm working with linked lists.

This is much faster -- e.g., the sorting in shep-export on the current
Sherlock3 database now takes 54 sec instead of 669 :-)

However, to accomplish this I had to change two invariants:

  (1) SORT_REGULAR now means not only that the input has regular structure,
      but also that each item is reasonably small (i.e., we can use
      sorting by exchanging in place).

  (2) If SORT_PRESORT is enabled, the comparison function can be called
      with both keys equal. This trips ASSERT's on various place which
      originally helped a lot during debugging, so I decided to add
      a SORT_UNIQUE switch which in DEBUG mode causes the sorter to
      ensure that all keys are distinct, so we can remove the ASSERT's.

As both the Shepherd and the Indexer now rely heavily on sorting, it might
be worth a try to optimize the sorter even further, maybe by utilizing
polyphase sorting or something like that, the run sizes really seem to be
distributed unevenly many times.

20 years agoUse HASH_USE_POOL for configuration space allocations.
Martin Mares [Sat, 10 Jan 2004 12:43:54 +0000 (12:43 +0000)]
Use HASH_USE_POOL for configuration space allocations.

20 years agoAdded HASH_AUTO_POOL option.
Martin Mares [Sat, 10 Jan 2004 12:41:52 +0000 (12:41 +0000)]
Added HASH_AUTO_POOL option.

20 years agoDo not print "[]".
Tomas Valla [Tue, 23 Dec 2003 18:41:22 +0000 (18:41 +0000)]
Do not print "[]".

20 years agoAllow modules to change the log title, second attempt.
Tomas Valla [Tue, 23 Dec 2003 00:18:53 +0000 (00:18 +0000)]
Allow modules to change the log title, second attempt.

20 years agoOther modules shoud be able to modify the log title.
Tomas Valla [Mon, 22 Dec 2003 19:29:39 +0000 (19:29 +0000)]
Other modules shoud be able to modify the log title.

20 years agoAnother debugging switch: dump core on fatal errors.
Martin Mares [Mon, 15 Dec 2003 19:20:47 +0000 (19:20 +0000)]
Another debugging switch: dump core on fatal errors.

20 years agoThe debugging memory allocator is now enabled by DEBUG_DMALLOC instead
Martin Mares [Mon, 15 Dec 2003 19:20:18 +0000 (19:20 +0000)]
The debugging memory allocator is now enabled by DEBUG_DMALLOC instead
of just "DMALLOC".

20 years agodeleted comment about fprecog
Robert Spalek [Thu, 11 Dec 2003 11:55:45 +0000 (11:55 +0000)]
deleted comment about fprecog

20 years agoImproved and cleaned up the bucket library. The original "single operation
Martin Mares [Sun, 7 Dec 2003 14:23:58 +0000 (14:23 +0000)]
Improved and cleaned up the bucket library. The original "single operation
pending per process" invariant was no longer feasible (and it caused several
problems in Shepherd).

Reading and writing of buckets now uses dynamically allocated fastbufs and
there can be any number of readers at any time, but only a single writer
(otherwise a deadlock would occur). Read streams are seekable, write streams
at least btell()-able.

Also removed the omnipresent global variables for start of current bucket
etc., each part (Find, Slurp, Create, Shakedown, ...) has its own state
variables.

Added some more sanity checks.

20 years agoindex version reverted to v2.6 subversion 2, because it is compatible now
Robert Spalek [Wed, 3 Dec 2003 13:04:36 +0000 (13:04 +0000)]
index version reverted to v2.6 subversion 2, because it is compatible now

20 years agoindex version incremented due to lexmap.h change
Robert Spalek [Tue, 2 Dec 2003 14:08:30 +0000 (14:08 +0000)]
index version incremented due to lexmap.h change
anyway, we wanted to change 26 -> 30 some day

20 years agoOne more item type: u64.
Martin Mares [Sat, 29 Nov 2003 11:47:02 +0000 (11:47 +0000)]
One more item type: u64.

20 years agoTwo improvements to the configuration language:
Martin Mares [Sat, 29 Nov 2003 11:25:09 +0000 (11:25 +0000)]
Two improvements to the configuration language:

o  Floating point item type introduced.
o  Both integer and floating point numbers can be suffixed with a unit.

Also, I've exported parsing of integers and doubles for the convenience
of CT_FUNCTION callbacks.

20 years agono need to cut www-prefix twice
Robert Spalek [Wed, 26 Nov 2003 17:30:58 +0000 (17:30 +0000)]
no need to cut www-prefix twice

20 years agodo not replace target url-equiv
Robert Spalek [Tue, 25 Nov 2003 16:11:57 +0000 (16:11 +0000)]
do not replace target url-equiv

20 years agoReplaced obuck_fetch_end() by bclose() (which is a nop as obuck_fetch_end was :) ).
Martin Mares [Sat, 22 Nov 2003 18:22:34 +0000 (18:22 +0000)]
Replaced obuck_fetch_end() by bclose() (which is a nop as obuck_fetch_end was :) ).

20 years agoAdded very simple functions for emulating a fastbuf stream over a static
Martin Mares [Sat, 22 Nov 2003 18:21:22 +0000 (18:21 +0000)]
Added very simple functions for emulating a fastbuf stream over a static
buffer. The struct fastbuf is allocated statically as well to make everything
as simple and as fast as possible.

20 years ago1. db/catalog.gz ---> db/catalog
Robert Spalek [Mon, 17 Nov 2003 13:09:44 +0000 (13:09 +0000)]
1. db/catalog.gz ---> db/catalog
+ it is not sent to oook and feedback-cat via pipes, but it is read by them as a file
+ it is read in 2 passes and the URL's are identified in the 1st phase (catalog.c)

2. URL fingerprinting always uses cf/url-equiv, even in the indexer

20 years agoA better function for hashing integers (the old multiplier was completely
Martin Mares [Sat, 15 Nov 2003 10:41:41 +0000 (10:41 +0000)]
A better function for hashing integers (the old multiplier was completely
bogus as it didn't fit in a 32-bit integer) and also a new function
for hashing pointers.

20 years agoI decided to turn off cf/url-equiv for indexation. however, after the indexer
Robert Spalek [Thu, 13 Nov 2003 10:43:07 +0000 (10:43 +0000)]
I decided to turn off cf/url-equiv for indexation.  however, after the indexer
is run on regular sherlock5, we cannot manually delete this file for indexer
and restore for gatherd.  so I am creating a new parameter that controls
loading this prefix table.

21 years agoAdded some headers to avoid confusion of our own developers ;)
Tomas Valla [Thu, 6 Nov 2003 16:53:58 +0000 (16:53 +0000)]
Added some headers to avoid confusion of our own developers ;)