database capability

  • Thread starter Arijit Upadhyay

Arijit Upadhyay

I would like to ask a feasibility and database capacity question. This
would be a CD-based application, developed with Flash and an Access
database, to catalogue over 600 PDFs containing about 8,000 pages. The
application needs to do a full-text search, so I am planning to keep the
text content of each PDF in a memo field. Each row would contain around
40 KB of data (measured by saving the text in Notepad), so about 25 MB of
data in memo fields in total. There are just two fields: filenumber and
filecontent.

Can Access handle this?

regards
Arijit
 
Jeff Boyce

Arijit

Access has a limit of 2K per row in each table. 40K of data in one row?!
That doesn't sound like a particularly normalized data structure, unless
there are multiple memo fields.

Access needs to create and write to a temporary locking file (.ldb) next to
the database, and a CD-based application wouldn't allow this.

My first reaction is that these issues may be "show stoppers", but I defer
to other responders who have more experience with this.
 
Brian

He said that there are only two fields, and one of them is a memo field.
The 2K limit doesn't include memo fields, so he's gonna be OK in absolute
size terms. I can't comment on the other aspects of what he's trying to do,
except that there must be more suitable software than Access for searching
and viewing PDF files on a CD.
 
Arijit Upadhyay

Exactly, there are just two fields: one is a text field containing the PDF
file reference number, and the other is a memo field containing about
30-40 KB of raw text. I measured this by saving a PDF file as raw text and
checking the file size.

As for the other issues raised, I couldn't find a better solution. The
files need to be PDFs, and searchable image PDFs at that, since the
documents are pretty old. I just save the text content of each file in the
Access database and allow searching through an interface developed in
Flash. See a preview of the application here:
http://www.asiusa.com/symcd/flash/. The full-text search is not there yet,
but the rest works well from CD and has been tested. I am now planning to
integrate the full-text search.
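
The two-field layout and a substring search over it could look roughly like
the sketch below. It uses Python's built-in sqlite3 module purely as a
stand-in for Access so that it runs anywhere; the table name `documents`,
the sample rows, and the `search` helper are assumptions made for
illustration. Only the column names filenumber and filecontent come from
this thread.

```python
import sqlite3

# Stand-in for the Access table: SQLite is used here only so the sketch
# runs anywhere. The column names come from the thread; the table name
# "documents" is hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE documents (filenumber TEXT PRIMARY KEY, filecontent TEXT)"
)

# Hypothetical sample rows; real rows would hold the text saved from each PDF.
conn.executemany(
    "INSERT INTO documents (filenumber, filecontent) VALUES (?, ?)",
    [
        ("F001", "Minutes of the annual general meeting held in Calcutta."),
        ("F002", "Report on the restoration of old manuscripts."),
        ("F003", "Annual accounts and the auditor's report."),
    ],
)


def search(term):
    """Return the file numbers whose text contains `term` (substring match)."""
    rows = conn.execute(
        "SELECT filenumber FROM documents "
        "WHERE filecontent LIKE ? ORDER BY filenumber",
        ("%" + term + "%",),  # parameterized, so the term is never spliced into SQL
    )
    return [row[0] for row in rows]


print(search("report"))
```

A LIKE search of this kind reads every row, which should be tolerable for a
few hundred documents but is not an indexed full-text search. Note also that
queries written inside Access itself use `*` rather than `%` as the
wildcard.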

As for the database being on the CD and the locking file, that is not a
problem, because Flash saves the file temporarily in the user's temp
folder, and the locking file is created there too, not on the CD itself.
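
The copy-to-temp idea can be sketched in a few lines. This is not how Flash
does it internally (that is not shown in this thread); it is only an
illustration in Python, and the function name `copy_to_temp`, the folder
prefix, and the file name `catalogue.mdb` are all hypothetical.

```python
import shutil
import tempfile
from pathlib import Path


def copy_to_temp(source_path):
    """Copy a database file from read-only media into a fresh temp folder.

    Returns the path of the copy. The database engine can then create its
    lock file next to that copy instead of on the CD.
    """
    source = Path(source_path)
    temp_dir = Path(tempfile.mkdtemp(prefix="catalogue_"))
    target = temp_dir / source.name
    # copyfile copies the contents only, so the read-only permission bits
    # of a file on CD are not carried over to the copy.
    shutil.copyfile(source, target)
    return target


if __name__ == "__main__":
    # Hypothetical demo: a throwaway file stands in for the .mdb on the CD.
    demo_source = Path(tempfile.mkdtemp()) / "catalogue.mdb"
    demo_source.write_bytes(b"placeholder")
    print(copy_to_temp(demo_source))
```

The cost of this approach is one full copy of the database at start-up,
which is small for a file of a few tens of megabytes.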

If anybody could suggest a better procedure I would be grateful. Until then
it seems I have to use Access, and I was trying to figure out whether it can
handle this volume before I go on to enter the text of the 600-odd
documents into the database.

regards
arijit
 
Brian

Hello again Arijit. I'm not really qualified to comment further on your
overall approach, since I've never done anything like this, but you are OK
in size terms. A memo field can hold up to 1 GB when it is filled
programmatically (typing into it through the Access interface is capped at
65,535 characters), and an Access mdb can be up to 2 GB, so no problem with
your data.
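
For anyone who wants to check the arithmetic, here is a quick sketch using
only the figures quoted in this thread. Treating 1 KB as 1024 bytes and
taking 40 KB as the per-document size are assumptions, and the variable
names are made up for the example.

```python
# Figures quoted in this thread; "KB" is taken as 1024 bytes (an assumption).
DOCUMENTS = 600
KB_PER_DOCUMENT = 40              # upper estimate from the original post
MEMO_LIMIT_BYTES = 1 * 1024**3    # 1 GB per memo field, as stated above
MDB_LIMIT_BYTES = 2 * 1024**3     # 2 GB per .mdb file, as stated above

row_bytes = KB_PER_DOCUMENT * 1024
total_bytes = DOCUMENTS * row_bytes

print(f"per row: {row_bytes} bytes")
print(f"total:   {total_bytes / 1024**2:.1f} MB")
print(f"share of the 2 GB file limit: {total_bytes / MDB_LIMIT_BYTES:.2%}")
```

Even if storage overhead doubled the total, it would still be a small
fraction of the file size limit.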
 