mikem
I'm a DokuWiki fan and have been working with it for 2 years.
But there is one thing that makes me a little bit sad.
It would be great if DokuWiki could index all documents (office, pdf, txt, ...) in its search engine.
Then it would be the perfect system for writing our company documentation and finding all information in one place.
Does anyone have an idea?
Is this topic also important for other users?
Please let me know if there are solutions in progress or planned.
(I experimented with search engines like Sphider, but the weakness is that they run outside DokuWiki: it's not user friendly, and it bypasses the permissions I manage in DokuWiki.)
Have a nice summer!
stinkywinky
A search engine for common office files (or at least pdf) would really be great!
There are some search engines on the market that are based on JavaScript (plus a separate indexer). These could be integrated into DW, I think...
chi
StinkyWinky wrote
There are some search engines on the market that are based on JavaScript (plus a separate indexer). These could be integrated into DW, I think...
Some links to those search engines you mentioned would be really helpful ;-).
stinkywinky
chi
I see. Well, the thing is, none of those search engines could be officially integrated into DW, because none of them is open source.
stinkywinky
My idea was not to integrate it into the DW project itself.
I was thinking more of displaying the search engine within the wiki page using an iframe or a similar technique.
But having it really integrated would of course be more desirable.
mikem
I think the best way to make DokuWiki more searchable is to integrate a powerful search engine or
to extend DokuWiki's original search engine.
This engine should be able to index all common doctypes.
Then I would have the chance to find all my information in my wiki.
Does nobody else want this feature too?
daneel
IMHO there are a lot of people who would very much appreciate this (including me), but I think it's a rather complex thing. I don't know any open-source search engine, and even if I did, I wouldn't be able to integrate it with DW ...
ryan-chappelle
The problems are several: what is a "common doctype"? Does RTF count? LaTeX, as well? Also, what does "searchable" mean? Most likely, extracting text. What do I do with an animated .gif that contains evolving text?
There's much to think about in developing such a feature and having it incorporated into DW, which would make the devs run into all sorts of problems, such as 1. grievously reinventing the wheel (so as to avoid dependency on external programs to, e.g., extract a CAD file's text for search) and 2. risking patent issues in order to search any document format that is not *completely* open.
IMHO, this feature is better left for an external program or system to handle. DW's greatest advantage is that its data is stored as plaintext and is easily indexable/searchable, but that does not mean the same premises hold for media.
The entire idea of uploading media files is that the files are available. If one wants to access and operate on only "part" of the files, such as specifically the text, the best workable option so far is to set up a cron job or similar utility that extracts the media file's text and stores it in a simple file in the wiki's data tree, which DW will then easily integrate.
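To make the cron idea a bit more concrete, here is a minimal Python sketch of the part that decides which media files need (re-)extraction. It is only an illustration under assumptions: the function name `stale_targets`, the `report.pdf` -> `report_pdf.txt` naming scheme, and the extension list are made up for this example, and it assumes the usual layout where media live under one directory tree and pages are `.txt` files under another. The actual text extraction is left out here.

```python
from pathlib import Path


def stale_targets(media_root, pages_root, extensions=(".pdf", ".doc")):
    """Return (media_file, text_page) pairs whose text page is missing or outdated.

    Hypothetical helper: the naming scheme and directory layout are assumptions.
    """
    media_root, pages_root = Path(media_root), Path(pages_root)
    pairs = []
    for media_file in sorted(media_root.rglob("*")):
        if not media_file.is_file():
            continue
        if media_file.suffix.lower() not in extensions:
            continue
        rel = media_file.relative_to(media_root)
        # Report.pdf -> report_pdf.txt: lowercase, extension kept in the name
        # so that report.pdf and report.doc do not overwrite each other.
        page_name = rel.name.lower().replace(".", "_") + ".txt"
        page = pages_root / rel.parent / page_name
        # Re-extract when the page does not exist or the media file is newer.
        if not page.exists() or page.stat().st_mtime < media_file.stat().st_mtime:
            pairs.append((media_file, page))
    return pairs
```

A script run from cron could call this with the wiki's media and pages directories, extract text for each returned pair, and write it to the page path. Comparing modification times keeps repeated runs cheap, since unchanged files are skipped.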
mikem
DokuWiki is a great tool for documentation.
But in my day-to-day work, there are many existing documents that are important
to store and to find.
My idea is not to make all included file types searchable, just the typical ones.
I think that's the one thing that would make DokuWiki unbeatable as a documentation tool.
I'm sure that this feature would make many users happy.
There is one thing I can't estimate: whether, with such a search engine included, DokuWiki would need
a database to find the information fast.
Is that so?
I wonder why this topic isn't more popular.
outcrop
That's a good idea, I need it too.
I'm planning to use third-party tools like catdoc and xpdf to extract the text of documents, then write a virtual file (file path, file name, part of the document's text) to DokuWiki for each document, so that DokuWiki can index, search, and locate the file.
It may work, but it's not the best way.
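A rough Python sketch of that plan, just to show the shape of it. Everything named here is an assumption for illustration: the helper names `extract_text` and `virtual_page`, the 2000-character excerpt limit, and the page layout are invented. It assumes `catdoc FILE` and xpdf's `pdftotext FILE -` are installed and print the extracted text to standard output.

```python
import os
import subprocess

# Assumed command lines; both tools write the extracted text to stdout.
EXTRACTORS = {
    ".pdf": lambda path: ["pdftotext", path, "-"],
    ".doc": lambda path: ["catdoc", path],
}


def extract_text(path):
    """Run the external tool matching the file extension and return its output."""
    suffix = os.path.splitext(path)[1].lower()
    command = EXTRACTORS[suffix](path)  # KeyError for unsupported file types
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout


def virtual_page(media_id, text, max_chars=2000):
    """Build wiki text for one document: a heading, a media link, and an excerpt."""
    name = media_id.rsplit(":", 1)[-1]
    # Collapse whitespace so the excerpt is one compact paragraph.
    excerpt = " ".join(text.split())[:max_chars]
    link = "{{:" + media_id + "|" + name + "}}"
    return "====== " + name + " ======\n\n" + link + "\n\n" + excerpt + "\n"
```

For example, `virtual_page("docs:report.pdf", extract_text("/path/to/report.pdf"))` would give the text to save as a page, so a search hit on the excerpt leads to a page that links to the original file. The extracted text is not escaped here, so any wiki markup inside a document would be rendered as such.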
kay
@outcrop: any progress on that? I could give you a hand with it, if wanted/needed...
mikem
I think this plugin will help me and other interested people.
See more:
http://forum.dokuwiki.org/thread/4865
http://www.dokuwiki.org/plugin:docsearch
Thanks for the time you spent on this.