Minidx's ExtractText component and VB2005/VC2005 demo downloads available (2008/02/10)

Minidx's ExtractText component is a simple component that extracts just the text from any file for which an IFilter is installed. It is available as a C++ COM component and as a C# .NET library. With it, you can add text extraction to your application easily and with little code.
※ IFilter providers are used by Microsoft Index Server, Microsoft SharePoint Server and Microsoft Desktop Search to extract the indexable text from a file. By using the same interfaces, it is possible to extract just the text from almost any file type, including Microsoft Word .doc, Excel .xls and Adobe PDF files.

Here is the component DLL and demo source for VB2005 (Tutorial)
Here is the component DLL and demo source for VC2005 (Tutorial)


If you are interested and have any questions, please don't hesitate to leave a comment or ask for help here. Enjoy!

 
Minidx.RC1.1 released (2007/10/20)

       -- Fixed a bug when viewing original data (10.20)

                                                                    >>>download 

 
Minidx.RC1.0 released (2007/08/18)

        -- Changed the data format (08.12)
        -- Implemented compression (08.12)
        -- Fixed an error where double-clicking a file did not show its content (08.01)
        -- Fixed an error where document items were not displayed correctly (08.01)
        -- Fixed a crash that occurred when deleting files consecutively (07.31)
        -- Fixed an error that occurred when the installation path contained Chinese characters (07.29)
        -- Added a confirmation prompt before removing the data folder
        -- Fixed various other bugs

                                                                       >>>download
 
 
Minidx is free software!
  Minidx is a professional file management system. It has:

A fast full-text search engine that finds the document you need on the first try
Its own storage system, which keeps important files under tight security

Handles terabyte-scale data, with the amount of data having little effect on the system
Uses IFilter to get the text directly, so there is no need to install Office or other applications
Based on Unicode, so input and output work correctly in multiple languages

Syntax highlighting for program source code, which makes reading various documents more convenient

Highlighting of search results for the user's convenience

Fuzzy queries and synonym recognition. For example, when you want to search for “where”,
    entering “whe” shows all words beginning with "whe", including "when", "where", etc.

Its own web server, which makes it easy to share managed documents over the internet or a local network
Precise search by file creation/modification/access time, file title, file path or file content
User-defined settings for filtering out unwanted words
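
The prefix matching and word filtering described in the list above can be illustrated with a short sketch. This is written in Python purely for illustration and is not Minidx's actual implementation (the engine itself is written in C/C++). The names `prefix_matches` and `filter_stop_words`, and the sample vocabulary, are assumptions made up for this example.

```python
from bisect import bisect_left


def prefix_matches(sorted_words, prefix):
    """Return every word in an already-sorted list that starts with prefix."""
    # Binary search finds the first word that is >= prefix.
    start = bisect_left(sorted_words, prefix)
    matches = []
    for word in sorted_words[start:]:
        # In a sorted list, all words sharing the prefix are adjacent.
        if not word.startswith(prefix):
            break
        matches.append(word)
    return matches


def filter_stop_words(tokens, stop_words):
    """Drop tokens that the user has marked as unwanted (case-insensitive)."""
    return [t for t in tokens if t.lower() not in stop_words]


# Hypothetical vocabulary, sorted so that binary search applies.
vocabulary = sorted(["what", "when", "where", "which", "who"])
print(prefix_matches(vocabulary, "whe"))
print(filter_stop_words(["The", "quick", "search"], {"the", "a"}))
```

Keeping the vocabulary sorted means a prefix lookup only scans the words that actually match, which is one plausible way to keep such queries fast as the word list grows.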

 

Advantages of the Minidx full-text search engine:

It is very small, written in only about twenty thousand lines of code

Written in standard C/C++, so it can run in any OS environment that supports the language

Does not need much memory and runs normally even on low-end hardware

Integrates easily into any system. Adding just a few lines of code is enough for the system to run the full-text search engine

High speed: even with millions of records, results are returned in milliseconds

Search by word, phrase or sentence

“And” and “Or” queries, and combinations of the two, are supported

Because it is based on Unicode, strings in multiple languages can be mixed in one search

Precise search: even a punctuation mark can be located
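
To make the “And” / “Or” queries above concrete, here is a minimal sketch of how such queries are commonly answered with an inverted index (a map from each word to the set of documents containing it). It is written in Python for illustration only and assumes nothing about Minidx's internal data structures. The function names and sample documents are hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lower-cased word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search_and(index, terms):
    """Documents that contain every term (set intersection)."""
    postings = [index.get(t.lower(), set()) for t in terms]
    return set.intersection(*postings) if postings else set()


def search_or(index, terms):
    """Documents that contain at least one term (set union)."""
    return set().union(*(index.get(t.lower(), set()) for t in terms))


# Hypothetical sample documents.
documents = {
    1: "full text search engine",
    2: "file management system",
    3: "text file search",
}
index = build_index(documents)

print(search_and(index, ["text", "search"]))
print(search_or(index, ["engine", "system"]))
# Combining both: documents with "file" AND ("search" OR "system").
print(search_and(index, ["file"]) & search_or(index, ["search", "system"]))
```

Because each lookup is a dictionary access followed by set operations, query time depends mainly on the number of matching documents rather than on the total size of the collection.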

 

 

 
Background: how Minidx was born
 There is a lot of desktop search software, but most of it targets every document on the PC. Users can only narrow a search to certain kinds of documents and cannot manage their important documents according to their own requirements.

Because of my work, I change computers frequently. Without effective document management software, some documents get lost, and I do not realize it until I need them. The same thing happens at home.

A file management tool should not only manage documents effectively but also help users find related data, let them classify documents in their own way, and provide fast search.

My computer holds documents in several languages: some in Simplified Chinese, some in Traditional Chinese, some in Japanese and some in English, and the operating system language also varies. I therefore did not want the document management software to be affected by the OS language environment.

To solve these problems, I decided to develop my own search engine and integrate it into a document management system. That is how Minidx was born. Although it grew out of an individual need, it was designed and developed with large-scale data processing in mind, so Minidx can easily handle enterprise applications.

 
© 2009 Minidx File Manager | Minidx Full-text Search Engine