Regex for international dates and company entity types

  • 状态: Pending
  • 奖金: $720
  • 参赛作品已收到: 24

竞赛简介

I stress that this project involves as much RESEARCH as code-writing. The different formatting used internationally is vital to get right first; the coding that follows is easy.

This project is to write a script containing a series of regex scans that will extract metadata from plain documents. The purpose is to scan a big-data archive of document and retrieve specific meta data such as dates and company name.

Script execution speed is essential. Ideally the script is written in Perl, but python or php is okay.

This must work with international meta data! Please do not expect your entry to win without this prerequisite. The plain text source will be in UTF8 to cope with international characters.

The script can return in any practical format, as long as the format can be imported:-
E.g. Json, Serialized array

The document describes script sections - which means create one include file that can be included in a different project, and demonstrate how to call the functions. So there would be at least 4 main functions.

To make good use of your time, I'd suggest you first research the international dates and international company types, and send me privately a document. Obviously don't include what is already in Wiki. I can then give you feedback on whether that is comprehensive. Once you have that feedback it becomes worthwhile to code.

Good luck!

推荐技能

此竞赛的顶尖作品

查看更多参赛作品

公告栏

  • kabapy
    kabapy
    • 6 天 之前

    #46 the script is ready for you to review

    • 6 天 之前
  • kabapy
    kabapy
    • 1 周 之前

    #46 the results of the 4 tasks in 4 excel files. please contact me on the chat if you have any comments or modifications

    • 1 周 之前
  • kabapy
    kabapy
    • 2 周 之前

    #46 Finished The 4 Tasks, Please Review

    • 2 周 之前
  • NabeelShaikhh
    NabeelShaikhh
    • 3 周 之前

    Hi Sir Please let me know
    You need on Web or Local ?

    • 3 周 之前
    1. sunnyguptahotels
      竞赛举办者
      • 3 周 之前

      Local

      • 3 周 之前
  • kabapy
    kabapy
    • 3 周 之前

    entry #43

    • 3 周 之前
  • naveendurai
    naveendurai
    • 3 周 之前

    Can you check my entry #42 ? See whether I am going in right direction.

    • 3 周 之前
    1. sunnyguptahotels
      竞赛举办者
      • 3 周 之前

      Please never participate in my contests.

      • 3 周 之前
    2. naveendurai
      naveendurai
      • 3 周 之前

      Why? What did I do ?

      • 3 周 之前
  • HDevCrea
    HDevCrea
    • 4 周 之前

    We can't post our entries if the contest is not sealed. Everyone will see it.
    #sealed

    • 4 周 之前
    1. HDevCrea
      HDevCrea
      • 4 周 之前

      I already did. It was entry #26.

      • 4 周 之前
    2. sunnyguptahotels
      竞赛举办者
      • 3 周 之前

      Somehow it has gone. Can you skype me - sunnygupta1000

      • 3 周 之前
  • StromlightTech
    StromlightTech
    • 1 个月 之前

    Entries are no way related to contest.. lol

    • 1 个月 之前
    1. sunnyguptahotels
      竞赛举办者
      • 1 个月 之前

      You might prefer this project. no one has even come close.

      • 1 个月 之前
  • HDevCrea
    HDevCrea
    • 1 个月 之前

    Please check Entry #26 so I can send you the first script.

    • 1 个月 之前
  • ikobir
    ikobir
    • 1 个月 之前

    Sir, if you need any change tell me. thanks

    • 1 个月 之前
  • ikobir
    ikobir
    • 1 个月 之前

    Sir,Kindly check my entry#23,#24,#25. and if you need any change tell me. thanks

    • 1 个月 之前
  • sunnyguptahotels
    竞赛举办者
    • 1 个月 之前

    Unfortunately I can't work with Node\JS\Java - its not the language but the coding support that I can't do.

    • 1 个月 之前
    1. KishuPro
      KishuPro
      • 1 个月 之前

      Hmm I understand, thanks for the response.

      • 1 个月 之前
  • sunnyguptahotels
    竞赛举办者
    • 1 个月 之前

    Has anyone started or should I invite others?

    • 1 个月 之前
    1. KishuPro
      KishuPro
      • 1 个月 之前

      "Script execution speed is essential" - Do you think Node/JS/Typescript or Java would be too slow/out of scope for your purposes?

      • 1 个月 之前
  • ethmain
    ethmain
    • 1 个月 之前

    Hello, what do I do when I am done? I don't want to put it in a contest because people can take my work and do more with it..

    • 1 个月 之前
    1. ethmain
      ethmain
      • 1 个月 之前

      is it possible that you attach a sample of the business document? so we can have a better idea on what you will need?

      • 1 个月 之前
    2. ArnabGuchait
      ArnabGuchait
      • 1 个月 之前

      Can we place some type of watermark/authentication proof/set a final (non-editable) copy of our work ?

      • 1 个月 之前
  • sssalim018152347
    sssalim018152347
    • 1 个月 之前

    please check my entry #14

    • 1 个月 之前
    1. sunnyguptahotels
      竞赛举办者
      • 1 个月 之前

      How is that possibily an entry. My suggestion is not to waste your time.

      • 1 个月 之前
  • sunnyguptahotels
    竞赛举办者
    • 1 个月 之前

    Please do not spam my contests!

    • 1 个月 之前
  • RajakScripts
    RajakScripts
    • 1 个月 之前

    Now, I assume you actually want the regex to process any text content on a document, NOT the literal metadata from a file (of a document) itself, correct?

    • 1 个月 之前
  • RajakScripts
    RajakScripts
    • 1 个月 之前

    Hi, could you please attach some docs to firstly reveal its metadata so I will have a better picture before doing the regex?

    • 1 个月 之前
    1. sunnyguptahotels
      竞赛举办者
      • 1 个月 之前

      Just look for any business contracts. E.g. https://www.printablecontracts.com/General_Contracting.php

      • 1 个月 之前

显示更多评论

如何以竞赛开始

  • 发起您的竞赛

    发起您的竞赛 快速且简单

  • 获取大量参赛作品

    获取大量参赛作品 来自世界各地

  • 悬赏最佳参赛作品

    悬赏最佳参赛作品 下载文件——简单!

立即发布竞赛 或者立即加入我们!