Polaris
Polaris
Developed by:
Sindh Police IT & Tech Innovation Unit
Deployment: On-Premises (Sindh Police Headquarters)
Table of Contents
1. Project Name & Acronym
2. Purpose and Objectives
3. Project Scope (Sindh Province)
4. Timeline and Milestones
5. Scrum Team Management Plan
6. Required Skill Set
7. Team Composition
8. Tasks Suitable for Outsourcing
9. Training and Tutorial Strategy
10. Documentation Strategy
11. Development Workflow: User Stories, Epics, Issues, Tasks
12. Identified Entities (Data and Domain Model)
13. Concurrency and User Load Validation
14. Licensing and Software Costs (2-Year Breakdown)
15. UI/UX Design Approach (Figma Prototypes & Style)
16. Field Usage Scenario – Video Concept
Back to TOC
[Continue with the detailed sections as previously provided in the full report, with each
section ending with a "Back to TOC" hyperlink.]
Great. I’ll create a comprehensive report for your AI-powered software system tailored for
police officials in Sindh. It will support inference-based natural language queries in English,
Urdu, Pashto, and Sindhi, drawing data from structured PDFs, call records in Excel, and MS
SQL databases. The system will be designed for 100 concurrent users and developed with
Figma and low-code tools for mobile (Android/iOS) and web/desktop admin control.
Deployment will be on-premises using Lenovo SR 650 V3 hardware with under-40B
parameter AI models (with a strong recommendation included).
I'll include detailed sections for purpose, objectives, scope, timeline, team composition, skills,
architecture, Figma images, system flows, licensing, and a video concept of a Pakistani
officer using the app.
I’ll get started and update you soon with a well-structured report.
Key Objectives:
Unified Query Interface: Provide a single platform where officers can query legal
statutes, case records, and call logs in plain language, and get aggregated answers
that draw from all relevant sources.
Multilingual Support: Enable queries in Urdu, Sindhi, Pashto, and English,
ensuring officers are comfortable using their native language. The NLP engine will
handle these languages (modern multilingual models cover languages like Urdu,
Pashto, and Sindhi (GitHub - facebookresearch/XLM: PyTorch original
implementation of Cross-lingual Language Model Pretraining.)).
Inference & Contextual Answers: Go beyond keyword search – the AI will infer
answers. For example, an officer might ask, “Which section of the law applies if a
suspect was found with unlicensed arms?” The system will interpret the query, search
the indexed legal documents, and return the relevant act/section with an explanation.
It can cross-reference the question with the 200 law PDFs to find applicable sections,
and even suggest answers based on precedent or related cases if available.
Fast Access to Large Data: Instantly retrieve information from millions of records.
The system should handle questions like “List previous offenses of person X” by
searching the 0.7 million arrested persons database and 0.6 million case records,
compiling a quick profile. It should also handle temporal queries (e.g., “What calls
did suspect Y make last Friday?” searching the Excel CDRs) and relational queries
across databases (linking persons to cases to call records).
Field Officer Empowerment: Empower officers during patrols or investigations with
on-the-spot information. By simply asking their mobile app, they can verify a
suspect’s background or get guidance on legal procedures, improving efficiency and
reducing reliance on calling back-office staff.
Improved Decision Making: By having relevant data at their fingertips, officers can
make informed decisions rapidly. For instance, knowing the appropriate legal section
to charge a suspect with, or identifying if a suspect is already wanted in another case,
all through a quick query, enhances the quality of police work.
These objectives align with the broader goal of modernizing policing through technology –
much like the recently introduced “Talaash” app in Sindh which provides fast search in
criminal databases (Sindh police launch Talaash app with record of 1.5m criminals) – but
POLARIS will focus on natural language intelligence and a broader set of data (legal
references, telephony data, and case files) for a comprehensive assistant.
Geographical & Organizational Scope: The system will be deployed for Sindh
Police officials across the province (e.g. Karachi, Hyderabad, Sukkur regions). It is
not intended (in this phase) for other provincial police or federal agencies. All data
and use-cases pertain to Sindh’s jurisdiction (e.g., Sindh local laws and records). The
user base is internal police personnel (investigators, station house officers, etc.),
roughly up to 100 concurrent users as a target.
Data Sources In-Scope: The system will integrate the following data specific to
Sindh Police:
o Legal Acts and Sections: ~200 structured PDF documents containing
provincial and federal acts, regulations, and their sections (including tables
and schedules). These likely include the Pakistan Penal Code, Sindh local acts,
criminal procedure code, etc., as applied in Sindh. POLARIS will parse and
index these documents so that officers can query any section by keyword or
ask questions like “What is the penalty under section 324?” and get the answer
with text from the law.
o Call Detail Records (CDRs): For each case, up to 10 Excel files of telephone
call logs may be attached (e.g., suspects’ phone records). The system will
support querying these, e.g., “Did suspect A call suspect B in March 2025?”
by analyzing the CDR spreadsheets.
o Relational Databases (MS SQL): Large existing police databases:
An Arrested Persons Database (~0.7 million records) with details of
individuals arrested or under investigation (names, CNICs/IDs,
fingerprints maybe, etc.).
A Cases Database (~0.6 million records) tracking FIRs (First
Information Reports), case statuses, crime types, involved persons, etc.
Other Police Records (~4 million records) – this includes ancillary
data such as Crime Record Office (CRO) records, wanted persons lists,
vehicle registrations (possibly from Excise department), stolen vehicle
lists, criminal trial statuses, and any other digitized records. These
match the scale of data in apps like Talaash, which holds about 1.5
million criminal records and related data (Sindh police launch Talaash
app with record of 1.5m criminals). POLARIS will leverage similar
data available in Sindh’s systems.
o User Directory: (If needed) Data about police officers using the system (for
authentication and auditing) – likely integrated with an existing HR or
authentication system (e.g., Active Directory or a simple user DB). This
ensures only authorized personnel access the app.
Language Support: The system will handle content and queries in the four languages
prevalent in Sindh’s policing context:
o Urdu – the national language, commonly used in police work/reports.
o English – used for official documents and many legal texts.
o Sindhi – regional language for local officers and documents (some Sindh laws
might be in Sindhi or have Sindhi terms).
o Pashto – spoken by a segment of officers or in certain communities within
Sindh (e.g., Pashtun communities in Karachi); also useful if data or queries
involve Pashto (e.g., suspect aliases in Pashto). The NLP subsystem will either
use a multilingual model or translation layer so that a question asked in any of
these languages is understood and answered appropriately (GitHub -
facebookresearch/XLM: PyTorch original implementation of Cross-lingual
Language Model Pretraining.).
On-Premises Deployment: All components will be deployed on premises in
Sindh (likely at the Police IT center or data center in Karachi). The hardware
specified (Lenovo ThinkSystem SR650 V3 server) will be used for hosting the AI
models, database, and application backend. This ensures data privacy (sensitive law
enforcement data remains in-house) and accessibility even if internet connectivity is
limited. The scope thus excludes any cloud-based services for core AI processing; the
solution must run locally on the provided server hardware.
Out of Scope: Anything beyond the above is out of scope for this 3-month project.
For example, citizen-facing features (public complaint logging, etc.) or integration
with other provinces’ data are not included. Also out-of-scope are advanced analytics
like predictive policing or face recognition – POLARIS is strictly a query and
information retrieval assistant at this stage. Additionally, any data not already
digitized (e.g., paper-only records) are not included unless they are part of the
provided PDFs or databases. The focus is on leveraging existing digital data sources
within Sindh Police.
By confining scope to Sindh, the project can tailor the AI and language models to local
dialects and laws, ensuring relevant results. It also aligns with ongoing Sindh Police digital
initiatives (like the e-policing apps) while remaining self-contained for the province’s needs.
Weeks 1-2 (Early May): Initiation & Requirements – Finalize requirements with
stakeholders (Police IT and end-user representatives). Conduct detailed interviews
with a few police officers to capture user stories (e.g., common questions they would
ask). Define the project backlog of features. Also set up the development
environment on the Lenovo SR650 server (install OS, databases, and any required
software).
Weeks 2-3: UI/UX Design in Figma – Create high-fidelity designs for the mobile app
(Android/iOS) and the admin web panel. This includes a style guide (using the blue
and red police palette), screen layouts for query input and results display, navigation
flow, and any error/help screens. By mid-May, a design review with stakeholders will
be done to get feedback and approval on the Figma designs.
Weeks 2-4: Data Preparation – Begin processing data sources: convert the 200 PDF
legal documents into text indexes (ensure tables and sections are correctly
segmented), connect to the SQL databases and verify we can query them, and gather
sample call record files to design how they’ll be parsed. If needed, develop scripts to
ingest/index this data. Also, choose the AI model and set it up: for example,
download and install the selected <40B parameter language model on the server (such
as Falcon-40B or a similar model). Ensure the model runs in a basic Q&A scenario
with sample data.
Milestone (End of May): Design & Setup Complete – We expect to have: finalized
UI designs, a working development environment, initial data indexed, and the
baseline AI model running locally. A prototype of the Q&A engine with a small
subset of data will be demonstrated (e.g., ask a sample question to the model with a
few legal documents). Stakeholder sign-off on designs and a refined project plan for
build phase will be obtained.
Weeks 5-6 (Early June): Sprint 1 – Backend & AI Integration: Focus on building
the backend services. This includes developing the NLP query pipeline: language
detection (to route multi-language queries appropriately), text pre-processing (e.g.,
transliteration if needed for Sindhi/Pashto), and hooking up a retrieval-augmented
generation approach. For instance, implement a module to fetch relevant text
passages from the legal documents or databases based on the query (to feed into the
AI model for context). Develop the initial API that the mobile app will call: an
endpoint like /query that accepts a question and returns an answer. Also, integrate
the MS SQL databases – e.g., write SQL queries or use an ORM to retrieve data (like
searching persons by name or CNIC). By end of Sprint 1, a rudimentary query on
each data source should work (tested via a backend stub or Postman).
Weeks 7-8: Sprint 2 – Mobile App (Low-Code) & UI Implementation: Using a
low-code development tool, start building the mobile application according to the
Figma design. For example, using Microsoft Power Apps or a similar tool, create
screens for login (with biometric authentication if needed), query input (text field or
voice input), and results display (list of answers or a detailed answer view). Leverage
the low-code tool’s ability to import the Figma design for accurate UI (Figma
Integration | Thunkable Docs). Implement the logic to call the backend API. This
sprint will also produce the basic web admin panel (e.g., an internal dashboard to
view logs of queries or manage user access). Concurrently, refine the AI model
responses – adjust the prompt format for the model to ensure it cites section
references or highlights the source of information, as needed. Internal testing of the
mobile app on a few devices will begin.
Weeks 9-10: Sprint 3 – Feature Completion: Add remaining features and polish.
This includes handling edge cases in queries (e.g., if the question is unclear, the
system should ask for clarification or provide multiple options), implementing
multilingual response generation (the answer should ideally come back in the same
language the question was asked, where applicable – e.g., question in Urdu yields
answer in Urdu), and optimizing performance (caching frequent query results, etc.).
Security features are also handled now: ensure authentication is in place for the app
and web panel (likely integrate with Sindh Police’s existing authentication system or a
simple secure login with role-based access). Finish any features on the admin panel
(like user management, usage statistics). By the end of June, the system should be
feature-complete in development, meaning all core user stories are implemented in
some form.
Weeks 11-12 (Early July): Testing (Sprint 4): Conduct thorough QA testing. A
dedicated QA engineer will perform test cases for each user story: Does a query in
each supported language return a correct answer? Are legal citations accurate? Is the
mobile UI working on various devices (Android/iOS)? We will also do load testing to
validate concurrency: simulate 100 users querying simultaneously and measure
response times, refining the system (maybe adjusting thread pools or caching strategy)
to ensure acceptable performance. Any bugs found are logged as issues and fixed in
this sprint. We’ll also get a small group of actual police officers (beta users) to pilot
the app in a controlled environment and provide feedback on the accuracy and
usefulness of answers.
Weeks 13- 14: Deployment & User Training: Prepare for production deployment on
the on-prem server. This includes hardening the system (ensuring it runs as a service,
auto-start on server reboot, backup procedures for data indexes, etc.). Deploy the
mobile app to devices – if not going through an app store, we’ll distribute it via MDM
(Mobile Device Management) or direct APK/IPA for iOS via enterprise signing. In
parallel, create training materials (see section 9) and conduct training sessions. For
instance, organize a workshop by mid-July for 20 key officers (who will then train
others). This is also the time to finalize documentation (user guide, admin guide – see
section 10).
Milestone (End of July): Go-Live – By July 31, 2025, POLARIS v1.0 will be live for
Sindh Police. The final deliverable includes the running system, trained users, and all
documentation. We will schedule a project closure meeting with stakeholders to
review that objectives were met and outline any post-deployment support or future
enhancements (e.g., proposals for phase 2 if needed).
Note: The project follows agile principles, so while the above timeline is structured by
phases, we will remain flexible. If certain tasks take longer, we’ll adjust scope or schedule
within the 3-month window by reprioritizing features (ensuring core functionality is delivered
first). Regular progress demos will be done at the end of each sprint (bi-weekly) to keep
stakeholders in the loop.
Single Scrum Team: All development, AI integration, and design activities will be
handled by one Scrum team comprising 7 members (developers/designers/testers) and
one Product Owner (from the police side, likely an officer or IT lead). This ensures
everyone works off a single prioritized backlog, promoting focus and fast iteration.
There will also be a Scrum Master to facilitate the process.
Roles and Responsibilities: Each team member has defined roles (detailed in section
7), but the team works collaboratively. The Product Owner (PO) will be a
representative of the Sindh Police (or Police IT) who deeply understands the
requirements and will prioritize the backlog. The Scrum Master (could be the project
manager or tech lead) will run scrum ceremonies and remove impediments.
Developers (AI engineers, app developers, etc.) will self-organize to pick up user
stories each sprint. Everyone will participate in design, development, and testing as
needed to meet sprint goals.
Ceremonies & Cadence: We plan 2-week sprint cycles (approximately six sprints in
3 months).
o Sprint Planning: At the start of each sprint, the team and PO will select high-
priority user stories from the backlog to implement. Acceptance criteria for
each story will be reviewed (e.g., “Given an officer asks a question in Sindhi,
the system returns an answer in Sindhi with relevant law references” – the
criteria define what success looks like).
o Daily Stand-ups: Every day the team will have a brief stand-up meeting (15
minutes) to synchronize – each member shares what they did yesterday, plans
for today, and any blockers. For example, the ML engineer might say
“Blocking on getting Pashto text to display correctly” and the Scrum Master
will note to resolve that blocker (maybe by consulting a language expert).
o Sprint Review: At the end of the 2-week sprint, we’ll do a live demo of
completed features to stakeholders (including police officials). This
demonstrates progress (like showing the mobile app can answer a query end-
to-end by Sprint 3) and gathers feedback early. The Product Owner will
formally accept completed stories if they meet criteria.
o Sprint Retrospective: After each sprint, the team will meet to reflect on what
went well and what can be improved. For instance, if in Sprint 1 we found
integration with Excel CDRs took longer than expected, we discuss how to
better estimate or if we need a different approach. Continuous improvement
will be a focus given the tight timeline.
Backlog Management: All features will be tracked as user stories in a product
backlog, likely using an issue tracking tool (such as Jira, Trello or Azure DevOps
Boards). Each story will be linked to an epic (higher-level feature grouping) and have
tasks as needed. The Product Owner will prioritize this backlog continuously,
ensuring the most critical features (natural language query handling, data integration)
are developed first. Less critical enhancements (nice-to-have features or cosmetic
improvements) will be scheduled later or dropped if time doesn’t permit.
Communication and Reporting: The Scrum Master/Project Manager will provide
weekly status reports to stakeholders summarizing progress, upcoming work, and any
risks. Because the team is small and co-located (assumed at the Police IT office),
communication will be mostly via daily face-to-face/online meetings and a team chat
(e.g., Microsoft Teams or Slack) for quick questions. The Product Owner is
encouraged to be available for queries or quick decisions throughout the sprints (this
ensures we maintain momentum and alignment with police needs).
Risk Management: Using Scrum allows early visibility into challenges – if by Sprint
2 we see the chosen AI model is too slow, we can pivot (e.g., fine-tune a smaller
model). The iterative approach means the project can adjust within the 3-month
window rather than discovering issues at the end. The Scrum Master will maintain a
risk log and ensure mitigation actions (for example, have a fallback plan if certain
data isn’t ready in time, etc.).
Definition of Done: The team will agree on what “Done” means for work items –
typically a user story is done when it is coded, tested (unit/integration tested),
integrated into a build, and accepted by PO. Additionally, for this project, “done”
might include that the feature is documented and that the AI responses are evaluated
for accuracy by domain experts. This clarity ensures quality is maintained.
By following this Scrum management plan, the project remains adaptive and transparent. The
police stakeholders will see continuous progress, and the development team can handle the
complex AI integration in incremental steps, reducing project risk and delivering value within
the short timeframe.
Each of these skill sets will be represented in the team (one person may cover multiple areas).
For instance, one team member might be both the data engineer and backend developer, or
the Scrum Master might also handle DevOps tasks. The combination of these skills ensures
the team can cover all aspects: from designing a friendly UI to wrangling data and building a
robust AI backend.
In practice, some roles might be combined if one person has multiple skills (for example, the
Backend Developer and Data Engineer might be the same individual, and the Scrum Master
could be the Backend lead as well). But we list them distinctly to ensure all responsibilities
are accounted for.
This 7+1 structure is optimal: it covers the full stack (UI, backend, AI, data, testing) without
redundancy. Each member has a clear focus area, and together under the Scrum framework,
they can deliver the complex system in a short time. The Product Owner (not counted among
the 7 dev team members) is crucial for guiding priorities and providing domain input,
ensuring the team builds the right product.
Data Digitization & Cleanup: Preparing the content of the ~200 PDF documents
could be outsourced. For example, if these PDFs are scans or need formatting, a
contractor could convert them to clean text or XML. They could also create a tagged
dataset of sections/clauses for easier indexing. This frees the team from manual data
entry or cleanup work.
Transcription/Translation Services: If any of the data (like portions of legal text or
user interface strings) need translation between English, Urdu, Sindhi, Pashto, we
might outsource to professional translators. Also, creating a consistent terminology
glossary (like ensuring the legal terms are uniform across languages) could be done
externally. Similarly, if voice input is to be supported, training data for speech
recognition in these languages could be outsourced to specialists (like recording
utterances).
UI Graphic Design Assets: While the UI/UX designer will handle overall design,
specific graphic assets might be outsourced. For instance, designing a logo for the app
or custom icons/illustrations (perhaps an icon for the POLARIS app, or tutorial
infographics) could be given to a freelance graphic designer. This ensures high-
quality visuals and lets the UI/UX designer focus on the user flow.
Video Tutorial Production: Creating the training video content (outlined in section
9) can be outsourced to a multimedia production team. For example, after we script
the usage scenarios, a video producer can film or animate the scenario of a police
officer using the app, add voiceover in Urdu/Sindhi, and edit it professionally. This
would result in a polished training video. Outsourcing here is practical as it requires
video editing expertise and possibly on-site filming which a dedicated team can
handle better.
User Testing (Beta Testing): We can engage an external group of testers for a short
period to do usability testing. This could be a small firm or group that specializes in
testing apps with end-users. They can simulate being police officers (or ideally actual
off-duty officers as contractors) trying the app and provide unbiased feedback on user
experience, which can then be fed into improvements. This is semi-outsourcing as it’s
orchestrated by us but executed by external participants.
Content Writing (Documentation): Preparing user manuals, help center FAQs, or
technical documentation could be partially outsourced to technical writers. For
instance, once the team provides the raw info, a technical writing consultant can
format it into a professional handbook in English, and even translate it to Urdu for
officer training material. This ensures clear, user-friendly documentation (and saves
developer time).
Infrastructure Setup/Maintenance: Although core deployment is in-scope, some
organizations outsource parts of IT setup. We might outsource the server room
setup or network configuration (setting up the server hardware, backups, failover) to
an IT services company. This ensures the Lenovo server and network are properly
configured for production use (power, cooling, rack installation, etc.), while the dev
team focuses on software.
Minor Coding Tasks or Integrations: If during development we identify a very
specific task that’s time-consuming but well-defined (e.g., writing a script to convert
all old case files from one format to another), we could outsource that small
development task to a contractor with clear specifications. The contractor’s output
would then be reviewed and integrated by our team.
Each outsourced task will be clearly defined with expected outputs and timelines to avoid
delays. By outsourcing these non-core tasks, the main team can maintain velocity on the
primary development and integration work. It’s important to manage the outsourced tasks
closely (perhaps assign a team member to liaise) to ensure quality and that they align with
project needs.
By providing engaging video content and written guides, we cater to different learning
preferences. The overall goal is that every intended user knows when and how to use
POLARIS as part of their routine. Effective training will drive adoption, ensuring the
system’s benefits (speed, information access) are fully realized in day-to-day policing.
All documentation will be delivered in standard formats (PDF/DOCX for manuals, HTML
for generated tech docs, etc.) and organized clearly (we might deliver a zip file or a
knowledge base link with categorized docs). The strategy ensures that six months or a year
later, any new team or existing IT personnel can refer to documentation to understand how
POLARIS works, how to use it, and how to troubleshoot or extend it. This reduces long-term
dependency on the original developers and contributes to the sustainability of the project.
Epics: Epics are large feature areas or goals, each encompassing multiple user stories.
We have identified epics that align with the major functional areas of POLARIS. For
example:
o Epic 1: Multilingual Query Handling – covers everything related to
understanding questions in Urdu/Sindhi/Pashto/English and responding
appropriately.
o Epic 2: Legal Documents QA – covers the ability to ask questions about legal
acts/sections and retrieve answers from the 200 PDFs.
o Epic 3: CDR Analysis – covers queries on the call detail records (Excel
integration and query logic).
o Epic 4: Police Records Search – covers querying the SQL databases (persons,
cases, etc. and any relational queries between them).
o Epic 5: Mobile App Interface – covers building the front-end application
features (UI screens, login, query input UX, results display).
o Epic 6: Admin & Security – covers user management, auditing, admin panel
features, and security aspects like authentication and permissions.
o Epic 7: Performance & Concurrency – covers non-functional requirements
like handling 100 users, optimizing speed, etc. Each epic will be tracked in our
project tool, and we’ll periodically review epic progress with stakeholders
(e.g., by end of June, Epics 1-5 should be completed for a releasable product).
User Stories: Under each epic, we will write user stories that follow the classic
format “As a [type of user], I want [some goal] so that [reason].” These stories
capture specific functionalities from the end-user perspective. For instance:
o Under Multilingual Query Handling: “As a police officer, I want to ask a
question in Sindhi and receive the answer in Sindhi, so I can use the app in my
preferred language.”
o Under Legal Documents QA: “As an investigator, I want to ask what law
applies to a scenario (e.g., an unlicensed weapon) and get the relevant
Act/Section and punishment details.” (This would involve parsing the law
documents).
o Under CDR Analysis: “As an officer, I want to input a phone number and a
date range to see if that number appears in the call records of a case.”
o Under Records Search: “As an investigator, I want to know if a suspect has
prior arrests by querying their name or ID, so that I’m aware of their history
on the spot.”
o Under Mobile App: “As a user, I want to login with my police ID and
biometric verification, so that the app is secure and only I can access it.”;
another could be “As a user, I want to view a list of recent queries I made, so I
can tap and re-run a previous query.”
o Under Admin & Security: “As an admin, I want to see a log of queries made
(time, user, query) so that I can audit usage for any misuse.” We will ensure
each story has clear acceptance criteria. For example, acceptance criteria for
the Sindhi query story might be: (1) User sets language to Sindhi, asks a
question in Sindhi script. (2) System returns an answer in Sindhi that correctly
addresses the question. (3) If the answer is sourced from law text, it appears in
Sindhi if available or transliterated if not.
Issues/Tasks: Each user story will be broken down into tasks (issues) that developers
can pick up. Tasks are the actionable to-dos needed to fulfill the story. For example,
for the story “ask a question in Sindhi and get answer in Sindhi,” tasks might include:
By using this structured workflow, we maintain clarity and focus. The user stories keep us
user-centric, epics ensure we see the big picture, and tasks break down the complexity for
execution. This approach also makes progress measurable (we can count done stories,
burndown tasks, etc.) and ensures by project’s end we have implemented all required
functionality methodically.
A Person can be associated with one or many Cases (as accused or victim, etc.). A
Case can have multiple Persons. We will mirror these links from the police DB.
A Case can have many Call Records (if we tie CDRs to case ID).
An Officer (user) can perform many Query interactions.
Law Sections are standalone but might reference each other (e.g., a compound section
or schedule). We might not explicitly model that relation unless needed (not likely
needed for the queries, as long as text search works).
If modeling, Case could link to law sections for charges applied, but in practice, that
might be in narrative or charge sheets, which we aren’t explicitly loading now.
Station would link to Cases (one-to-many).
Vehicle could link to Person (owner) and possibly Case (if a stolen vehicle case).
Officer could link to Case as Investigating Officer, but that’s not likely needed for
queries (the queries from an officer likely aren’t about who the IO is; they’d know
that).
By identifying these entities, we ensure our system design (both database indexing and the
AI’s knowledge representation) covers them. The AI model itself will not explicitly know an
“entity” in database terms, but through the retrieval system, when a question is asked, we
essentially fetch instances of these entities to feed into the answer. For example, a query
“What is John Doe’s criminal history?” will retrieve Person entity for John Doe and related
Case entities, then the AI will compose an answer.
We will maintain an entity mapping document that maps fields from the existing databases
and documents to these conceptual entities. This way, when parsing data or building the
knowledge base, developers know where each piece fits.
Clear entity identification also helps in the testing phase – we can ensure we have test
queries that cover each entity type (laws, cases, persons, calls, etc.) to verify all are handled
correctly by the system.
In summary, yes – with the planned optimizations, POLARIS is expected to handle 100
concurrent users. The limit is primarily the AI processing; with our approach (and given that
not all 100 will hammer at once usually), we are within safe operating bounds. Should usage
intensity exceed expectations, we will advise on scaling strategies, but for the project
requirement of 100 concurrent users, the solution is valid and tested to perform.
Cost
Item / Software Type Estimate Notes
(2 Years)
Lenovo ThinkSystem Hardware (One- $25,000 High-end server with 64-core
Cost
Item / Software Type Estimate Notes
(2 Years)
CPUs, 1TB RAM, 4x1.9TB
HDD. This is a one-time capital
SR650 V3 Server time) (approx.)
cost. (Includes standard 3-year
warranty).
License for server OS (if
required for MS SQL and
general environment). This
Windows Server 2022
License (Perpetual) ~$1,200 covers the physical server.
(Datacenter Edition)
(Sindh Police may have existing
licenses via an agreement,
otherwise a purchase.)
We assume the police
department already has SQL
Server licensed for their
databases. If not, a 64-core
Microsoft SQL Server License (Perpetual / Assumed
enterprise license could be
2019/2022 Core-based) existing
significant (>$100k). In this
plan, we consider it an existing
resource, hence not budgeting
anew.
We will use an open-source large
language model (Falcon-40B or
similar) which is **Apache 2.0
AI Model Software(e.g., Open-Source licensed (free for commercial
$0
Falcon-40B) License use) ([Welcoming Mistral, Phi,
Jais, Code Llama, NVIDIA
Nemotron, and more to the
Azure AI Model Catalog
Assuming we use Power Apps:
Low-Code Mobile App priced at ~$20/user/month for
Subscription (Cloud $48,000for
Platform (e.g., Microsoft unlimited apps ([Microsoft
or On-prem) 2 years
Power Apps) Power Apps Pricing & Benefits
Explained
Figma Professional plan at about
$12/editor/month (annual billing)
(Figma Pricing: Which Pricing
Plan to Choose? [2025]). For
e.g., 3 team members (UI
~$864 for 2
Figma (Design Tool) Subscription designer + 2 others
years
collaborating) = $36/month,
~$432/year. Two-year cost
~$864. (If using free plan, this
could be $0, but Professional
gives better collaboration.)
Domain SSL Subscription ~$100 If the server API is exposed over
Certificate (for secure (annual) HTTPS (VPN or intranet), a SSL
Cost
Item / Software Type Estimate Notes
(2 Years)
certificate might be needed
(could use a government CA or
Let’s Encrypt free). We allocate
connections)
~$50/year if a commercial
certificate is used, so ~$100 for 2
years.
We will use open-source or
already-owned tools: e.g., VS
Development Code (free), Android Studio
Licenses $0
Tools(IDEs, etc.) (free) for any custom work,
Postman (free), etc. No
additional cost.
We plan to use open-source
solutions for monitoring (e.g.,
Monitoring/Logging
Open-Source $0 ELK stack for logs or Windows
Software
Performance Monitor) if needed.
No license cost.
Cost to produce professional
training videos by a third-party.
Assuming two short videos,
Video Tutorial including shooting/editing,
~$2,000
Production(outsourced Service Cost roughly $1000 each. This is a
(one-time)
one-time) one-time cost (not a software
license, but included as it is
project-related cost within 2
years, mainly upfront).
All programming libraries and
frameworks (Python libraries for
NLP, .NET runtime if used, etc.)
Misc. Software(libraries,
Open-Source $0 are free. For instance, Apache
etc.)
PDFBox for parsing PDFs,
PyTorch for AI, etc., are open
source.
The above does not include
human support contracts. If an
Maintenance & external support or cloud backup
Not
Support (Optional Service/Subcription service were needed, that would
included
extension) be extra. For now, assume the
police IT team handles
maintenance.
Total Estimated 2-Year Cost: ~$74,000 (plus any assumed existing licenses)
The bulk of this cost is the low-code app platform subscription (Power Apps), which at
around $24k/year is 64.8% of the two-year costs listed. If budget is a concern, one might
consider alternatives, like an open-source low-code platform (which could be self-hosted) –
that could reduce that cost, but might increase development effort. We balanced ease of
development and cost with Power Apps in this plan.
The hardware is a one-time expense. After 2 years, it remains an asset (with presumably
continued use). The software costs like Power Apps and Figma are recurring. After 2 years,
the organization would need to renew those to continue usage (or consider transitioning off if
they ever rebuild the app natively).
We’ve opted for open-source AI and tools where possible to avoid hidden costs. For example,
if we had chosen a proprietary AI solution or cloud AI service, there could be usage fees. By
using an on-prem open model, we incur no per-query charges – a significant cost saving
given potentially heavy usage.
We will deliver a detailed breakdown with actual quotes if available by project start. All
software used will be properly licensed (no unlicensed or pirated software). Open-source
licenses (Apache 2.0, MIT, etc.) will be complied with (we will attribute any open-source in
documentation as required, though many like Apache 2.0 don’t require attribution in
deployment).
This cost table should help stakeholders budget for deployment and the first two years of
operation of POLARIS. After two years, the ongoing costs would primarily be the Power
Apps (or chosen platform) subscription and any support. Hardware might have maintenance
cost (if extended warranty or support contract after year 3, which could be a few thousand per
year for enterprise support). These can be decided based on usage and importance at that
time.
Mobile App UI: The main interface of the mobile app consists of:
A Home Dashboard with a search bar or voice mic icon prominently placed so the
officer can immediately start a query. There will be quick-access buttons for common
functions (like scanning a CNIC card QR or asking for help). The dashboard might
also show recent queries or a tip of the day.
The Query Input Screen – essentially just the home screen doubling as query input.
We’ll support both text input (keyboard, supporting Urdu/Sindhi script input) and
possibly a tap-to-speak (microphone icon for voice input). The design ensures that if
an officer switches the app’s language, the interface labels also switch appropriately
(we’ll design UI variations for English vs Urdu text – considering right-to-left layout
for Urdu/Sindhi).
Results Screen – after a query, the results will be shown in an easily readable format.
For narrative answers (like an explanation of a law or a summary of a person’s
record), it will appear in a chat bubble or card. Key info might be bold or colored. If
the answer contains multiple parts (like several related records), we might show them
as a list of cards the officer can tap for more details. For example, if the question was
about a person, the top of the answer might summarize “X has 2 prior cases” and
below that two collapsible sections for those cases.
Navigation & Menu: A simple tab bar or menu drawer will allow access to other
sections: e.g., “Home (Ask)” tab, “History” tab (to see past queries asked), and
“Profile/Settings” tab (for language setting, logout, etc.). The design will keep the
number of screens minimal – likely 3-4 main screens – to reduce complexity. We
want an officer to accomplish everything (ask question, view answer, maybe refine
question) without jumping through many screens.
Design Style: We are following a flat, modern design with clear typography. Likely using a
font like Noto Nastaliq Urdu or Jameel Noori for Urdu/Sindhi for readability, and a sans-serif
like Segoe or Roboto for English – ensuring both look harmonious. Icons will be simple line
icons or filled icons (for instance, a microphone icon for voice, a book icon for law reference,
a phone icon for CDR query). The color scheme: a deep blue (#0B3D91 for example) for
headers, a bright red (#D32F2F) for accent or important buttons, and white/light grey
backgrounds for content areas. This provides good contrast, crucial for outdoor use in
sunlight.
We are also incorporating police theming elements subtly – e.g., perhaps a watermark of the
Sindh Police emblem in the background of the home screen, or using the pattern of a police
uniform (dark blue) as a background shade. These are done tastefully to maintain a
professional look (no clutter).
Figma Prototypes: Using Figma, we are building interactive prototypes. This allows us to
simulate the app flow – for instance, clicking the “Ask” button in the prototype will navigate
to a mock results screen. We’ve shared these prototypes with some officer representatives to
gather feedback early. Because Figma supports collaboration, our stakeholders can comment
directly on the designs. Once finalized, these designs will serve as the blueprint for
development. Furthermore, since we plan to use low-code tools, we will leverage Figma’s
design export features; some platforms (like FlutterFlow or Thunkable) allow importing
Figma frames (Figma Integration | Thunkable Docs), accelerating UI development.
Admin Panel UI: The admin web/desktop interface will follow a similar color theme but
optimized for a larger screen. We are designing it as a responsive web page that could be
used on a desktop browser. It will have a sidebar with options (User Management, Query
Logs, Data Sync, etc.). Tables and charts might be used for things like usage statistics. We
keep it simple and functional, since admins are usually IT personnel. We plan to implement
the admin panel using the low-code tool’s web capabilities or something like a React
template integrated with the backend.
Design Review with Stakeholders: We’ll conduct a formal UX review with some end-users
(perhaps a small group of officers) by showing them the Figma prototype. Their feedback
(e.g., “make text bigger”, “change term X to local phrasing Y”) will be quickly iterated in
Figma. This all happens in the first month so by implementation time, we are confident in the
design.
By using Figma and focusing on UX early, we reduce changes needed later and ensure the
app is user-friendly. The look-and-feel will be modern (on par with consumer apps, not an
outdated enterprise UI), so that officers feel comfortable and maybe even excited to use it.
The design also subtly enforces the idea of a high-tech policing tool, hopefully instilling
confidence in users that this is a serious, reliable system.
Scene: A busy street in Karachi, late evening. A police officer in uniform (we’ll call him
Officer Ali) is on patrol and stops a suspicious individual during a routine check. The
individual provides an ID card with a name.
Officer Ali takes out his smartphone and launches the POLARIS mobile app. The
video shows the app’s home screen (matching our design) with the query bar.
Use Case 1 – Person Check: Officer Ali taps the microphone icon and speaks in
Urdu: “ “( ”اس بندے کا کریمنل ریکارڈ دکھائیںShow me this person’s criminal record”).
The app’s AI listens and the query text appears on screen (speech-to-text in Urdu
script). Within a couple of seconds, POLARIS displays results: The video zooms into
the phone screen which shows the person’s profile (from the Person entity) – e.g.,
“Name: Ahmed Khan; Prior Arrests: 2; Wanted: Yes (in Case 456/2022)”. Officer
Ali’s eyebrows raise as he sees the person is wanted in an ongoing robbery case.
The video overlays a quick cut to the POLARIS interface highlighting “Wanted in
Case 456/2022” in red. Officer Ali confidently tells the suspect he’s under detention,
and maybe calls for backup. This demonstrates how the app immediately provided
critical safety info.
Use Case 2 – Legal Query: Next, the video shows Officer Ali later writing his report
or FIR at the station. He wonders under what section he should book the suspect for
possessing an illegal weapon found on him. He types in Sindhi this time: “ غير قانوني
( ”پسٹل لاء سيڪشنSindhi for “illegal pistol law section”). POLARIS quickly
searches the law documents. The video shows the screen with the answer: “Section
23(A) of Sindh Arms Act 2013 – Unlicensed firearm possession – Punishable with up
to 7 years imprisonment (Sindh police launch Talaash app with record of 1.5m
criminals).” We see the app even shows a snippet of the Act text for confirmation.
Officer Ali nods and notes that section in his report. This part of the video emphasizes
inference – he asked in Sindhi language, and the app inferred the relevant law section
correctly.
Use Case 3 – CDR Analysis: The video scenario can include another officer, maybe
Officer Zahra, using POLARIS in an investigation room. She has two suspects who
claim they don’t know each other. Officer Zahra types a question in English: “Calls
between Suspect A’s number and Suspect B’s number last month?” The app
processes the Excel call records attached to the case. The video shows the screen
listing 3 call entries where one suspect’s phone did call the other’s phone on specific
dates and times, refuting their claim. Officer Zahra turns the phone towards the
suspects and says, “Care to explain these calls?”, demonstrating the app’s utility in
interrogation by quickly revealing connections from data.
User Interface showcase: The concept video would intersperse these realistic
scenarios with close-ups of the app UI to show how the officer interacts with it. For
example, a shot of Officer Ali’s thumb tapping the “search” button, or selecting a
suggestion that the AI offers (maybe the AI can show a clarification, e.g., “Did you
mean Ahmed Khan born 1980 or 1990?” if multiple records, and he taps the correct
one).
Environment & Emotion: We portray the officers as confident and efficient, with
the app as their empowering tool. Perhaps a voiceover narrator (in Urdu) explains:
“With POLARIS, Sindh Police officers can get information within seconds. Just ask
naturally, and get answers – anytime, anywhere.” We see Officer Ali under the
flashing red-blue lights of his police mobile van, getting answers on his phone –
reinforcing that this works in the field under real conditions (night time, etc.).
Conclusion of Video: The final scene could show Officer Ali and Zahra back at
headquarters, and they share how POLARIS helped them crack the case. The camera
then focuses on the phone showing the POLARIS logo and tagline “Empowering
Sindh Police with AI.” The video might end with a motto on screen: “Fast. Intelligent.
On-Demand – POLARIS, your AI partner in the field.”
This concept video script covers multiple features: person lookup, law reference, link analysis
through calls, multi-language queries, voice and text input. It demonstrates the value
proposition in an emotional and practical way – officers solving crimes faster and more
safely.
From a project perspective, such a video (even if initially conceptual) can be used to train
officers and get buy-in from leadership. They can literally see how the app fits into daily
police work. It also helps our development team ensure the features we implement truly align
with the on-ground usage.
Visual Style of Video: We would shoot or animate it to look realistic. Possibly using actual
police actors or real officers, with permission, to add authenticity. Key screens of the app
would be superimposed clearly so viewers see what the officer sees. We’d highlight key text
on the phone screen with graphics if needed so the audience doesn’t miss the critical info.
By demonstrating a Pakistani police officer using the app in context – speaking in local
language, dealing with a suspect, referencing Pakistani law – the concept becomes tangible. It
highlights how technology and AI (which might seem abstract) directly contribute to
effective policing. This also serves as a promotional piece internally to encourage adoption
(officers seeing their peers use it successfully) and to justify the project’s value to higher
officials.
Overall, this report has outlined a comprehensive plan for POLARIS – from its vision and
scope down to technical details and user experience, backed by research and aligned with the
practical demands of Sindh Police. With a clear timeline, skilled team, and the right use of AI
technology, POLARIS is poised to become a game-changer in how police officers access
information and intelligence, making law enforcement more efficient and informed in Sindh.
Clearly define Key Performance Indicators (KPIs) for evaluating the success of the
system:
o Response Time: Target average query response within 3-5 seconds.
o Accuracy: Aim for at least 95% accurate responses based on user feedback.
o Adoption Rate: Target at least 90% active usage by trained police officers
within 3 months post-launch.
3. Risk Management
6. Continuous AI Training
Suggest periodic retraining and fine-tuning of the AI model using new data gathered
from actual user queries, thus continuously improving response accuracy and
relevance.
Clearly document how the system complies with Pakistan’s data privacy laws and
regulations, particularly related to sensitive police data.
By addressing these areas, your framework will be even stronger, more practical, and ready
for effective deployment and future growth.